IBM Updates Granite Models
Written by Kay Ewbank   
Monday, 28 October 2024

IBM has released new Granite models that it says provide state-of-the-art performance relative to model size. The Granite 3.0 collection includes a new, instruction-tuned, dense decoder-only LLM.

The Granite series is a collection of generative AI models that apply generative AI to the modalities of language and code. The models come in different sizes and are built on a decoder-only architecture. They are trained on business-relevant domains, including financial, legal, IT, coding, and academic domains. The model type is designed to be used for retrieval augmented generation (RAG) for searching knowledge bases to generate tailored responses to customer inquiries, or to condense content into short descriptions.

watsonlogo

IBM says the new Granite 3.0 8B Instruct has been trained using a novel two-phase method on over 12 trillion tokens of carefully vetted data across 12 different natural languages and 116 different programming languages. IBM says that fine-tuning smaller, fit-for-purpose models like Granite provides enterprises with a way to get frontier model performance at a fraction of the cost. Key to this is the ability to tailor Granite models to an organization's needs through InstructLab.

InstructLab is an AI project developed by IBM and Red Hat that can be used by developers to enhance Large Language Models (LLMs) for specific business needs. It is open source and provides systematically generated synthetic data and phased-training protocols.

All Granite models are released under the permissive Apache 2.0 license, and IBM is providing a detailed disclosure of training data sets and methodologies in the Granite 3.0 technical paper. 

IBM has also placed emphasis on model safety in this release, saying. Granite 3.0 8B Instruct demonstrates industry-leading robustness on the AttaQ benchmark, which measures an LLM's vulnerability to adversarial prompts designed to provoke models into generating harmful, inappropriate or otherwise undesirable prompts.

The updated collection of Granite including recipes and how-to guides is available on Github, and developers can also experiment with the new Granite 3.0 8B Instruct model on the IBM Granite Playground.

Granite 3.0 models are now available on IBM watsonx.ai through platform partners such as Google Vertex AI (through Google Cloud's Vertex AI Model Garden integrations with Hugging Face), Hugging Face, NVIDIA (as NIM microservices), Ollama and Replicate.

watsonlogo

More Information

Watsonx Website

IBM Granite Playground

Related Articles

IBM Launches The Granite Code LLM Series

IBM Releases Watsonx Granite Models

IBM Announces WatsonX AI Platform

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

Banner


Gender Differences In Coding Style
13/11/2024

A novel investigation into the gender gap between men and women regarding coding ability was undertaken by Dr Siân Brooke. Her conclusion? There is a difference in the Python code [ ... ]



C23 ISO Standard Is Here But You Probably Won't Read It
06/11/2024

At last ISO C23 has been published, but at $250 you probably aren't going to read it. Can we really tolerate this sort of profiteering on the work of others? This is worse than academic publishing!


More News

espbook

 

Comments




or email your comment to: comments@i-programmer.info