granite-3.2-2b-instruct

ibm-granite

IBM's 2B parameter instruction-tuned LLM with enhanced reasoning capabilities, supporting 12 languages and specialized for controllable thinking tasks.

Property	Value
Parameter Count	2 Billion
Release Date	February 26th, 2025
License	Apache 2.0
Developer	IBM Granite Team
Model URL	huggingface.co/ibm-granite/granite-3.2-2b-instruct

What is granite-3.2-2b-instruct?

Granite-3.2-2B-Instruct is IBM's latest 2-billion parameter language model, specifically designed for enhanced reasoning and instruction-following capabilities. Built upon its predecessor Granite-3.1-2B-Instruct, this model introduces controllable thinking abilities and supports 12 languages, making it versatile for various business applications and AI assistance tasks.

Implementation Details

The model is trained using IBM's Blue Vela supercomputing cluster with NVIDIA H100 GPUs, utilizing a combination of permissively licensed open-source datasets and internally generated synthetic data. It implements a unique controllable thinking mechanism that can be toggled based on task requirements, offering flexibility in reasoning depth.

Supports multiple languages including English, German, Spanish, French, Japanese, and others
Implements controllable thinking through a 'thinking' parameter in the generation process
Trained on specialized synthetic data for enhanced reasoning capabilities
Optimized for long-context tasks and document processing

Core Capabilities

Advanced thinking and reasoning tasks
Text summarization and classification
Information extraction and question-answering
Retrieval Augmented Generation (RAG)
Code-related tasks and function calling
Long-context document processing
Multilingual dialogue support

Frequently Asked Questions

Q: What makes this model unique?

The model's distinguishing feature is its controllable thinking capability, allowing users to activate advanced reasoning when needed while maintaining efficiency for simpler tasks. It also shows impressive performance on benchmark tests, particularly in ArenaHard and Alpaca-Eval-2 compared to its predecessor.

Q: What are the recommended use cases?

The model excels in business applications requiring reasoning, document processing, and multilingual support. It's particularly suited for tasks like document summarization, complex question-answering, and code-related applications, with benchmark scores showing strong performance in human evaluation and truthfulness tests.