granite-3.2-8b-instruct-bnb-4bit

Maintained By
unsloth

Granite-3.2-8B-Instruct

PropertyValue
Parameter Count8 Billion
LicenseApache 2.0
Release DateFebruary 26th, 2025
DeveloperIBM Granite Team
Model URLhttps://huggingface.co/unsloth/granite-3.2-8b-instruct-bnb-4bit

What is granite-3.2-8b-instruct-bnb-4bit?

Granite-3.2-8B-Instruct is an advanced language model that builds upon its predecessor, featuring enhanced reasoning capabilities and multi-lingual support. This quantized version (4-bit) offers improved efficiency while maintaining performance across various tasks. The model demonstrates significant improvements in benchmark scores, particularly in ArenaHard (55.25) and Alpaca-Eval-2 (61.19), showing substantial gains over its previous version.

Implementation Details

The model is trained on IBM's Blue Vela supercomputing cluster using NVIDIA H100 GPUs, combining permissively licensed open-source datasets with synthetic data specifically designed for reasoning tasks. It implements a unique controllable thinking capability that can be toggled based on task requirements.

  • 4-bit quantization for efficient deployment
  • Supports 12 languages including English, German, Spanish, and more
  • Controllable thinking capability via API parameter
  • Long-context support for document processing

Core Capabilities

  • Advanced reasoning and thinking tasks
  • Text summarization and classification
  • Question-answering and information extraction
  • Retrieval Augmented Generation (RAG)
  • Code-related tasks and function calling
  • Multi-lingual dialogue processing
  • Long document analysis and summarization

Frequently Asked Questions

Q: What makes this model unique?

The model's distinguishing feature is its controllable thinking capability and significant performance improvements in reasoning tasks, demonstrated by its superior benchmark scores compared to previous versions. The 4-bit quantization makes it more efficient while maintaining high performance.

Q: What are the recommended use cases?

The model excels in business applications requiring complex reasoning, document processing, and multi-lingual support. It's particularly suitable for AI assistants, document analysis, code-related tasks, and scenarios requiring detailed analytical thinking.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.