# Neural Chat 7B v3-2
| Property | Value |
|---|---|
| Model Size | 7B parameters |
| Context Length | 8192 tokens |
| License | Apache 2.0 |
| Paper | Research Paper |
| Base Model | Mistral-7B-v0.1 |
## What is neural-chat-7b-v3-2?
Neural Chat 7B v3-2 is Intel's fine-tuned language model, trained on the MetaMathQA dataset using Intel Gaudi 2 processors. It is an evolution of the neural-chat series, built on the Mistral-7B foundation and aligned with the Direct Preference Optimization (DPO) method.
## Implementation Details
The model supports multiple inference precisions, including FP32, BF16, and INT4 quantization, making it adaptable to different deployment scenarios. It performs well across standard benchmarks, with notably strong results on HellaSwag (83.92%) and Winogrande (79.95%).
- Supports context length of 8192 tokens
- Multiple quantization options for optimization
- Integrated with Intel Extension for Transformers
- Average benchmark score of 68.29
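The precision options above can be sketched with the standard Hugging Face `transformers` loading API. This is a minimal illustration, not Intel's reference code: the `Intel/neural-chat-7b-v3-2` checkpoint ID matches the published model, but the INT4 path shown here uses the generic `load_in_4bit` flag (backed by `bitsandbytes`) rather than Intel Extension for Transformers, so verify the exact keywords against the model card.

```python
import torch

MODEL_ID = "Intel/neural-chat-7b-v3-2"

def load_kwargs(precision: str) -> dict:
    """Map a precision name to from_pretrained() keyword arguments.

    FP32 is the framework default; BF16 halves memory with minimal
    quality loss; INT4 quantization needs the bitsandbytes package.
    """
    if precision == "fp32":
        return {"torch_dtype": torch.float32}
    if precision == "bf16":
        return {"torch_dtype": torch.bfloat16}
    if precision == "int4":
        return {"load_in_4bit": True}  # requires bitsandbytes
    raise ValueError(f"unknown precision: {precision}")

def load_neural_chat(precision: str = "bf16"):
    """Download the checkpoint and load it at the requested precision."""
    # Imported here so the helper above can be used without
    # transformers installed (e.g. for config inspection).
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, **load_kwargs(precision))
    return tokenizer, model
```

Calling `load_neural_chat("int4")` trades some accuracy for a much smaller memory footprint, which is typically the deciding factor when serving a 7B model on a single consumer GPU.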
## Core Capabilities
- Advanced mathematical problem-solving
- General language understanding and generation
- Step-by-step solution explanation
- Efficient performance on math-related tasks
## Frequently Asked Questions
Q: What makes this model unique?
This model stands out for its specialized mathematical capabilities while maintaining strong general language understanding. It's optimized for Intel hardware and offers flexible deployment options with various quantization levels.
Q: What are the recommended use cases?
The model excels in mathematical problem-solving, educational assistance, and general language tasks. It's particularly suitable for applications requiring detailed step-by-step explanations and mathematical reasoning.
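To elicit the step-by-step explanations mentioned above, the prompt needs to follow the chat layout the model was tuned on. The sketch below assumes the `### System:` / `### User:` / `### Assistant:` format published for the neural-chat v3 series; the system message wording is just an example, so adjust both against the model card.

```python
def build_prompt(question: str,
                 system: str = "You are a math assistant. Solve step by step.") -> str:
    """Format a question in the neural-chat v3 three-part chat layout.

    The trailing "### Assistant:" line cues the model to begin its answer.
    """
    return (f"### System:\n{system}\n"
            f"### User:\n{question}\n"
            f"### Assistant:\n")

prompt = build_prompt("If 3x + 5 = 20, what is x?")
```

The resulting string is what gets tokenized and passed to `model.generate()`; the system message is the natural place to request detailed step-by-step reasoning for educational use cases.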