neural-chat-7b-v3-1

Maintained By
Intel

Neural Chat 7B v3.1

PropertyValue
Parameter Count7.24B
Base ModelMistral-7B-v0.1
LicenseApache 2.0
Context Length8192 tokens
Training DatasetOpen-Orca/SlimOrca

What is neural-chat-7b-v3-1?

Neural Chat 7B v3.1 is an advanced language model developed by Intel, built upon the Mistral-7B architecture and fine-tuned using the Open-Orca/SlimOrca dataset. This model represents a significant improvement over its predecessors, featuring enhanced performance across multiple benchmarks and optimized for deployment on Intel Gaudi 2 processors.

Implementation Details

The model employs Direct Preference Optimization (DPO) for alignment and supports multiple inference modes including FP32, BF16, and INT4 quantization. It was trained using 8 Gaudi2 cards with DeepSpeed Zero2 optimization, achieving impressive benchmark scores across various tasks.

  • Supports flexible deployment options with both Transformers and Intel Extensions
  • Features comprehensive quantization capabilities for efficient inference
  • Implements advanced training techniques including DPO alignment

Core Capabilities

  • Strong performance on ARC (66.21%) and HellaSwag (83.64%) benchmarks
  • Enhanced truthfulness with TruthfulQA score of 59.65%
  • Improved mathematical reasoning with GSM8K score of 19.56%
  • Versatile text generation and conversation capabilities

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its balanced performance across multiple benchmarks and its optimization for Intel hardware. It shows significant improvements over the base Mistral-7B model while maintaining efficient deployment options through various quantization schemes.

Q: What are the recommended use cases?

The model is well-suited for general language tasks, conversational AI, and educational applications. It performs particularly well in scenarios requiring truthful responses and can handle both analytical and creative tasks effectively.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.