DeciLM-7B

Maintained By
Deci

DeciLM-7B

PropertyValue
Parameter Count7.04B
LicenseApache 2.0
Context Length8192 tokens
Architecture32 layers, 32 heads with Variable GQA
LanguageEnglish

What is DeciLM-7B?

DeciLM-7B is a state-of-the-art language model that represents a significant advancement in the field of natural language processing. Developed by Deci, it's currently the top-performing 7B base language model on the Open LLM Leaderboard. The model combines exceptional accuracy with remarkable computational efficiency, achieved through its innovative use of variable Grouped-Query Attention (GQA) and architecture optimization via Deci's proprietary Neural Architecture Search technology, AutoNAC.

Implementation Details

The model features a sophisticated architecture optimized for both performance and efficiency:

  • 7.04 billion parameters distributed across 32 layers
  • 32 attention heads with variable GQA implementation
  • 8,192 token sequence length capability
  • Optimized decoder-only architecture
  • BF16 tensor type for efficient computation

Core Capabilities

  • Top-tier performance on multiple benchmarks (61.55 average score)
  • Up to 4.4x faster throughput compared to Mistral-7B
  • Exceptional performance on tasks like HellaSwag (82.51) and Winogrande (79.95)
  • Commercial and research applications support
  • Fine-tuning capability for various tasks and domains

Frequently Asked Questions

Q: What makes this model unique?

DeciLM-7B stands out for its optimal balance between accuracy and computational efficiency, achieved through variable GQA and AutoNAC architecture optimization. It delivers superior performance while maintaining high throughput, making it ideal for production deployments.

Q: What are the recommended use cases?

The model is well-suited for commercial and research applications in English, including text generation, content creation, and various NLP tasks. Its efficient architecture makes it particularly valuable for applications requiring high throughput and accuracy within resource constraints.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.