DeciLM-6b-instruct

Maintained By
Deci

DeciLM-6b-instruct

PropertyValue
Parameter Count5.72B
Model TypeInstruction-tuned Language Model
LicenseLlama 2 Community License
Training DataSlimPajama-627B and OpenOrca
LanguageEnglish

What is DeciLM-6b-instruct?

DeciLM-6b-instruct is an advanced language model developed by Deci AI, specifically designed for short-form instruction following. It's built upon the base DeciLM 6B model and fine-tuned using LoRA on the OpenOrca dataset. The model implements an optimized transformer decoder architecture with variable Grouped-Query Attention, achieving impressive performance across multiple benchmarks.

Implementation Details

The model utilizes BF16 tensor types and demonstrates remarkable inference speed, achieving 652.49 tokens/sec on an A10 GPU using PyTorch, and up to 2,029.6 tokens/sec using Infery LLM. The architecture incorporates advanced proprietary methodologies that enable faster training and inference compared to similar-sized models.

  • Optimized transformer decoder architecture
  • Variable Grouped-Query Attention implementation
  • BF16 precision for efficient computation
  • Comprehensive benchmark performance across 9 different tasks

Core Capabilities

  • Strong performance on BoolQ (77.34%) and PIQA (77.52%)
  • Effective reasoning capabilities demonstrated by HellaSwag score (74.57%)
  • Reliable performance on LAMBDA OpenAI benchmark (70.1%)
  • Suitable for commercial and research applications

Frequently Asked Questions

Q: What makes this model unique?

DeciLM-6b-instruct stands out due to its optimized architecture with variable Grouped-Query Attention, making it significantly faster than comparable models while maintaining strong performance across various benchmarks. Its efficient design allows for exceptional inference speeds, particularly when using specialized inference tools.

Q: What are the recommended use cases?

The model is particularly well-suited for short-form instruction following tasks, commercial applications, and research use in English. It can be fine-tuned for other languages and shows strong performance in question-answering, reasoning, and general language understanding tasks.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.