StableBeluga-7B

stabilityai

StableBeluga-7B is a 6.74B parameter LLaMA2-based language model fine-tuned on Orca-style datasets, optimized for instruction-following and safe interactions.

Property	Value
Parameter Count	6.74B
Model Type	Language Model (LLaMA2-based)
License	STABLE BELUGA NON-COMMERCIAL COMMUNITY LICENSE
Research Paper	LLaMA2 Paper
Developer	Stability AI

What is StableBeluga-7B?

StableBeluga-7B is an advanced language model developed by Stability AI, built upon the LLaMA2 7B architecture and fine-tuned using an Orca-style dataset. The model is specifically designed to excel at instruction-following tasks while maintaining safety and ethical considerations in its responses.

Implementation Details

The model implements a sophisticated training procedure using mixed-precision (BF16) and AdamW optimization. It utilizes a two-phase training approach with carefully tuned hyperparameters, including batch sizes of 256/512 and learning rates of 3e-5 with cosine decay.

Supports both F32 and FP16 tensor types for flexibility
Employs a specific prompt format with System, User, and Assistant components
Trained on four specialized datasets for comprehensive knowledge
Implements automatic device mapping for efficient resource utilization

Core Capabilities

Advanced instruction following with safety considerations
Natural language generation and understanding
Context-aware responses with system-guided behavior
Support for both inference endpoints and local deployment

Frequently Asked Questions

Q: What makes this model unique?

StableBeluga-7B stands out for its careful balance of performance and safety, using a specialized Orca-style dataset and implementing strict ethical guidelines. The model's architecture is optimized for instruction-following while maintaining reasonable resource requirements.

Q: What are the recommended use cases?

The model is well-suited for research and non-commercial applications requiring sophisticated language understanding and generation. It excels in scenarios requiring safe, controlled responses while maintaining high-quality output. The model is particularly effective when used with its specific prompt format for consistent results.