NLLB-200 Distilled 1.3B
Property | Value |
---|---|
Model Size | 1.3B parameters |
License | CC-BY-NC-4.0 |
Author | |
Languages | 200 languages |
Task | Translation |
What is nllb-200-distilled-1.3B?
NLLB-200 distilled 1.3B is a compressed version of Facebook's No Language Left Behind translation model, designed to provide efficient multilingual translation across 200 languages. This distilled variant maintains high performance while reducing computational requirements compared to the larger 3.3B parameter model.
Implementation Details
The model utilizes transformer architecture optimized for translation tasks, with special attention to low-resource languages. It supports translation between any pair of its 200 supported languages, including many African and Asian languages that are typically underserved in machine translation.
- Trained on diverse parallel and monolingual data from various sources
- Maximum input sequence length of 512 tokens
- Evaluated using BLEU, spBLEU, and chrF++ metrics
- Implements SentencePiece tokenization
Core Capabilities
- Direct translation between 200 languages without pivot languages
- Specialized handling of different scripts and writing systems
- Support for low-resource languages
- Single sentence translation optimization
- Research-focused implementation with emphasis on accessibility
Frequently Asked Questions
Q: What makes this model unique?
This model's primary strength lies in its comprehensive coverage of 200 languages, including many low-resource languages, while maintaining efficiency through distillation to 1.3B parameters. It's particularly notable for its support of various writing systems and scripts, from Latin to Arabic to various Asian scripts.
Q: What are the recommended use cases?
The model is best suited for research purposes and single-sentence translation tasks. It's specifically designed for general domain text and should not be used for specialized domains like medical or legal translation. It's also not intended for production deployment or certified translations.