Mistral-Nemo-Instruct-Uz

Property	Value
License	Apache 2.0
Languages	Uzbek, English
Base Model	Mistral-Nemo-Instruct-2407
Framework	Transformers

What is Mistral-Nemo-Instruct-Uz?

Mistral-Nemo-Instruct-Uz is a specialized language model developed by a team of researchers including Eldor Fozilov, Azimjon Urinov, and Khurshid Juraev. It's built upon the Mistral-Nemo-Instruct base model and has been specifically optimized for Uzbek language tasks while maintaining strong English language capabilities.

Implementation Details

The model has been continually pre-trained and instruction-tuned using a diverse mix of datasets including uz-crawl, c4, Wikipedia-uzbek, and custom translation instruction sets. It demonstrates impressive performance metrics, particularly in translation tasks, with BLEU scores of 30.49 for Uzbek-to-English and 15.52 for English-to-Uzbek translation.

Achieves 87.04 COMET score for Uz-En translation
Maintains 82.05% accuracy on Uzbek sentiment analysis
Scores 67.36 on MMLU (English) benchmark

Core Capabilities

Bilateral translation between Uzbek and English
Text summarization in both languages
Question-answering capabilities
Sentiment analysis and news classification
Instruction following in both languages

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its specialized optimization for the Uzbek language while maintaining strong performance in English. It outperforms base models in translation benchmarks and shows minimal degradation in general language understanding tasks.

Q: What are the recommended use cases?

The model excels in translation tasks, content summarization, and various NLP tasks in both Uzbek and English. It's particularly suitable for applications requiring bilingual capabilities in these languages.