mDeBERTa-v3-base-mnli-xnli

Maintained By
MoritzLaurer

mDeBERTa-v3-base-mnli-xnli

PropertyValue
Parameter Count279M
LicenseMIT
Languages Supported100+ (15 primary)
Primary PaperDeBERTa-v3 Paper

What is mDeBERTa-v3-base-mnli-xnli?

mDeBERTa-v3-base-mnli-xnli is a state-of-the-art multilingual model designed for natural language inference (NLI) and zero-shot classification tasks. Developed by Microsoft and fine-tuned by Moritz Laurer, this model represents a significant advancement in multilingual NLP, supporting 100 languages with particular optimization for 15 core languages.

Implementation Details

The model is built on Microsoft's mDeBERTa architecture, pre-trained on the CC100 multilingual dataset and fine-tuned on both XNLI and MNLI datasets. It achieves an impressive average accuracy of 80.8% across 15 languages, with English performance reaching 88.3%.

  • Pre-trained on CC100 multilingual dataset
  • Fine-tuned on professionally translated XNLI development set (37,350 texts)
  • Incorporates English MNLI training set (392,702 texts)
  • Supports zero-shot classification across languages

Core Capabilities

  • Multilingual Natural Language Inference
  • Zero-shot Classification
  • Cross-lingual Transfer Learning
  • Text Classification across 100+ languages

Frequently Asked Questions

Q: What makes this model unique?

The model's uniqueness lies in its combination of broad language coverage (100+ languages) while maintaining state-of-the-art performance for a base-sized model. It specifically avoids using machine-translated training data, focusing instead on high-quality professional translations to prevent overfitting.

Q: What are the recommended use cases?

The model excels in zero-shot classification tasks and natural language inference across multiple languages. It's particularly useful for applications requiring multilingual text understanding without the need for language-specific training data.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.