DeBERTa-v3-base-mnli-fever-anli
Property | Value |
---|---|
Parameter Count | 184M |
License | MIT |
Paper | DeBERTa Paper |
Training Data | 763,913 NLI pairs |
What is DeBERTa-v3-base-mnli-fever-anli?
This is a specialized version of Microsoft's DeBERTa-v3-base model, fine-tuned specifically for natural language inference (NLI) tasks. The model has been trained on a comprehensive dataset combining MultiNLI, Fever-NLI, and Adversarial-NLI, making it particularly robust for zero-shot classification tasks. Notably, it outperforms many larger models on the ANLI benchmark despite its base size.
Implementation Details
The model leverages the advanced DeBERTa-v3 architecture with significant improvements in pre-training objectives compared to previous versions. It was trained using mixed precision training with specific hyperparameters including a learning rate of 2e-05, batch size of 32, and 3 training epochs.
- Zero-shot classification capability with simple pipeline implementation
- Supports both single-label and multi-label classification
- Optimized for inference tasks with three output labels: entailment, neutral, and contradiction
Core Capabilities
- High accuracy on MNLI benchmark (90.3%)
- Strong performance on FEVER-NLI (77.7%)
- Competitive ANLI performance (49.5% on R3)
- Efficient zero-shot classification for various tasks
Frequently Asked Questions
Q: What makes this model unique?
The model combines DeBERTa-v3's advanced architecture with comprehensive NLI training, making it particularly effective for zero-shot classification while being more efficient than larger models.
Q: What are the recommended use cases?
The model excels at zero-shot text classification, natural language inference, and hypothesis-premise pair analysis. It's particularly useful when you need to classify text without task-specific training data.