deberta-v3-base-tasksource-nli

Maintained By
sileod

DeBERTa-v3-base-tasksource-nli

PropertyValue
Parameter Count184M
LicenseApache 2.0
PaperarXiv:2301.05948
ArchitectureDeBERTa-v3-base

What is deberta-v3-base-tasksource-nli?

This is a powerful language model based on DeBERTa-v3-base architecture, fine-tuned through multi-task learning on over 600 NLP tasks. It excels in zero-shot classification and natural language inference (NLI) tasks, achieving impressive results like 70% accuracy on WNLI.

Implementation Details

The model underwent extensive training for 200,000 steps with a batch size of 384 and a peak learning rate of 2e-5. Training was conducted on an Nvidia A30 24GB GPU over 15 days. It implements task-specific CLS embeddings with a 10% dropout rate to ensure flexibility in usage.

  • Multi-task learning across 600+ tasks
  • Shared classification layers for multiple-choice tasks
  • Optimized weight sharing for matching labels
  • Specialized NLI dataset integration for improved zero-shot performance

Core Capabilities

  • Zero-shot classification with arbitrary labels
  • Natural Language Inference (NLI) tasks
  • Support for hundreds of tasks via tasksource-adapters
  • Fine-tuning capabilities for new tasks
  • Token classification and multiple-choice task handling

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its extensive multi-task training across 600+ tasks and ranked first among all models with the DeBERTa-v3-base architecture in IBM's model recycling evaluation. It offers exceptional flexibility through its zero-shot capabilities and tasksource-adapters integration.

Q: What are the recommended use cases?

The model is ideal for zero-shot classification tasks, natural language inference, and can be efficiently adapted for specific classification tasks through fine-tuning. It's particularly strong in scenarios requiring understanding of textual entailment and semantic relationships.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.