ModernBERT-base-squad2-v0.2

Maintained By
Praise2112

ModernBERT-base-squad2-v0.2

PropertyValue
Base ModelModernBERT-base-nli
Parameters149 million
Context Length8,192 tokens
Training DatasetSQuAD v2
Performance83.96% exact match, 87.04% F1
AuthorPraise2112
Model HubHugging Face

What is ModernBERT-base-squad2-v0.2?

ModernBERT-base-squad2-v0.2 is a specialized question-answering model fine-tuned on the SQuAD v2 dataset. Built upon the ModernBERT architecture, it leverages advanced features like Rotary Positional Embeddings (RoPE) and Local-Global Alternating Attention to handle long-form content effectively.

Implementation Details

The model incorporates several modern architectural improvements that enhance its performance and efficiency:

  • Utilizes Rotary Positional Embeddings for superior long-context support
  • Implements Local-Global Alternating Attention for efficient processing of long inputs
  • Features unpadding and Flash Attention for optimized inference
  • Trained with a maximum sequence length of 8,192 tokens
  • Uses AdamW optimizer with learning rate 3e-05 and linear scheduler

Core Capabilities

  • Advanced question-answering on long documents
  • Handles context lengths up to 8,192 tokens
  • Achieves 83.96% exact match accuracy on evaluation
  • Efficient processing through modern attention mechanisms
  • Suitable for document retrieval and semantic search tasks

Frequently Asked Questions

Q: What makes this model unique?

This model combines ModernBERT's advanced architecture with specific fine-tuning for question-answering tasks. Its 8,192 token context length and modern attention mechanisms make it particularly effective for long-document analysis.

Q: What are the recommended use cases?

The model excels in question-answering tasks, especially those involving long documents. It's ideal for applications in document retrieval, classification, and semantic search within large text corpora.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.