ModernBERT-base-squad2-v0.2

ModernBERT-base-squad2-v0.2

Praise2112

Fine-tuned ModernBERT model for question-answering on SQuAD v2, featuring 149M parameters and 8192 token context length. Achieves 83.96% exact match accuracy.

PropertyValue
Base ModelModernBERT-base-nli
Parameters149 million
Context Length8,192 tokens
Training DatasetSQuAD v2
Performance83.96% exact match, 87.04% F1
AuthorPraise2112
Model HubHugging Face

What is ModernBERT-base-squad2-v0.2?

ModernBERT-base-squad2-v0.2 is a specialized question-answering model fine-tuned on the SQuAD v2 dataset. Built upon the ModernBERT architecture, it leverages advanced features like Rotary Positional Embeddings (RoPE) and Local-Global Alternating Attention to handle long-form content effectively.

Implementation Details

The model incorporates several modern architectural improvements that enhance its performance and efficiency:

  • Utilizes Rotary Positional Embeddings for superior long-context support
  • Implements Local-Global Alternating Attention for efficient processing of long inputs
  • Features unpadding and Flash Attention for optimized inference
  • Trained with a maximum sequence length of 8,192 tokens
  • Uses AdamW optimizer with learning rate 3e-05 and linear scheduler

Core Capabilities

  • Advanced question-answering on long documents
  • Handles context lengths up to 8,192 tokens
  • Achieves 83.96% exact match accuracy on evaluation
  • Efficient processing through modern attention mechanisms
  • Suitable for document retrieval and semantic search tasks

Frequently Asked Questions

Q: What makes this model unique?

This model combines ModernBERT's advanced architecture with specific fine-tuning for question-answering tasks. Its 8,192 token context length and modern attention mechanisms make it particularly effective for long-document analysis.

Q: What are the recommended use cases?

The model excels in question-answering tasks, especially those involving long documents. It's ideal for applications in document retrieval, classification, and semantic search within large text corpora.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026