gte-modernbert-base

gte-modernbert-base

Alibaba-NLP

gte-modernbert-base is a 149M parameter text embedding model with 768-dimensional outputs, supporting 8192 token sequences and achieving 64.38 MTEB score.

PropertyValue
Model Size149M parameters
Output Dimension768
Max Sequence Length8192 tokens
MTEB Score64.38
DeveloperAlibaba-NLP (Tongyi Lab)
Primary LanguageEnglish

What is gte-modernbert-base?

gte-modernbert-base is an advanced text embedding model developed by Alibaba's Tongyi Lab, built upon the modernBERT pre-trained encoder-only foundation models. It represents a significant advancement in text embedding technology, offering competitive performance across various evaluation benchmarks while maintaining a relatively compact model size of 149M parameters.

Implementation Details

The model leverages modern architecture optimizations and can be easily integrated using popular frameworks like transformers and sentence-transformers. It supports Flash Attention 2 for improved efficiency and can process sequences up to 8192 tokens in length, outputting 768-dimensional embeddings.

  • Achieves 64.38 on MTEB benchmark evaluation
  • Demonstrates strong performance in BEIR (55.33) and LoCo (87.57) evaluations
  • Supports both text embedding and reranking tasks
  • Compatible with transformers.js for browser-based applications

Core Capabilities

  • Long-context understanding with 8192 token support
  • High-quality text embeddings for semantic search
  • Effective document retrieval and comparison
  • Strong performance in code retrieval tasks (79.31 COIR score)

Frequently Asked Questions

Q: What makes this model unique?

The model combines the benefits of modernBERT architecture with optimized training for text embeddings, offering an excellent balance between model size and performance. Its support for long sequences (8192 tokens) and competitive benchmark scores make it particularly valuable for practical applications.

Q: What are the recommended use cases?

The model excels in semantic search, document retrieval, text similarity analysis, and code search applications. It's particularly well-suited for applications requiring long-context understanding and efficient text representation.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026