bge-reranker-large

Maintained By
BAAI

BGE Reranker Large

PropertyValue
Parameter Count560M
LicenseMIT
LanguagesChinese, English
FrameworkPyTorch, ONNX

What is bge-reranker-large?

BGE Reranker Large is a powerful cross-encoder model designed for high-accuracy text similarity scoring and reranking. Unlike traditional embedding models, it directly processes query-document pairs to produce relevance scores, making it ideal for refining search results and improving information retrieval systems.

Implementation Details

The model utilizes a cross-encoder architecture that performs full-attention over input pairs, offering superior accuracy compared to bi-encoder approaches. It's built on XLM-RoBERTa architecture and supports both PyTorch and ONNX inference frameworks.

  • Optimized for both Chinese and English text processing
  • Supports efficient top-k reranking of search results
  • Achieves state-of-the-art performance on multiple benchmarks including MTEB and C-MTEB
  • Includes FP16 support for faster inference

Core Capabilities

  • Cross-lingual reranking with support for Chinese-English and English-Chinese pairs
  • Direct relevance scoring without need for separate embeddings
  • High performance on medical QA tasks (CMedQAv1/v2)
  • Efficient integration with existing search pipelines

Frequently Asked Questions

Q: What makes this model unique?

The model combines high accuracy with practical efficiency, using a cross-encoder architecture that directly computes relevance scores for text pairs. It achieves superior performance on both monolingual and cross-lingual tasks, making it especially valuable for multilingual applications.

Q: What are the recommended use cases?

The model is best used for reranking top-k results from a first-stage retrieval system. It's particularly effective for improving search quality in medical QA systems, document retrieval, and cross-lingual information retrieval tasks.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.