efficient-splade-VI-BT-large-doc

Property	Value
License	cc-by-nc-sa-4.0
Paper	View Paper
MRR@10 (MS MARCO dev)	38.0
Inference Latency	0.7 ms

What is efficient-splade-VI-BT-large-doc?

This is a specialized document encoder that forms one half of a two-model architecture for efficient passage retrieval. It is the document-side component of a SPLADE (Sparse Lexical and Expansion) model, paired with a separate query encoder, and is optimized for both retrieval quality and efficiency. The model reports an MRR@10 of 38.0 on the MS MARCO dev set with an inference latency of 0.7 ms.
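For readers unfamiliar with the metric, MRR@10 averages, over all queries, the reciprocal rank of the first relevant passage found in the top 10 results (0 if none is found). The reported 38.0 is on a 0 to 100 scale, i.e. 0.380. The sketch below is a minimal illustration on toy data; the function name and passage ids are hypothetical and this is not the official MS MARCO evaluation script.

```python
def mrr_at_k(ranked_ids_per_query, relevant_ids_per_query, k=10):
    """Mean reciprocal rank of the first relevant passage within the top k."""
    total = 0.0
    for ranked_ids, relevant_ids in zip(ranked_ids_per_query, relevant_ids_per_query):
        for rank, passage_id in enumerate(ranked_ids[:k], start=1):
            if passage_id in relevant_ids:
                total += 1.0 / rank
                break  # only the first relevant hit counts
    return total / len(ranked_ids_per_query)


# Toy data: three queries with hypothetical passage ids.
rankings = [["p3", "p1", "p7"], ["p2", "p9", "p4"], ["p5", "p6", "p8"]]
relevant = [{"p1"}, {"p2"}, {"p0"}]

# Query 1 hits at rank 2, query 2 at rank 1, query 3 has no hit.
score = mrr_at_k(rankings, relevant)
print(f"MRR@10 = {score:.3f}")
```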

Implementation Details

The model uses a modified DistilBERT architecture tuned for document encoding. It combines several efficiency-focused techniques: L1 regularization, FLOPS-regularized middle-training, and separate document and query encoders. Together these aim for retrieval quality close to state-of-the-art first-stage neural rankers at latency comparable to BM25.

Achieves 97.8% R@1000 on the MS MARCO dev set
Optimized for sparse representation learning
Produces bag-of-words style sparse vectors whose term weights and expansion terms are learned by the network
Utilizes knowledge distillation for improved efficiency
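To make the sparse representation and the two regularizers concrete, here is a minimal pure-Python sketch. It assumes the max-pooling SPLADE formulation, where each vocabulary term's weight is the maximum over token positions of log(1 + ReLU(logit)). The function names and toy numbers are illustrative assumptions, not this model's training code.

```python
import math


def splade_pool(token_logits):
    """Max-pool log(1 + ReLU(logit)) over token positions: one weight per vocab term."""
    vocab_size = len(token_logits[0])
    return [
        max(math.log1p(max(row[j], 0.0)) for row in token_logits)
        for j in range(vocab_size)
    ]


def l1_penalty(batch):
    """Mean over the batch of the summed absolute term weights."""
    return sum(sum(abs(w) for w in vec) for vec in batch) / len(batch)


def flops_penalty(batch):
    """Sum over vocab terms of the squared mean activation across the batch."""
    n = len(batch)
    vocab_size = len(batch[0])
    return sum((sum(vec[j] for vec in batch) / n) ** 2 for j in range(vocab_size))


# Toy masked-language-model logits: 2 token positions over a 4-term vocabulary.
toy_logits = [
    [-1.0, 0.0, 0.2, 2.0],
    [0.5, -3.0, 0.0, 1.0],
]
doc_vec = splade_pool(toy_logits)

# Negative or zero logits give a weight of exactly 0, which is what makes the vector sparse.
print([round(w, 3) for w in doc_vec])
```

Both penalties push weights toward zero. The FLOPS variant squares the per-term mean across the batch, so it penalizes terms that are active in many documents more heavily, which keeps posting lists short.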

Core Capabilities

Fast encoding, with a reported inference latency of 0.7 ms
Efficient passage retrieval using sparse representations
Latency comparable to traditional BM25, with competitive retrieval performance
Scalable document indexing for large-scale applications
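Sparse representations can be served from an inverted index, much like BM25. The following is a small sketch of that idea under stated assumptions: document and query vectors are given as hypothetical term-to-weight dictionaries (in practice they would come from the document and query encoders), and relevance is scored as their dot product. Production systems use dedicated engines rather than Python dictionaries.

```python
from collections import defaultdict


def build_index(doc_vectors):
    """Map each term to a postings list of (doc_id, weight), skipping zero weights."""
    index = defaultdict(list)
    for doc_id, vec in doc_vectors.items():
        for term, weight in vec.items():
            if weight > 0.0:
                index[term].append((doc_id, weight))
    return index


def search(index, query_vector, top_k=10):
    """Score documents by the dot product of query and document term weights."""
    scores = defaultdict(float)
    for term, q_weight in query_vector.items():
        # Only postings for terms present in the query are touched.
        for doc_id, d_weight in index.get(term, []):
            scores[doc_id] += q_weight * d_weight
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)[:top_k]


# Hypothetical sparse vectors; weights are made up for illustration.
docs = {
    "d1": {"sparse": 1.2, "retrieval": 0.8, "index": 0.3},
    "d2": {"dense": 1.0, "retrieval": 0.5},
    "d3": {"cooking": 1.5},
}
index = build_index(docs)
results = search(index, {"sparse": 1.0, "retrieval": 0.5})
print(results)
```

Documents that share no term with the query (here d3) are never scored, which is why sparser vectors translate directly into less work per query.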

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its efficiency-performance trade-off: it achieves latency similar to traditional BM25 systems while maintaining competitive retrieval performance. Separating the document and query encoders lets each side be optimized independently, and documents can be encoded ahead of time at indexing, so only the query encoder runs at search time.

Q: What are the recommended use cases?

The model is specifically designed for large-scale passage retrieval tasks where both efficiency and effectiveness are crucial. It's particularly well-suited for applications requiring fast document indexing and retrieval with near state-of-the-art performance.