all-MiniLM-L6-v2-onnx

all-MiniLM-L6-v2-onnx

Qdrant

ONNX-optimized version of all-MiniLM-L6-v2 for efficient sentence similarity and text embeddings, with Apache 2.0 license and 215K+ downloads

PropertyValue
LicenseApache 2.0
Downloads215,331
AuthorQdrant
Primary UseSentence Similarity & Text Embeddings

What is all-MiniLM-L6-v2-onnx?

all-MiniLM-L6-v2-onnx is an ONNX-optimized version of the popular sentence-transformers/all-MiniLM-L6-v2 model, specifically designed for efficient text classification and similarity searches. This model has been optimized for production deployment with ONNX runtime, making it particularly suitable for use with Qdrant vector database.

Implementation Details

The model is implemented using the ONNX framework and is designed to work seamlessly with FastEmbed for generating text embeddings. It requires specific configuration with Qdrant's Modifier.IDF for optimal performance in vector similarity searches.

  • ONNX optimization for improved inference speed
  • Compatible with FastEmbed library
  • Integrated support for Qdrant vector database
  • Specialized for sentence similarity tasks

Core Capabilities

  • Text embedding generation
  • Sentence similarity comparison
  • Efficient vector representation of text
  • Production-ready inference

Frequently Asked Questions

Q: What makes this model unique?

This model's ONNX optimization makes it particularly efficient for production deployments, while maintaining the powerful semantic understanding capabilities of the original all-MiniLM-L6-v2 model.

Q: What are the recommended use cases?

The model is ideal for applications requiring semantic search, document similarity comparison, and text classification tasks, particularly when integrated with Qdrant vector database.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026