Snowflake Arctic Embed M

Property	Value
Parameter Count	109M
Embedding Dimension	768
License	Apache-2.0
Paper	Technical Report

What is snowflake-arctic-embed-m?

Snowflake Arctic Embed M is a medium-sized text embedding model that represents the optimal balance between performance and efficiency in the Arctic Embed family. Based on the intfloat/e5-base-unsupervised architecture, it achieves state-of-the-art retrieval performance with an MTEB Retrieval Score (NDCG@10) of 54.90, surpassing comparable models in its class.

Implementation Details

The model is trained through a sophisticated multi-stage pipeline that begins with pretraining on approximately 400M samples of mixed public datasets and proprietary web search data. This is followed by focused training on 1M carefully curated triplets of query-positive-negative documents, utilizing hard negative mining techniques. The model produces 768-dimensional embeddings and supports a context window of 512 tokens.

Multi-stage training pipeline with hard negative mining
Optimized for retrieval tasks with 768-dimensional embeddings
Supports both sentence-transformers and Hugging Face transformers implementations
Includes specialized query prefixing for enhanced retrieval performance

Core Capabilities

State-of-the-art retrieval performance in its size class
Efficient processing with 109M parameters
Versatile implementation options including JavaScript support via Transformers.js
Optimized for both symmetric and asymmetric similarity search

Frequently Asked Questions

Q: What makes this model unique?

The model achieves superior retrieval performance through its specialized training pipeline and optimization techniques, offering a perfect balance between model size and performance. It outperforms other base-sized models while maintaining practical deployment requirements.

Q: What are the recommended use cases?

The model excels in text retrieval tasks, semantic search applications, and document similarity comparisons. It's particularly well-suited for enterprise-scale deployments where both accuracy and efficiency are crucial.