snowflake-arctic-embed-m-v1.5

Maintained By
Snowflake

Snowflake Arctic-embed-m-v1.5

PropertyValue
Parameter Count109M
LicenseApache 2.0
PaperarXiv:2407.18887
MTEB Retrieval Score55.14 (NDCG@10)

What is snowflake-arctic-embed-m-v1.5?

Snowflake's arctic-embed-m-v1.5 is an advanced text embedding model designed to generate highly compressible embedding vectors while maintaining exceptional retrieval quality. This model represents a significant improvement over its predecessor, achieving better performance even when compressed to as little as 128 bytes per vector through innovative compression techniques.

Implementation Details

The model employs a sophisticated combination of Matryoshka Representation Learning (MRL) and uniform scalar quantization to achieve its impressive compression capabilities. It generates 768-dimensional vectors that can be effectively truncated to 256 dimensions while preserving semantic meaning.

  • Achieves 98% of original performance even at 128-byte compression (24x reduction)
  • Supports both 4-bit and 8-bit uniform scalar quantization
  • Optimized ranges: -0.18 to +0.18 for 4-bit, -0.3 to +0.3 for 8-bit quantization
  • Compatible with popular frameworks including Sentence Transformers and Hugging Face Transformers

Core Capabilities

  • State-of-the-art embedding generation with 55.14 MTEB Retrieval Score
  • Superior compression maintaining 99% quality at 256 dimensions
  • Efficient storage with up to 7.8M vectors per GB at maximum compression
  • Cross-framework compatibility and easy integration

Frequently Asked Questions

Q: What makes this model unique?

The model's ability to maintain high performance even under extreme compression sets it apart. It achieves better retrieval quality than larger models like Google's gecko (1200M parameters) when compressed to 256 dimensions.

Q: What are the recommended use cases?

The model is ideal for large-scale retrieval systems where storage efficiency is crucial. It's particularly well-suited for applications requiring high-quality semantic search with minimal storage overhead.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.