dunzhang-stella_en_400M_v5

Maintained By
Marqo

Marqo Stella v2

PropertyValue
Parameter Count435M
Model TypeSentence Transformer
LicenseMIT
FrameworkPyTorch

What is dunzhang-stella_en_400M_v5?

This model is an enhanced version of the original Dunzhang Stella 400M model, featuring a fused matryoshka layer that optimizes embedding generation while maintaining high performance. It's specifically designed for efficient sentence similarity and embedding tasks, achieving strong results across the MTEB benchmark suite.

Implementation Details

The model implements a hierarchical structuring through its matryoshka layer, which reduces computational overhead during embedding generation. It utilizes the transformers framework and can be easily integrated into existing pipelines for various NLP tasks.

  • Efficient embedding generation through fused matryoshka architecture
  • Optimized for sentence similarity tasks
  • Strong performance on MTEB benchmark suite
  • Compatible with PyTorch and Hugging Face transformers

Core Capabilities

  • Sentence embedding generation
  • Text similarity comparison
  • Document retrieval
  • Classification tasks with 85%+ accuracy on multiple datasets
  • Clustering with strong v-measure scores

Frequently Asked Questions

Q: What makes this model unique?

The model's fused matryoshka layer sets it apart by reducing computational overhead while maintaining performance metrics. This makes it particularly efficient for production deployments where resource optimization is crucial.

Q: What are the recommended use cases?

The model excels in semantic search, document similarity, clustering, and classification tasks. It's particularly well-suited for applications requiring efficient text embedding generation while maintaining high accuracy.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.