nv-embed-v2-ontada-twab-peft

nv-embed-v2-ontada-twab-peft

MendelAI

Advanced sentence embedding model based on NV-Embed-v2, fine-tuned for medical text similarity with 7.85B parameters and 4096-dim outputs

PropertyValue
Parameter Count7.85B
Output Dimensions4096
Base Modelnvidia/NV-Embed-v2
Maximum Sequence Length1024 tokens
Training Dataset Size16,186 samples
PaperSBERT Paper

What is nv-embed-v2-ontada-twab-peft?

This is a specialized sentence transformer model fine-tuned from NVIDIA's NV-Embed-v2 base model, designed specifically for medical text similarity tasks. It transforms medical text into high-dimensional vector representations, enabling sophisticated semantic search and similarity comparisons in healthcare contexts.

Implementation Details

The model implements a sophisticated architecture combining a transformer encoder with special pooling and normalization layers. It was trained using MultipleNegativesRankingLoss with a batch size of 4 and learning rate of 2e-05, achieving impressive evaluation metrics including a cosine NDCG@10 score of 0.8649.

  • BF16 mixed precision training for improved efficiency
  • Custom prompt-based input formatting
  • Specialized pooling mechanism for optimal sentence embeddings
  • Linear learning rate scheduler with 10% warmup

Core Capabilities

  • Semantic textual similarity in medical contexts
  • Patient question-answering support
  • Medical document retrieval and comparison
  • Support for sequences up to 1024 tokens
  • High-dimensional (4096) semantic embeddings

Frequently Asked Questions

Q: What makes this model unique?

This model combines the power of NVIDIA's NV-Embed-v2 with specialized medical domain fine-tuning, offering state-of-the-art performance for medical text similarity tasks with impressive accuracy metrics, particularly its 0.8649 NDCG@10 score.

Q: What are the recommended use cases?

The model excels at medical document retrieval, patient question answering, and semantic similarity tasks in healthcare contexts. It's particularly well-suited for applications requiring precise understanding of medical terminology and context.

Related Models

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026