st-codesearch-distilroberta-base

st-codesearch-distilroberta-base

flax-sentence-embeddings

A specialized sentence embedding model based on DistilRoBERTa, designed for code search applications with 768-dimensional vectors, trained on code_search_net dataset.

PropertyValue
ArchitectureDistilRoBERTa-base
Vector Dimensions768
Training Datasetcode_search_net
Downloads119,720

What is st-codesearch-distilroberta-base?

This is a specialized sentence transformer model designed for code search applications. Built on the DistilRoBERTa architecture, it maps both natural language sentences and code snippets into a 768-dimensional vector space, enabling semantic search and similarity comparison between text and program code.

Implementation Details

The model was trained using the sentence-transformers framework with a MultipleNegativesRankingLoss approach. Training involved 10,000 steps with a batch size of 256, utilizing AdamW optimizer with a learning rate of 2e-05 and warmup steps of 500. The architecture includes a transformer layer, pooling layer, and normalization layer.

  • Maximum sequence length: 128 tokens
  • Pooling mode: mean tokens
  • Training optimizer: AdamW with weight decay of 0.01
  • Similarity function: dot score with scale 20

Core Capabilities

  • Code search using natural language queries
  • Semantic similarity between code snippets
  • Vector embeddings for both text and code
  • Clustering of similar code segments

Frequently Asked Questions

Q: What makes this model unique?

This model specializes in bridging the gap between natural language and code understanding, making it particularly useful for code search applications. Its architecture is optimized for both text and code representation in the same vector space.

Q: What are the recommended use cases?

The model excels in code search applications, semantic code similarity comparison, and code clustering tasks. It's particularly useful for developers building code search engines or code recommendation systems.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026