sentence-camembert-large

sentence-camembert-large

dangvantuan

French sentence embedding model based on CamemBERT-Large (337M params), achieving 85.9% Pearson correlation on STS benchmark. Optimized for semantic similarity.

PropertyValue
Parameter Count337M
LicenseApache 2.0
Research PaperSentence-BERT Paper
ArchitectureCamemBERT Large with Sentence Transformers

What is sentence-camembert-large?

Sentence-CamemBERT-Large is a specialized French language embedding model developed by La Javaness. Built on facebook/camembert-large architecture, it's specifically designed to convert French text into meaningful vector representations, enabling advanced semantic search and similarity comparisons. With 337M parameters, it significantly outperforms existing multilingual models on French semantic tasks.

Implementation Details

The model is implemented using the Sentence-Transformers framework and fine-tuned on the French portion of the STSB dataset. It achieves impressive performance metrics, including 85.9% Pearson correlation and 85.8% Spearman correlation on the test set, surpassing both GPT-3 and other multilingual models.

  • Built on CamemBERT-Large architecture
  • Fine-tuned using Siamese BERT-Networks
  • Optimized for French language understanding
  • State-of-the-art performance on semantic similarity tasks

Core Capabilities

  • Semantic sentence encoding for French text
  • High-quality sentence embeddings for similarity comparison
  • Efficient vector representation of text meaning
  • Superior performance compared to multilingual alternatives

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its specialized focus on French language understanding, achieving state-of-the-art performance metrics that surpass both general-purpose models like GPT-3 and multilingual alternatives. Its architecture is specifically optimized for semantic similarity tasks in French.

Q: What are the recommended use cases?

The model is ideal for semantic search applications, document similarity comparison, text clustering, and any NLP task requiring deep understanding of French text semantics. It's particularly effective for applications requiring precise semantic matching between French sentences.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026