opus-mt-tc-big-en-fr

opus-mt-tc-big-en-fr

Helsinki-NLP

High-performance English-to-French neural translation model from OPUS-MT project, achieving BLEU scores of 53.2 on Tatoeba test set with transformer-big architecture.

PropertyValue
Model TypeNeural Machine Translation
Architecturetransformer-big
Release Date2022-03-09
Source LanguageEnglish
Target LanguageFrench
PaperOPUS-MT Paper

What is opus-mt-tc-big-en-fr?

opus-mt-tc-big-en-fr is a state-of-the-art neural machine translation model specifically designed for English to French translation. Developed by Helsinki-NLP as part of the OPUS-MT project, this model leverages the transformer-big architecture and is trained on the comprehensive OPUS dataset. The model demonstrates exceptional performance, achieving a BLEU score of 53.2 on the Tatoeba test set.

Implementation Details

The model is implemented using Marian NMT framework and has been converted to PyTorch using the Hugging Face transformers library. It utilizes SentencePiece tokenization with a 32k vocabulary for both source and target languages.

  • Built on transformer-big architecture
  • Trained on opusTCv20210807+bt dataset
  • Implements SentencePiece tokenization (spm32k)
  • Supports batch translation and pipeline integration

Core Capabilities

  • High-quality English to French translation with state-of-the-art performance
  • Achieves 53.2 BLEU score on Tatoeba test set
  • Strong performance across multiple test sets including news and multi30k
  • Seamless integration with Hugging Face transformers library

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its exceptional performance on various benchmarks and its integration into the OPUS-MT ecosystem, making it particularly reliable for English to French translation tasks. The transformer-big architecture and comprehensive training data ensure high-quality translations across different domains.

Q: What are the recommended use cases?

The model is well-suited for professional translation tasks, content localization, and applications requiring high-quality English to French translation. It performs particularly well on various test sets, including news content, general conversation (Tatoeba), and multi-modal contexts (Multi30k).

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026