mbarthez

Maintained By
moussaKam

mBARThez

PropertyValue
Parameter Count458M
Model TypeSequence-to-sequence
ArchitectureBART LARGE (24 layers)
PaperarXiv:2010.12321
AuthormoussaKam

What is mBARThez?

mBARThez is an advanced French sequence-to-sequence model based on the BART architecture. It represents a significant advancement in French language processing, created by continuing the pretraining of multilingual BART (mBART) specifically for French language tasks. The model is trained on an impressive 66GB corpus of French text, making it particularly robust for generative tasks.

Implementation Details

The model utilizes a LARGE architecture configuration with 24 layers and 458M parameters, significantly more sophisticated than its BASE counterpart BARThez (165M parameters). It's trained using a reconstruction objective where the model learns to reconstruct corrupted input sentences, enabling strong understanding of French language structure and semantics.

  • Pretrained on 66GB of French text data
  • 24-layer LARGE architecture
  • 458M parameters for enhanced performance
  • Built upon multilingual BART foundation

Core Capabilities

  • Generative text tasks (particularly abstractive summarization)
  • Enhanced performance in both discriminative and generative tasks
  • Superior to traditional BERT-based French models for generation
  • Comprehensive French language understanding and generation

Frequently Asked Questions

Q: What makes this model unique?

Unlike existing French language models like CamemBERT and FlauBERT that are BERT-based, mBARThez is specifically designed for generative tasks. Its uniqueness lies in having both a pretrained encoder and decoder, making it particularly effective for tasks requiring text generation.

Q: What are the recommended use cases?

mBARThez excels in generative tasks, particularly abstractive summarization. It's well-suited for applications requiring French text generation, translation, and other sequence-to-sequence tasks where both understanding and generation of French text is crucial.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.