wav2vec2-xls-r-2b-21-to-en

Maintained By
facebook

wav2vec2-xls-r-2b-21-to-en

PropertyValue
AuthorFacebook
LicenseApache 2.0
PaperXLS-R Paper
Supported Languages21 languages to English

What is wav2vec2-xls-r-2b-21-to-en?

This is a sophisticated SpeechEncoderDecoderModel developed by Facebook for multilingual speech translation. The model combines a wav2vec2-xls-r-2b encoder with an mbart-large-50 decoder, specifically fine-tuned to translate speech from 21 different languages into English. It represents a significant advancement in cross-lingual speech processing technology.

Implementation Details

The model architecture consists of two main components: an encoder pre-trained on wav2vec2-xls-r-2b and a decoder initialized from mbart-large-50. It was fine-tuned on the Covost2 dataset for speech translation tasks.

  • Supports translation from 21 languages including French, German, Spanish, Russian, and more
  • Utilizes transformer-based architecture for both encoding and decoding
  • Implements automatic speech recognition pipeline for easy deployment

Core Capabilities

  • Direct speech-to-text translation from multiple languages to English
  • High-performance speech recognition across diverse language families
  • Batch processing support for efficient translation
  • Integration with Hugging Face's transformers library

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its ability to handle 21 different source languages and translate them directly to English, using a massive 2 billion parameter architecture that achieves state-of-the-art performance on the Covost2 benchmark.

Q: What are the recommended use cases?

The model is ideal for applications requiring multilingual speech translation, including international communication platforms, content localization services, and cross-language media processing systems.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.