wav2vec2-large-xlsr-53-faroese-100h

Property	Value
License	cc-by-4.0
Paper	ASR Language Resources for Faroese
WER (Test)	7.6%
WER (Dev)	5.5%

What is wav2vec2-large-xlsr-53-faroese-100h?

This is an automatic speech recognition (ASR) model for the Faroese language. It was developed by fine-tuning the facebook/wav2vec2-large-xlsr-53 model on 100 hours of Faroese speech data from the Ravnur Project. It reaches a word error rate (WER) of 7.6% on the test set and 5.5% on the development set.

Implementation Details

The model was fine-tuned at the Language and Voice Lab at Reykjavík University using the Ravnursson Faroese Speech and Transcripts dataset. It uses the wav2vec2 architecture. The XLSR-53 base checkpoint was pretrained on multilingual speech, which makes it a common starting point for low-resource languages like Faroese.

Base Architecture: wav2vec2-large-xlsr-53
Training Data: 100 hours of Faroese audio
Sampling Rate: 16 kHz
Evaluation Metric: Word Error Rate (WER)
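
To make the evaluation metric concrete, here is a minimal sketch of how WER is commonly computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` is illustrative, and whitespace tokenization without any text normalization is an assumption. This is not the evaluation script behind the figures reported above.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# One substitution in a four-word reference gives a WER of 0.25.
print(word_error_rate("a b c d", "a b x d"))
```

Published WER figures usually depend on how the text is normalized (casing, punctuation) before scoring, so results from this sketch may not be directly comparable to the numbers in the table.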

Core Capabilities

Automatic speech recognition for the Faroese language
7.6% WER on the test set (5.5% on the dev set)
Support for 16 kHz audio input
Batch processing capabilities
Integration with Hugging Face Transformers library
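
The capabilities listed above can be sketched as a small batch transcription helper built on the Transformers library. This is a hedged example, not the authors' reference code: the full Hub identifier in `MODEL_ID` is an assumption and should be checked against the model page, the helper names `batched` and `transcribe` are hypothetical, and `librosa` is just one way to load and resample audio to 16 kHz.

```python
from typing import Iterator, List

# Assumption: full Hugging Face Hub identifier; verify it on the model page.
MODEL_ID = "carlosdanielhernandezmena/wav2vec2-large-xlsr-53-faroese-100h"
TARGET_SR = 16_000  # the model expects 16 kHz input


def batched(items: List[str], batch_size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of `items` with at most `batch_size` elements."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def transcribe(paths: List[str], batch_size: int = 4,
               model_id: str = MODEL_ID) -> List[str]:
    """Transcribe audio files in batches with greedy CTC decoding."""
    # Heavy dependencies are imported lazily so the helpers above stay light.
    import librosa
    import torch
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)
    model.eval()

    transcripts: List[str] = []
    for batch in batched(paths, batch_size):
        # librosa.load resamples to TARGET_SR and returns mono audio by default.
        waves = [librosa.load(path, sr=TARGET_SR)[0] for path in batch]
        inputs = processor(waves, sampling_rate=TARGET_SR,
                           return_tensors="pt", padding=True)
        with torch.no_grad():
            logits = model(**inputs).logits
        predicted_ids = torch.argmax(logits, dim=-1)
        transcripts.extend(processor.batch_decode(predicted_ids))
    return transcripts
```

Calling `transcribe(["clip1.wav", "clip2.wav"])` (hypothetical file names) would download the checkpoint on first use and return one transcript per file. Padding lets clips of different lengths share a batch, and a smaller `batch_size` reduces memory use on long recordings.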

Frequently Asked Questions

Q: What makes this model unique?

This model is one of the first ASR models trained specifically for Faroese to report a low error rate (7.6% WER on the test set), which makes it a useful resource for Faroese language technology development.

Q: What are the recommended use cases?

The model is suited to Faroese speech transcription, language documentation efforts, and Faroese-language applications that need speech recognition. Input audio should be sampled at 16 kHz.