whisper-large-v2-cv11-german

whisper-large-v2-cv11-german

bofenghuang

German ASR model fine-tuned from Whisper-large-v2, achieving 5.76 WER on Common Voice 11.0, optimized for 16kHz audio with 1.55B parameters.

PropertyValue
Parameter Count1.55B
LicenseApache 2.0
LanguageGerman
WER Score5.76%

What is whisper-large-v2-cv11-german?

This is a specialized German Automatic Speech Recognition (ASR) model, fine-tuned from OpenAI's Whisper Large V2 architecture on the Mozilla Common Voice 11.0 dataset. The model represents a significant improvement over the base Whisper implementation for German speech recognition, achieving a Word Error Rate (WER) of 5.76% compared to the original model's 6.4% on Common Voice.

Implementation Details

The model is built on the powerful Whisper Large V2 architecture, utilizing a sequence-to-sequence approach for speech recognition. It's specifically optimized for 16kHz audio input and includes capabilities for predicting proper casing and punctuation in transcriptions.

  • Transformer-based sequence-to-sequence architecture
  • Optimized for German language processing
  • Supports 16kHz audio input
  • Includes automatic punctuation and casing

Core Capabilities

  • High-accuracy German speech transcription
  • Automatic punctuation and capitalization
  • Support for both greedy and beam search decoding
  • Flexible integration through Hugging Face Transformers

Frequently Asked Questions

Q: What makes this model unique?

This model stands out through its superior performance on German ASR tasks, achieving a 5.76% WER on Common Voice 11.0, making it significantly more accurate than smaller variants and the base model. It includes built-in support for punctuation and casing, which many ASR models don't provide.

Q: What are the recommended use cases?

The model is ideal for production-grade German speech recognition tasks requiring high accuracy, such as transcription services, subtitle generation, and voice command systems. It's particularly suitable when proper punctuation and capitalization are important.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026