whisper-large-czech-cv11

whisper-large-czech-cv11

mikr

A fine-tuned Whisper Large model optimized for Czech speech recognition, achieving 10.83% WER on Common Voice 11.0, trained with linear learning rate scheduling and mixed-precision.

PropertyValue
Base ModelOpenAI Whisper Large
LicenseApache 2.0
WER Score10.83%
Training DatasetCommon Voice 11.0 (Czech)

What is whisper-large-czech-cv11?

Whisper Large Czech CV11 is a specialized speech recognition model fine-tuned from OpenAI's Whisper Large architecture specifically for the Czech language. The model demonstrates impressive performance with a Word Error Rate (WER) of 10.83% on the Common Voice 11.0 Czech test set, making it particularly effective for Czech speech recognition tasks.

Implementation Details

The model was trained using a sophisticated approach with mixed-precision training (Native AMP) and carefully tuned hyperparameters. The training process utilized the Adam optimizer with beta values (0.9, 0.999) and employed a linear learning rate scheduler with 500 warmup steps over 5000 total training steps.

  • Batch size configuration: 8 training, 4 evaluation (64 total with gradient accumulation)
  • Learning rate: 1e-05
  • Gradient accumulation steps: 8
  • Training duration: 21.04 epochs

Core Capabilities

  • Specialized Czech language speech recognition
  • Production-ready performance with 10.83% WER
  • Optimized for Common Voice dataset processing
  • Supports real-time transcription applications

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its specific optimization for Czech language speech recognition, achieving a notably low WER of 10.83%. The careful fine-tuning process, documented through training steps showing consistent improvement from 11.96% to 10.83% WER, demonstrates its reliability and effectiveness.

Q: What are the recommended use cases?

The model is ideal for Czech language speech recognition tasks, including transcription services, subtitle generation, and voice command systems. Its Apache 2.0 license makes it suitable for both commercial and non-commercial applications.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026