Belle-whisper-large-v3-turbo-zh

Belle-whisper-large-v3-turbo-zh

BELLE-2

Fine-tuned Whisper model optimized for Chinese ASR with 24-64% performance improvement over base model. Supports high-accuracy speech recognition with integrated punctuation.

PropertyValue
LicenseApache 2.0
Base Modelopenai/whisper-large-v3-turbo
Pipeline TagAutomatic Speech Recognition
FrameworkPyTorch & Transformers

What is Belle-whisper-large-v3-turbo-zh?

Belle-whisper-large-v3-turbo-zh is an advanced Chinese speech recognition model that builds upon OpenAI's Whisper architecture. It's specifically fine-tuned to enhance Chinese ASR capabilities, achieving remarkable improvements of 24-64% over the base model across various Chinese speech recognition benchmarks.

Implementation Details

The model has been fine-tuned using multiple prestigious Chinese speech datasets, including AISHELL-1, AISHELL-2, WenetSpeech, and HKUST. It incorporates sophisticated punctuation handling through integration with the punc_ct-transformer model, ensuring more natural and readable transcriptions.

  • Supports 16KHz audio input
  • Implements full fine-tuning approach
  • Includes automatic punctuation integration
  • Optimized for Chinese language processing

Core Capabilities

  • Achieves 3.07% CER on AISHELL-1 test set
  • Demonstrates superior performance on meeting transcriptions with 13.357% CER
  • Handles various Chinese speech contexts effectively
  • Seamless integration with Transformers pipeline

Frequently Asked Questions

Q: What makes this model unique?

The model stands out for its significant performance improvements over the base Whisper model, particularly in Chinese speech recognition. It achieves this through comprehensive fine-tuning on multiple Chinese speech datasets and integration of advanced punctuation handling.

Q: What are the recommended use cases?

The model is ideal for Chinese speech transcription tasks, particularly in scenarios requiring high accuracy such as meeting transcriptions, general speech recognition, and applications requiring precise Chinese language processing with proper punctuation.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026