whisper-v3-tiny-random

Maintained By
yujiepan

whisper-v3-tiny-random

PropertyValue
Authoryujiepan
Base ModelWhisper-large-v3
Model TypeSpeech Recognition
HuggingFace URLLink

What is whisper-v3-tiny-random?

whisper-v3-tiny-random is a specialized debugging model derived from OpenAI's Whisper-large-v3 architecture. It features a dramatically reduced architecture size while maintaining the same configuration structure, making it ideal for testing and development purposes.

Implementation Details

The model implements a minimalist version of the Whisper architecture with the following specifications: 2 hidden layers, 8-dimensional model size, 2 attention heads in both encoder and decoder, and 16-dimensional feed-forward networks. All parameters are randomly initialized within a uniform distribution between -0.5 and 0.5.

  • Reduced architecture: 2 encoder and decoder layers
  • 8-dimensional model size (d_model)
  • 2 attention heads in both encoder and decoder
  • 16-dimensional feed-forward networks
  • Random initialization with seed 42

Core Capabilities

  • Automatic speech recognition with timestamp support
  • Compatible with Hugging Face's pipeline architecture
  • Supports half-precision (float16) computation
  • Includes feature extraction and tokenization capabilities

Frequently Asked Questions

Q: What makes this model unique?

This model's primary purpose is debugging and development. Its uniqueness lies in maintaining the Whisper-v3 architecture while dramatically reducing its size and using random initialization, making it perfect for testing pipelines and implementations.

Q: What are the recommended use cases?

The model is specifically designed for debugging purposes, pipeline testing, and development workflows. It should not be used for production speech recognition tasks as its parameters are randomly initialized.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.