wav2vec2-base-100k-gtzan-music-genres
Property | Value |
---|---|
Author | m3hrdadfi |
Framework | PyTorch |
Task | Audio Classification |
Overall Accuracy | 77.5% |
What is wav2vec2-base-100k-gtzan-music-genres?
This is a specialized audio classification model built on the Wav2Vec 2.0 architecture, designed specifically for music genre classification. The model can classify music into 10 distinct genres including blues, classical, country, disco, hiphop, jazz, metal, pop, reggae, and rock.
Implementation Details
The model utilizes the Wav2Vec 2.0 feature extractor and requires PyTorch for implementation. It processes audio inputs and provides probability scores for each music genre category. The implementation includes resampling capabilities to ensure compatibility with various input formats.
- Built on Wav2Vec 2.0 architecture
- Supports 10 music genres
- Includes automatic audio resampling
- Provides probability scores for each genre
Core Capabilities
- Highest accuracy for Jazz classification (100% precision)
- Strong performance in Classical music (90.5% F1-score)
- Effective genre disambiguation with probability scoring
- Real-time audio processing and classification
Frequently Asked Questions
Q: What makes this model unique?
The model combines the powerful Wav2Vec 2.0 architecture with specialized training for music genre classification, achieving particularly high accuracy in jazz and classical genres while maintaining robust performance across all categories.
Q: What are the recommended use cases?
This model is ideal for music library organization, automated playlist creation, music recommendation systems, and research applications requiring genre classification. It's particularly effective for classical and jazz music identification.