musicgen-melody

musicgen-melody

facebook

MusicGen Melody (1.56B params) - Facebook's controllable AI music generator supporting text-to-audio and melody-guided generation at 32kHz with 4 codebooks

PropertyValue
Parameter Count1.56B
LicenseCC-BY-NC 4.0
AuthorFacebook (Meta AI)
PaperSimple and Controllable Music Generation

What is musicgen-melody?

MusicGen Melody is a sophisticated AI model developed by Meta AI's FAIR team that generates music based on text descriptions or melody guidance. It's built on a single-stage autoregressive Transformer architecture that processes audio at 32kHz using an EnCodec tokenizer with 4 codebooks sampled at 50 Hz. Unlike other models, it can generate all codebooks in parallel, requiring only 50 autoregressive steps per second of audio.

Implementation Details

The model utilizes a unique architecture that combines an EnCodec model for audio tokenization with an autoregressive language model based on transformer architecture. Operating at 32kHz, it processes audio through 4 codebooks while maintaining high-quality output through parallel prediction capabilities.

  • 1.56B parameter model specifically tuned for melody-guided generation
  • Supports both text-to-music and melody-guided generation
  • Processes audio at 32kHz with 50Hz sampling rate
  • Implements parallel codebook prediction for efficient generation

Core Capabilities

  • Text-to-music generation with detailed control
  • Melody-guided music creation
  • Support for various musical styles and genres
  • Generation of instrumental music up to specified durations
  • High-quality audio output at 32kHz

Frequently Asked Questions

Q: What makes this model unique?

MusicGen Melody stands out for its ability to generate music without requiring self-supervised semantic representation, unlike competitors such as MusicLM. It can process all 4 codebooks in a single pass while supporting both text and melody-guided generation.

Q: What are the recommended use cases?

The model is primarily intended for research in AI-based music generation, including academic studies and explorations by machine learning enthusiasts. It's particularly suited for generating instrumental music and should not be used for commercial applications without proper licensing and risk evaluation.

Related Models

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026