---
title: Music Generation Studio
emoji: 🎵
colorFrom: purple
colorTo: pink
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit
---

# 🎵 Music Generation Studio

Create AI-powered music with intelligent prompt analysis and context-aware generation using DiffRhythm2 and LyricMind AI.

**⚠️ Important:**
- This Space requires ZeroGPU to run
- **You must be logged in** to Hugging Face to use GPU features
- Free users get a daily ZeroGPU quota; check your usage at https://huggingface.co/settings/billing
- If you see quota errors while logged in, try duplicating this Space to your own account

## Features

- **Intelligent Music Generation**: DiffRhythm2 model for high-quality music with vocals
- **Smart Lyrics Generation**: LyricMind AI for context-aware lyric creation
- **Prompt Analysis**: Automatically detects genre, BPM, and mood from your description
- **Flexible Vocal Modes**:
  - Instrumental: Pure music without vocals
  - User Lyrics: Provide your own lyrics
  - Auto Lyrics: AI-generated lyrics based on prompt
- **Timeline Management**: Build complete songs clip-by-clip
- **Export**: Download your creations in WAV, MP3, or FLAC formats
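
To illustrate the prompt analysis feature listed above, here is a minimal sketch of how genre, BPM, and mood could be pulled out of a description with a regular expression and keyword lookups. The function name `analyze_prompt` and the keyword tables are assumptions made for this example; the app's actual analyzer may work differently.

```python
import re

# Hypothetical keyword tables; the app's real vocabulary is not documented here.
GENRES = ("rock", "jazz", "pop", "hip hop", "classical", "electronic")
MOODS = ("energetic", "calm", "sad", "happy", "dark", "uplifting")


def analyze_prompt(prompt: str) -> dict:
    """Extract genre, BPM, and mood from a free-text prompt (illustrative only)."""
    text = prompt.lower()
    # Match a 2-3 digit number followed by "bpm", e.g. "140 BPM" or "140bpm".
    bpm_match = re.search(r"\b(\d{2,3})\s*bpm\b", text)
    return {
        # Plain substring checks: simple, but "pop" would also match "popular".
        "genre": next((g for g in GENRES if g in text), None),
        "bpm": int(bpm_match.group(1)) if bpm_match else None,
        "mood": next((m for m in MOODS if m in text), None),
    }


print(analyze_prompt("energetic rock song with electric guitar at 140 BPM"))
```

Fields that cannot be detected come back as `None`, so a caller can fall back to defaults instead of guessing.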

## How to Use

1. **Generate Music**:
   - Enter a descriptive prompt (e.g., "energetic rock song with electric guitar at 140 BPM")
   - Choose vocal mode (Instrumental, User Lyrics, or Auto Lyrics)
   - Set duration (10-120 seconds)
   - Click "Generate Music Clip"

2. **Manage Timeline**:
   - View all generated clips in the timeline
   - Remove specific clips or clear all
   - Clips are arranged sequentially

3. **Export**:
   - Enter a filename
   - Choose format (WAV recommended for best quality)
   - Download your complete song
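
The timeline steps above can be pictured with a small data structure. This is a sketch under assumptions, not the app's actual code: the `Clip` and `Timeline` classes are hypothetical, and only the 10-120 second range and the sequential arrangement come from the steps above.

```python
from dataclasses import dataclass, field

MIN_SECONDS, MAX_SECONDS = 10, 120  # duration range from the steps above


@dataclass
class Clip:
    prompt: str
    duration: float  # seconds


@dataclass
class Timeline:
    clips: list = field(default_factory=list)

    def add(self, clip: Clip) -> None:
        # Reject clips outside the supported duration range.
        if not MIN_SECONDS <= clip.duration <= MAX_SECONDS:
            raise ValueError(f"duration must be {MIN_SECONDS}-{MAX_SECONDS} seconds")
        self.clips.append(clip)

    def remove(self, index: int) -> None:
        del self.clips[index]

    def clear(self) -> None:
        self.clips.clear()

    def start_times(self) -> list:
        """Start offset of each clip when clips play back to back."""
        starts, offset = [], 0.0
        for clip in self.clips:
            starts.append(offset)
            offset += clip.duration
        return starts


timeline = Timeline()
timeline.add(Clip("intro", 20))
timeline.add(Clip("verse", 30))
timeline.add(Clip("chorus", 25))
timeline.remove(1)  # drop the verse; the chorus now follows the intro
print(timeline.start_times())
```

Because start offsets are recomputed from clip durations, removing a clip closes the gap automatically and later clips shift earlier.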

## Models

- **DiffRhythm2**: Music generation with integrated vocals ([ASLP-lab/DiffRhythm2](https://huggingface.co/ASLP-lab/DiffRhythm2))
- **MuQ-MuLan**: Music style encoding ([OpenMuQ/MuQ-MuLan-large](https://huggingface.co/OpenMuQ/MuQ-MuLan-large))

## Performance

⏱️ Generation time: roughly 2-4 minutes per 30-second clip when running on CPU (Hugging Face Spaces free tier)

💡 Tip: Start with shorter durations (10-20 seconds) for faster results
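
As a rough worked example of the figures above, the helper below scales the 2-4 minute range to other clip lengths. It assumes generation time grows linearly with clip duration, which is an assumption for illustration and not a measured property of the model; `estimate_generation_minutes` is a hypothetical name.

```python
def estimate_generation_minutes(clip_seconds: float) -> tuple:
    """Rough (low, high) CPU estimate, assuming time scales linearly with length."""
    low_per_30s, high_per_30s = 2.0, 4.0  # minutes per 30-second clip, from above
    scale = clip_seconds / 30.0
    return (low_per_30s * scale, high_per_30s * scale)


for seconds in (10, 15, 30):
    low, high = estimate_generation_minutes(seconds)
    print(f"{seconds}s clip: about {low:.1f}-{high:.1f} minutes")
```

Under that assumption, a 15-second clip would take about half as long as a 30-second one, which is why shorter durations are a good starting point.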

## Technical Details

- Built with Gradio and PyTorch
- Uses DiffRhythm2 for music generation with vocals
- Employs flow-matching techniques for high-quality audio synthesis
- Supports multiple languages for lyrics (English, Chinese, Japanese)
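
For readers unfamiliar with flow matching, the toy sketch below shows the sampling idea in one dimension: start from a noise value and follow a velocity field from t = 0 to t = 1 with Euler steps. It is not DiffRhythm2's implementation. In a real model the velocity is predicted by a neural network conditioned on the prompt and style; here a closed-form velocity toward a known `target` stands in for it, and all names are hypothetical.

```python
def conditional_velocity(x: float, t: float, target: float) -> float:
    """Velocity that moves x toward `target` along a straight path ending at t = 1."""
    return (target - x) / (1.0 - t)


def euler_sample(x0: float, target: float, steps: int = 8) -> float:
    """Integrate dx/dt = v(x, t) from t = 0 to t = 1 with fixed Euler steps."""
    x, dt = x0, 1.0 / steps
    for k in range(steps):
        t = k * dt  # t stays below 1, so the division above is safe
        x += dt * conditional_velocity(x, t, target)
    return x


# Start from a "noise" value and transport it to the "data" value.
print(euler_sample(x0=-1.5, target=0.75))
```

With this straight-line velocity field the sample lands on the target up to floating-point error; a learned velocity field only approximates this behavior, which is why step count affects quality in practice.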

## Credits

- DiffRhythm2 by ASLP-lab
- MuQ-MuLan by OpenMuQ
- Application interface and integration by Music Generation App Team

## License

MIT License. See the LICENSE file for details.