The Advanced Music and Voice Generator is a cutting-edge project that combines generative AI tools (GAITs) to create music, generate lyrics, and clone voices. This Flask-based application offers a user-friendly interface where users can input prompts and receive fully composed songs with lyrics and voice overlays.

- Lyrics Generation: Dynamic and theme-specific lyrics using GPT-4o Mini.
- Music Generation: Instrumental music creation leveraging Suno AI via Ace Data Cloud.
- Voice Cloning: Custom voice overlays generated using the ElevenLabs API.
- Interactive Web Interface: Simple and intuitive design with Bootstrap for styling.
- Backend: Python, Flask
- Frontend: HTML, CSS, JavaScript (Bootstrap for styling)
- GAITs:
- GPT-4o Mini for lyrics generation
- Suno AI (via Ace Data Cloud) for music generation
- ElevenLabs for voice cloning
- Libraries: Pydub, ReportLab
- User provides input prompt with details such as mood, genre, tempo, and instruments.
- Lyrics are generated using GPT-4o Mini.
- Music is composed based on the input using Suno AI via Ace Data Cloud.
- Voice cloning is performed using ElevenLabs API to overlay vocals.
- Final outputs are integrated and presented to the user for download or playback.
- Clone the repository:
git clone https://github.com/yourusername/AI-Driven-Music-Composition-Suite.git
- Navigate to the project directory:
cd AI-Driven-Music-Composition-Suite
- Install the required dependencies:
pip install -r requirements.txt
- Set up API keys:
- Add your OpenAI, Ace Data Cloud, and ElevenLabs API keys in a
.env
file:OPENAI_API_KEY=your_openai_api_key ACE_DATA_API_KEY=your_ace_data_api_key ELEVENLABS_API_KEY=your_elevenlabs_api_key
- Add your OpenAI, Ace Data Cloud, and ElevenLabs API keys in a
- Run the application:
python main.py
- Open your browser and navigate to
http://127.0.0.1:5000
.
- Generate Lyrics: Enter a theme or prompt to create custom song lyrics.
- Create Music: Select mood, genre, and instruments to generate a matching track.
- Clone Voice: Upload or use preloaded audio to add vocals to the track.
- Preview and Download: Listen to the output and download the lyrics, music, or vocalized track.
- API Access: Suno AI’s API required integration via Ace Data Cloud.
- Audio Synchronization: Aligning music and vocals needed precise timing adjustments using Pydub.
- Ethical Considerations: Ensuring the system adheres to fair use and copyright guidelines.
- Add real-time generation capabilities for enhanced user experience.
- Incorporate more GAITs for diverse musical styles and voices.
- Optimize performance for faster processing and rendering.
- GPT-4o Mini for creative and dynamic lyrics generation.
- Suno AI for versatile music generation capabilities.
- ElevenLabs for realistic and customizable voice cloning.
Author: Likith Kadiyala
Feel free to reach out for collaborations or feedback!