Generating Voice-Over Audio for YouTube Videos Based on the YouTube Title
This project is a Streamlit application that leverages various AI models to generate YouTube scripts and voiceovers. It aims to assist content creators in producing high-quality YouTube videos with minimal effort.
- No ElevenLabs feature in this link
- Title: what is phenomenon beyond northern lights --- Demo Sample Link
- Clone the repository:
    git clone https://github.com/your-username/AI-YouTube-Voice-Over-Generator.git
    cd AI-YouTube-Voice-Over-Generator
- Install the required Python packages:
    pip install -r requirements.txt
- Run the Streamlit application:
    streamlit run app.py
- Gemini Models: Enter your Gemini API key in the sidebar if you are using Gemini-based text generation models.
- Groq Models: Enter your Groq API key in the sidebar if you are using Groq-based text generation models.
- ElevenLabs TTS: Enter your ElevenLabs API key in the sidebar if you select the ElevenLabs text-to-speech model.
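The key requirements above depend on which models are selected. A minimal sketch of that check follows, assuming hypothetical provider labels (`"gemini"`, `"groq"`, `"elevenlabs"`) and key names that are not taken from the app's source; the real application may organize this differently.

```python
from typing import Dict, List

# Hypothetical provider labels and key names, chosen for illustration only.
KEY_FOR_TEXT_PROVIDER = {"gemini": "GEMINI_API_KEY", "groq": "GROQ_API_KEY"}
KEY_FOR_TTS_PROVIDER = {"elevenlabs": "ELEVENLABS_API_KEY"}


def missing_keys(text_provider: str, tts_provider: str, keys: Dict[str, str]) -> List[str]:
    """Return the names of API keys the current selection needs but does not have."""
    required = []
    if text_provider in KEY_FOR_TEXT_PROVIDER:
        required.append(KEY_FOR_TEXT_PROVIDER[text_provider])
    if tts_provider in KEY_FOR_TTS_PROVIDER:
        required.append(KEY_FOR_TTS_PROVIDER[tts_provider])
    # A key counts as missing when it is absent or contains only whitespace.
    return [name for name in required if not keys.get(name, "").strip()]


if __name__ == "__main__":
    # A Groq text model with ElevenLabs TTS, where only the Groq key was entered.
    print(missing_keys("groq", "elevenlabs", {"GROQ_API_KEY": "example-value"}))
```

In a Streamlit app, the `keys` dictionary could be filled from sidebar inputs such as `st.sidebar.text_input("Groq API key", type="password")`, and a non-empty result could be shown as a warning before any generation starts.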
- Enter Video Title: Input the title for your YouTube video.
- Set Video Length: Specify the desired video length in minutes.
- Generate Script: Click the "Generate Script" button.
- Edit Script: Modify the generated script if needed.
- Select TTS Model: Choose a TTS model in the sidebar.
- Generate Audio: Click the "Generate Audio" button after the script is ready.
- Download Audio: Listen to and download the generated audio.
- Regenerate Script: Click to create a new script if desired.
- Edit Script: Update the script and convert it to audio.
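The first steps of the workflow above (title and length in, script request out) can be sketched as a small prompt builder. This is an illustration only: the function names, the prompt wording, and the narration pace of 150 words per minute are assumptions, not details taken from the app.

```python
WORDS_PER_MINUTE = 150  # assumption: a typical narration pace, not the app's value


def target_word_count(minutes: float, wpm: int = WORDS_PER_MINUTE) -> int:
    """Estimate how many words of narration fill the requested video length."""
    if minutes <= 0:
        raise ValueError("minutes must be positive")
    return round(minutes * wpm)


def build_script_prompt(title: str, minutes: float) -> str:
    """Build a text-generation prompt from the video title and length (hypothetical wording)."""
    title = title.strip()
    if not title:
        raise ValueError("title must not be empty")
    words = target_word_count(minutes)
    return (
        f'Write a voice-over script for a YouTube video titled "{title}". '
        f"Aim for about {words} words (roughly {minutes:g} minutes of narration). "
        "Return only the spoken narration, without stage directions."
    )


if __name__ == "__main__":
    print(build_script_prompt("what is phenomenon beyond northern lights", 3))
```

Tying the word target to the requested minutes gives the text model a concrete length to aim for, which helps the generated audio land near the desired duration.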
Contributions are welcome! Please open an issue or submit a pull request if you have suggestions for improvements or new features.
- mrfakename, for freely hosting the MeloTTS model. Without their contribution, it would not be possible to provide a free TTS service to end users.
- Local Text Generation: Implement the use of local text generation models to enhance performance and reduce dependency on external APIs.
- Local TTS Models: Integrate local TTS models for better audio generation and faster processing times.
- Voice Cloning: Allow users to choose from a variety of voices for TTS, including options for voice cloning based on user preferences.