AudioProcess

AudioProcess is a powerful YouTube audio processing tool that can download audio from YouTube videos, extract subtitles, transcribe content, and generate summaries.

Features

YouTube Audio Downloading: Download audio from YouTube videos in WebM format
Subtitle Extraction: Extract subtitles from YouTube videos when available
Audio Transcription: Transcribe audio using Alibaba Cloud's speech recognition service
Text Summarization: Generate summaries of subtitle/transcription content using large language models
Telegram Bot Integration: Two Telegram bots for convenient audio downloading and content summarization

System Requirements

Python 3.6+
Required dependencies (see below)

Dependencies

Main dependencies include:

yt-dlp (YouTube downloader)
oss2 (Alibaba Cloud OSS)
dashscope (Alibaba Cloud AI services)
python-telegram-bot (Telegram bot integration)
openai (API for summarization)
httpx (with SOCKS proxy support)

Installation

Clone the repository
Install the required dependencies:
```
pip install -r requirements.txt
```

Configuration

Before using the application, you need to set up your configuration:

Configure API keys for cloud services in audioprocess/config/settings.py
Set up your Telegram bot tokens (if using the Telegram bot features)
Configure proxy settings if needed

Usage

Starting the Bots

Use the start.sh script to start both bots (Audio Download Bot and Text Summary Bot):

./start.sh

This will launch:

Audio Download Bot: Downloads audio from YouTube videos when given a URL
Text Summary Bot: Extracts subtitles or transcribes audio from YouTube videos and generates summaries

Manual Operation

You can also use the core functionality directly:

from audioprocess.main import process_youtube_video

# Process a YouTube video (extract subtitles/transcribe and summarize)
result = process_youtube_video("https://www.youtube.com/watch?v=VIDEO_ID")

Bot Functionality

Audio Download Bot

Accepts YouTube URLs
Downloads audio in the best available quality
Sends the audio file back to the user via Telegram

Text Summary Bot

Accepts YouTube URLs
Extracts subtitles if available or downloads and transcribes the audio
Generates and sends back a summary of the content

Project Structure

audioprocess/core/: Core functionality modules
- youtube_downloader.py: Audio downloading from YouTube
- subtitle_extractor.py: YouTube subtitle extraction
- transcription.py: Audio transcription
- summarization.py: Text summarization
- oss_uploader.py: File uploading to Alibaba Cloud OSS
audioprocess/scripts/: Bot and utility scripts
- start_audio_bot.py: Audio Download Bot script
- start_summary_bot.py: Text Summary Bot script
audioprocess/utils/: Utility functions
audioprocess/config/: Configuration files

Troubleshooting

Proxy Issues

The system can use system-defined proxies or a default proxy if needed
For SOCKS proxies, ensure httpx[socks] is installed

Telegram Bot Problems

Verify your bot tokens are correct
Ensure your user ID is in the allowed users list if access is restricted

License

This project is for personal use.

Credits

Developed by CC.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
audioprocess		audioprocess
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
ali_cloud_oss.py		ali_cloud_oss.py
main.py		main.py
requirements.txt		requirements.txt
setup.py		setup.py
start.sh		start.sh
youtube_audio_downloader.py		youtube_audio_downloader.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AudioProcess

Features

System Requirements

Dependencies

Installation

Configuration

Usage

Starting the Bots

Manual Operation

Bot Functionality

Audio Download Bot

Text Summary Bot

Project Structure

Troubleshooting

Proxy Issues

Telegram Bot Problems

License

Credits

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

junncao/audioProcess

Folders and files

Latest commit

History

Repository files navigation

AudioProcess

Features

System Requirements

Dependencies

Installation

Configuration

Usage

Starting the Bots

Manual Operation

Bot Functionality

Audio Download Bot

Text Summary Bot

Project Structure

Troubleshooting

Proxy Issues

Telegram Bot Problems

License

Credits

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages