Whisper Transcription UI

Overview

Whisper Transcription UI is a user-friendly graphical user interface (GUI) for the whisper-standalone-win tool.

This intuitive application simplifies audio and video transcription and translation using various Whisper models. Customize settings to your liking and save them for future use.

✨ Features

Effortless File Handling: Browse, select, paste, or drag and drop multiple audio and video files.
Direct URL Input: Transcribe audio from online sources by providing the URL.
Flexible Transcription Options:
- Select the target language.
- Choose the Whisper model that best suits your needs.
- Transcribe or translate with ease.
- Define your preferred output format.
Advanced Customization: Fine-tune transcription parameters like FF MDX Kim2, VAD filter, word timestamps, temperature, and beam size.
Progress Monitoring: Keep track of the transcription process.
Persistent Settings: Save your preferred transcription and advanced settings.
Detailed Logging: Enable logging to monitor the transcription process and troubleshoot any issues.

🚀 Getting Started

Prerequisites

Whisper Standalone: Download and install the latest release.
Python 3.x

Installation

Clone the repository:

git clone https://github.com/Ognisty321/whisper-transcription-ui.git
cd whisper-transcription-ui

Install required packages:
```
pip install PyQt6 yt-dlp
```
Ensure faster-whisper-xxl.exe is available:
- Option 1: Place faster-whisper-xxl.exe in the same directory as main.py.
- Option 2: Specify the path to faster-whisper-xxl.exe in the config.ini file (see the note below about creating config.ini):
```
[Settings]
exe_path = path/to/faster-whisper-xxl.exe
```
Launch the application:
```
python main.py
```

Important Note:
The config.ini file is automatically created when you run python main.py for the first time. Once it is created, you can then edit the exe_path (or other settings) as needed. Attempting to manually create the config.ini file before running the application may lead to errors.

🎬 Usage

Select Files: Click Browse to choose files, drag and drop them into the interface, or paste file paths/URLs.
Set Output Directory: Specify where transcribed files should be saved.
Choose Options: Configure transcription language, model, task (transcribe/translate), output format, and other options.
Advanced Options: Fine-tune your transcription using advanced features and parameters.
Transcribe: Initiate the transcription process by clicking the Transcribe button.
Save Settings: Preserve your settings for future sessions using the Save Settings button.

⚙️ Configuration

The application uses a config.ini file to store your settings. As mentioned, this file is automatically created in the application directory when you first run main.py. Update the path to faster-whisper-xxl.exe (or any other setting) in config.ini after the file is generated.

🙏 Acknowledgments

This project wouldn't be possible without whisper-standalone-win. A big thank you to its developers for their exceptional work!

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

📞 Contact

Have questions or suggestions? Don't hesitate to reach out to Ognisty321.

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
LICENSE		LICENSE
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Whisper Transcription UI

Overview

✨ Features

🚀 Getting Started

Prerequisites

Installation

🎬 Usage

⚙️ Configuration

🙏 Acknowledgments

📄 License

📞 Contact

About

Uh oh!

Releases 3

Packages

Languages

License

Ognisty321/Whisper-Transcription-UI

Folders and files

Latest commit

History

Repository files navigation

Whisper Transcription UI

Overview

✨ Features

🚀 Getting Started

Prerequisites

Installation

🎬 Usage

⚙️ Configuration

🙏 Acknowledgments

📄 License

📞 Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages