
Whisper Transcription UI is a user-friendly graphical user interface (GUI) for the whisper-standalone-win tool.
This intuitive application simplifies audio and video transcription and translation using various Whisper models. Customize settings to your liking and save them for future use.
- Effortless File Handling: Browse, select, paste, or drag and drop multiple audio and video files.
- Direct URL Input: Transcribe audio from online sources by providing the URL.
- Flexible Transcription Options:
- Select the target language.
- Choose the Whisper model that best suits your needs.
- Transcribe or translate with ease.
- Define your preferred output format.
- Advanced Customization: Fine-tune transcription parameters like FF MDX Kim2, VAD filter, word timestamps, temperature, and beam size.
- Progress Monitoring: Keep track of the transcription process.
- Persistent Settings: Save your preferred transcription and advanced settings.
- Detailed Logging: Enable logging to monitor the transcription process and troubleshoot any issues.
- Whisper Standalone: Download and install the latest release.
- Python 3.x
-
Clone the repository:
git clone https://github.com/Ognisty321/whisper-transcription-ui.git cd whisper-transcription-ui
-
Install required packages:
pip install PyQt6 yt-dlp
-
Ensure
faster-whisper-xxl.exe
is available:- Option 1: Place
faster-whisper-xxl.exe
in the same directory asmain.py
. - Option 2: Specify the path to
faster-whisper-xxl.exe
in theconfig.ini
file (see the note below about creatingconfig.ini
):[Settings] exe_path = path/to/faster-whisper-xxl.exe
- Option 1: Place
-
Launch the application:
python main.py
Important Note:
Theconfig.ini
file is automatically created when you runpython main.py
for the first time. Once it is created, you can then edit theexe_path
(or other settings) as needed. Attempting to manually create theconfig.ini
file before running the application may lead to errors.
- Select Files: Click
Browse
to choose files, drag and drop them into the interface, or paste file paths/URLs. - Set Output Directory: Specify where transcribed files should be saved.
- Choose Options: Configure transcription language, model, task (transcribe/translate), output format, and other options.
- Advanced Options: Fine-tune your transcription using advanced features and parameters.
- Transcribe: Initiate the transcription process by clicking the
Transcribe
button. - Save Settings: Preserve your settings for future sessions using the
Save Settings
button.
The application uses a config.ini
file to store your settings. As mentioned, this file is automatically created in the application directory when you first run main.py
. Update the path to faster-whisper-xxl.exe
(or any other setting) in config.ini
after the file is generated.
This project wouldn't be possible without whisper-standalone-win. A big thank you to its developers for their exceptional work!
This project is licensed under the MIT License - see the LICENSE
file for details.
Have questions or suggestions? Don't hesitate to reach out to Ognisty321.