A Windows GUI application for transcribing audio files using OpenAI's Whisper speech recognition model locally on your machine.
- Transcribe audio files (MP3, WAV, M4A, FLAC, OGG, MP4)
- Select from different Whisper model sizes (tiny, base, small, medium, large)
- Choose between CPU and GPU processing (GPU requires CUDA to be available)
- Progress indicator during transcription
- View raw transcript results
- Convert to Markdown with live preview
- Export as raw text or Markdown files
- Custom save location
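
The model-size and device options above map naturally onto Whisper's Python API. The following is a minimal sketch, not the application's actual code: `pick_device` and `load_whisper_model` are hypothetical helper names, and the sketch assumes the `torch` and `whisper` packages from `requirements.txt` are installed.

```python
MODEL_SIZES = ("tiny", "base", "small", "medium", "large")


def pick_device(prefer_gpu: bool = True) -> str:
    """Return "cuda" when a CUDA GPU is usable and requested, else "cpu"."""
    if not prefer_gpu:
        return "cpu"
    try:
        import torch  # imported lazily so the helper also works without PyTorch
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


def load_whisper_model(size: str = "base", prefer_gpu: bool = True):
    """Load a Whisper model of the given size on the chosen device."""
    if size not in MODEL_SIZES:
        raise ValueError(f"unknown model size: {size!r}")
    import whisper  # assumption: the openai-whisper package is installed
    return whisper.load_model(size, device=pick_device(prefer_gpu))
```

Falling back to `"cpu"` whenever CUDA is unavailable keeps the same code path working on machines without a supported GPU.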
- Clone or download this repository
- Create and activate a virtual environment:
python -m venv whisper_env
whisper_env\Scripts\activate
- Install required dependencies:
pip install -r requirements.txt
Note: The Whisper package is installed directly from the GitHub repository to ensure compatibility.
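
After installing, a quick check like the one below can confirm that the dependencies are visible inside the virtual environment. This is only a sketch: `missing_modules` is a hypothetical helper, and the import names in `REQUIRED_MODULES` are assumed from the packages this README lists.

```python
import importlib.util

# Import names assumed for the packages listed in requirements.txt.
REQUIRED_MODULES = ("PyQt6", "whisper", "torch", "markdown")


def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```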
- Make sure you have PyInstaller installed:
pip install pyinstaller
- Create the executable using the provided spec file:
pyinstaller whisper_transcribe.spec
- The executable will be created in the `dist` folder. You can create a shortcut to `dist/WhisperTranscribe/WhisperTranscribe.exe` on your desktop.
Note: The first time you run the executable, it will download the Whisper model files. This might take a few minutes depending on your internet connection and the model size you select.
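
To see which models have already been downloaded, you can inspect the download cache. The sketch below assumes the upstream `whisper` package's default cache location (`~/.cache/whisper`, or `$XDG_CACHE_HOME/whisper` when that variable is set); the application or the bundled executable may use a different directory. `default_whisper_cache` and `cached_models` are hypothetical helper names.

```python
import os
from pathlib import Path
from typing import List


def default_whisper_cache() -> Path:
    """Best-guess default download directory (an assumption, see above)."""
    base = os.getenv("XDG_CACHE_HOME", str(Path.home() / ".cache"))
    return Path(base) / "whisper"


def cached_models(cache_dir: Path) -> List[str]:
    """List model checkpoints (*.pt files) already present in cache_dir."""
    if not cache_dir.is_dir():
        return []
    return sorted(p.stem for p in cache_dir.glob("*.pt"))


if __name__ == "__main__":
    print(cached_models(default_whisper_cache()))
```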
- Run the application (either through Python or the executable)
- Select an audio file to transcribe
- Choose the Whisper model size and processing device
- Set your preferred save directory (optional)
- Click "Transcribe Audio" and wait for the process to complete
- View the results in the Raw Transcript tab
- Optionally convert to Markdown with the "Prettify to Markdown" button
- Export the transcription as a text or Markdown file
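
The last two steps can be sketched in plain Python. This is an illustration only, not the application's implementation: the paragraph-grouping rule in `to_markdown`, the output file naming, and both function names are assumptions.

```python
import re
from pathlib import Path


def to_markdown(text: str, title: str, sentences_per_paragraph: int = 4) -> str:
    """Turn a raw transcript into a titled Markdown document with paragraphs."""
    # Split after sentence-ending punctuation followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    paragraphs = [
        " ".join(sentences[i:i + sentences_per_paragraph])
        for i in range(0, len(sentences), sentences_per_paragraph)
    ]
    return "\n\n".join([f"# {title}"] + paragraphs) + "\n"


def export_transcript(text: str, audio_path: str, save_dir: str,
                      as_markdown: bool = False) -> Path:
    """Write the transcript into save_dir, named after the audio file."""
    stem = Path(audio_path).stem
    out_dir = Path(save_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    if as_markdown:
        out_path = out_dir / f"{stem}.md"
        out_path.write_text(to_markdown(text, stem), encoding="utf-8")
    else:
        out_path = out_dir / f"{stem}.txt"
        out_path.write_text(text, encoding="utf-8")
    return out_path
```

With Whisper's Python API, the input text would typically come from `model.transcribe(audio_path)["text"]`.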
- Python 3.7+
- PyQt6
- OpenAI Whisper (from GitHub)
- PyTorch
- Markdown
- Larger models provide better transcription quality but require more memory and processing time
- GPU acceleration significantly improves processing speed for larger models
- The application creates "transcribed_text" and "uploaded_audio" directories in the application folder
- The executable bundles all necessary dependencies, so it is intended to run on Windows systems without a separate Python installation
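
The working directories mentioned above can be created idempotently at startup. A minimal sketch, assuming they live directly under the application folder; `ensure_work_dirs` is a hypothetical helper name, not a function from this project.

```python
from pathlib import Path

WORK_DIRS = ("transcribed_text", "uploaded_audio")


def ensure_work_dirs(app_dir):
    """Create the working directories under app_dir if they are missing."""
    created = []
    for name in WORK_DIRS:
        path = Path(app_dir) / name
        # exist_ok=True makes repeated calls safe.
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```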