whisper-model

Here are 16 public repositories matching this topic...

shhossain / BanglaSpeech2Text

BanglaSpeech2Text: An open-source offline speech-to-text package for the Bangla language, fine-tuned from the latest Whisper speech-to-text model for optimal performance.

machine-learning deep-learning speech pytorch transformer voice-recognition speech-recognition bangla speech-to-text hacktoberfest whisper bangla-asr bangla-speech-recognition bangla-speech-to-text bangla-automatic-speech-recognition whisper-model bangla-voice-recognition

Updated Mar 1, 2025
Python

jim-schwoebel / nala_assistant

🔊😊 A FastAPI voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.

Updated Jan 15, 2024
JavaScript

thc1006 / whisper-colab-tpu-transcriber

High-performance Google Colab Notebook for fast & accurate audio transcription/translation using OpenAI Whisper. Accelerated on TPUs with PyTorch/XLA. Features an interactive UI for model selection, multi-language support, and long-form audio processing.

python machine-learning natural-language-processing deep-learning ffmpeg jupyter-notebook pytorch speech-recognition ipywidgets voice-to-text tpu google-colab audio-transcription huggingface-transformers pytorch-xla openai-whisper whisper-model multilingual-asr

Updated Jun 8, 2025
Jupyter Notebook

hemangjoshi37a / French_audio_transcription_using_gradio

French audio transcription using gradio

machine-learning speech-recognition gradio audio-processing french-language audio-transcription audio-to-text transcription-tool whisper-model french-audio-transcription

Updated Sep 22, 2024
Jupyter Notebook

krithicswaroopan / AI-Voice-Assistance-Pipeline

A real-time voice-to-text and text-to-speech AI pipeline using Whisper, an LLM, and Edge-TTS with tunable parameters for low-latency audio processing and response generation.

python natural-language-processing text-to-speech speech-recognition speech-to-text real-time-processing conversational-ai voice-activity-detection ai-ml hugging-face-transformers large-language-models whisper-model edge-tts

Updated Sep 24, 2024
Python

furkanksl / FreeWhisper

Free macOS Whisper dictation app.

speech-to-text transcription whisper whisper-model

Updated Jun 5, 2025
Swift

franckferman / Whisper_Transcriber

📝 Turn audio into text effortlessly. Audio transcription powered by OpenAI's Whisper API.

Updated Mar 15, 2025
Python

dvorobiev / subtitles_project

Subtitles Generator: An automatic subtitle generator for videos with support for translation into various languages, using OpenAI's Whisper model.

python machine-learning subtitles video-processing audio-transcription whisper-model

Updated Mar 19, 2025
Python

sushant1827 / CrewAI-Agents-MinutesOfMeeting-Gmail

MinutesOfMeeting and Gmail is a collaborative crew of AI agents that autonomously processes audio, transcribes and summarizes it, then writes and drafts an email in a Gmail account.

chunking google-cloud-platform gmail-api audio-segmentation google-auth-library whisper-model llm-tools crewai agentic-workflow gpt-4o-mini agent-ops crewai-flow

Updated Jan 18, 2025
Python

otonomee / youtube-to-transcript

Convert YouTube videos to text files. Why spend 30 minutes watching a video when you can skim the transcript in a couple minutes?

python machine-learning openai youtube-downloader speech-to-text transcription pytube video-to-text audio-transcription whisper-model

Updated Jul 30, 2024
Python

Xza85hrf / Whisper-Subtitle-Generator

The Whisper Subtitle Generator leverages OpenAI's Whisper model to generate subtitles from audio and video files. This Python-based tool supports multiple languages and employs advanced audio processing techniques to ensure high accuracy in transcription.

python ffmpeg speech-recognition openai gpu-acceleration noise-reduction audio-processing subtitle-generator audio-to-text video-subtitles transcription-tool whisper-model multilingual-transcription srt-output vtt-output

Updated Apr 23, 2024
Python

seccanj / generate-subtitle-llm

Generates subtitles from a video's speech (using OpenAI's Whisper) or extracts existing subtitles, translates them into a different language using a Mistral LLM, and adds them to the video. Uses ffmpeg for extracting and encoding.

machine-learning video ai ffmpeg python3 video-processing subtitles-generator llms whisper-model mistral-7b subtitles-translator mistral-ai

Updated Jan 28, 2025
Python

Avinraj01 / SHL-Grammar-Scoring-Engine-for-Voice-Samples

This model predicts grammar scores (1–5) from audio files. It uses Whisper to transcribe speech to text, cleans the text, and extracts features with TF-IDF. A Random Forest Regressor is trained to learn grammar score patterns. Evaluation via Pearson Correlation showed good results.

machine-learning random-forest speech-recognition tf-idf nlp-machine-learning model-evaluation pearson-correlation text-preprocessing regression-model audio-to-text whisper-model grammar-scoring submission-pipeline

Updated Jun 10, 2025
Jupyter Notebook

RishabhMathur06 / Fine-Tuning-Whisper-Small-For-ASR-

This repository contains a notebook that shows how to fine-tune OpenAI's Whisper model on a custom Hindi dataset.

python artificial-intelligence openai automatic-speech-recognition whisper asr fine-tuning whisper-model

Updated Aug 14, 2024
Jupyter Notebook

13shivam / yt-agent

Offline-friendly backend POC to transcribe YouTube videos and chat with video content using Whisper (no cloud required) and local LLMs via Ollama like Mistral or LLaMA2. Built with Flask and PostgreSQL, fully open source with Swagger APIs. Easily connect any frontend. ⚠️ Use Submit API to download one video at a time to avoid YouTube throttling.

postgresql speech-recognition flask-api dockerised audiototext llm whisper-model whisper-ai ollama mistral-ai mistral-7b-instruct

Updated May 6, 2025
Python

AshwinSomi / messagingApp

A real-time chat application using Next.js, Redis, Pub/Sub, an audio-to-text model, and NextAuth. Still a work in progress.

redis pusher rest google-oauth2 tailwindcss next-auth huggingface whisper-model nextjs15-typescript

Updated Dec 9, 2024
TypeScript
