The app: https://english-speech-pathologist.streamlit.app/
English Speech Pathologist is an AI-powered web application that helps users improve their English speaking skills through automated speech evaluation and personalized feedback. This tool uses Google's Gemini AI to analyze speech recordings and provide detailed assessments on pronunciation, vocabulary, grammar, fluency, and coherence.
- AI-Powered Speech Evaluation: Get comprehensive feedback on your English speaking abilities
- Customizable Practice Sessions: Select topics, difficulty levels, and duration based on your needs
- Real-time Audio Recording: Record directly from your browser or upload audio files
- Detailed Feedback: Receive specific scores and recommendations across 5 key areas
- Visual Analytics: View your performance through an intuitive radar chart
- Error Highlighting: See grammar, vocabulary, and expression suggestions directly in your transcription
- Frontend & Backend: Streamlit
- AI Model: Google Gemini 2.0 Flash
- Data Visualization: Matplotlib
- Audio Processing: Browser-based audio recording
-
Clone the repository:
git clone https://github.com/dustinvk03/English_Speech_Pathologist cd english-speech-pathologist
-
Install dependencies:
pip install -r requirements.txt
-
Run the application:
streamlit run app.py
streamlit
numpy
matplotlib
google-generativeai
- Authentication: Enter your Google API key or use the provided password
- Set up your practice session:
- Choose a speaking topic
- Set the duration (1-10 minutes)
- Select your difficulty level (Beginner, Intermediate, Advanced)
- Practice speaking:
- Record directly using your microphone
- Or upload an audio file (WAV, MP3, M4A)
- Review your results:
- Overall score and detailed breakdown
- Transcription with highlighted improvement areas
- Strengths and recommendations for improvement
The application evaluates your speech based on five critical areas:
- Pronunciation: Accuracy of sounds, stress, and intonation
- Vocabulary: Word choice, variety, and appropriateness
- Grammar: Correctness of sentence structure and grammar rules
- Fluency: Smoothness, pace, and natural flow of speech
- Coherence: Organization, logical flow, and clarity of ideas
To use this application, you need a Google API key with access to the Gemini model:
- Get your API key from the Google AI Studio
- Enter it in the application when prompted
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (
git checkout -b feature/amazing-feature
) - Commit your changes (
git commit -m 'Add some amazing feature'
) - Push to the branch (
git push origin feature/amazing-feature
) - Open a Pull Request
- This application uses Google's Gemini AI technology
- UI components powered by Streamlit
- Visualization created with Matplotlib