English Speech Pathologist

The app: https://english-speech-pathologist.streamlit.app/

Overview

English Speech Pathologist is an AI-powered web application that helps users improve their English speaking skills through automated speech evaluation and personalized feedback. It uses Google's Gemini AI to analyze speech recordings and provide detailed assessments of pronunciation, vocabulary, grammar, fluency, and coherence.

Features

- AI-Powered Speech Evaluation: Get comprehensive feedback on your English speaking abilities
- Customizable Practice Sessions: Select topics, difficulty levels, and duration based on your needs
- Real-time Audio Recording: Record directly from your browser or upload audio files
- Detailed Feedback: Receive specific scores and recommendations across 5 key areas
- Visual Analytics: View your performance through an intuitive radar chart
- Error Highlighting: See grammar, vocabulary, and expression suggestions directly in your transcription

Tech Stack

- Frontend & Backend: Streamlit
- AI Model: Google Gemini 2.0 Flash
- Data Visualization: Matplotlib
- Audio Processing: Browser-based audio recording
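
As a rough illustration of the radar chart mentioned under Features, the five area scores can be mapped to evenly spaced angles and drawn on a Matplotlib polar axis. This is a minimal sketch and not the app's actual plotting code: the names `radar_coordinates` and `plot_radar` are hypothetical, and the 0-10 score scale is an assumption.

```python
import math

AREAS = ["Pronunciation", "Vocabulary", "Grammar", "Fluency", "Coherence"]


def radar_coordinates(scores):
    """Return (angles, values) describing a closed radar polygon.

    Each score gets one evenly spaced angle; the first point is repeated
    at the end so the outline closes when plotted.
    """
    n = len(scores)
    angles = [2 * math.pi * i / n for i in range(n)]
    return angles + angles[:1], list(scores) + list(scores[:1])


def plot_radar(scores, labels=AREAS):
    """Draw the scores on a polar axis and return the figure."""
    # Imported lazily so the geometry helper works without Matplotlib.
    import matplotlib.pyplot as plt

    angles, values = radar_coordinates(scores)
    fig, ax = plt.subplots(subplot_kw={"projection": "polar"})
    ax.plot(angles, values)
    ax.fill(angles, values, alpha=0.25)
    ax.set_xticks(angles[:-1])   # one tick per area, skipping the repeated point
    ax.set_xticklabels(labels)
    ax.set_ylim(0, 10)           # assumed 0-10 scale
    return fig
```

In a Streamlit app, the returned figure could be displayed with `st.pyplot(fig)`.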

Installation

Clone the repository:
```
git clone https://github.com/dustinvk03/English_Speech_Pathologist
cd English_Speech_Pathologist
```
Install dependencies:
```
pip install -r requirements.txt
```
Run the application:
```
streamlit run streamlit_app.py
```

Requirements

- streamlit
- numpy
- matplotlib
- google-generativeai

Usage

- Authentication: Enter your Google API key or use the provided password
- Set up your practice session:
  - Choose a speaking topic
  - Set the duration (1-10 minutes)
  - Select your difficulty level (Beginner, Intermediate, Advanced)
- Practice speaking:
  - Record directly using your microphone
  - Or upload an audio file (WAV, MP3, M4A)
- Review your results:
  - Overall score and detailed breakdown
  - Transcription with highlighted improvement areas
  - Strengths and recommendations for improvement
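
The session options above (topic, a duration of 1 to 10 minutes, one of three difficulty levels, and WAV/MP3/M4A uploads) can be expressed as a small validation layer. The sketch below only illustrates those constraints; `PracticeSession` and `is_supported_audio` are hypothetical names and are not taken from the app's source.

```python
from dataclasses import dataclass
from pathlib import Path

LEVELS = ("Beginner", "Intermediate", "Advanced")
AUDIO_EXTENSIONS = {".wav", ".mp3", ".m4a"}


@dataclass(frozen=True)
class PracticeSession:
    """Settings for one practice session (hypothetical structure)."""
    topic: str
    duration_minutes: int
    level: str

    def __post_init__(self):
        # Reject settings outside the ranges listed in the Usage section.
        if not self.topic.strip():
            raise ValueError("topic must not be empty")
        if not 1 <= self.duration_minutes <= 10:
            raise ValueError("duration must be between 1 and 10 minutes")
        if self.level not in LEVELS:
            raise ValueError(f"level must be one of {LEVELS}")


def is_supported_audio(filename: str) -> bool:
    """Check the file extension against the supported upload formats."""
    return Path(filename).suffix.lower() in AUDIO_EXTENSIONS
```

For example, `PracticeSession("Travel", 3, "Beginner")` is accepted, while a duration of 11 raises `ValueError`.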

Speech Evaluation Criteria

The application evaluates your speech based on five critical areas:

- Pronunciation: Accuracy of sounds, stress, and intonation
- Vocabulary: Word choice, variety, and appropriateness
- Grammar: Correctness of sentence structure and grammar rules
- Fluency: Smoothness, pace, and natural flow of speech
- Coherence: Organization, logical flow, and clarity of ideas
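
One way to hold the five area scores and derive an overall score is sketched below. This README does not say how the app combines the areas, so the unweighted mean here is an assumption, as are the `SpeechScores` name and the numeric scale.

```python
from dataclasses import dataclass, asdict
from statistics import mean


@dataclass(frozen=True)
class SpeechScores:
    """Scores for the five evaluation areas (hypothetical structure)."""
    pronunciation: float
    vocabulary: float
    grammar: float
    fluency: float
    coherence: float

    def overall(self) -> float:
        # Assumption: the overall score is the unweighted mean of the areas.
        return round(mean(list(asdict(self).values())), 1)

    def weakest_area(self) -> str:
        # Name of the lowest-scoring area, useful for targeting practice.
        scores = asdict(self)
        return min(scores, key=scores.get)
```

A weighted average would be an equally plausible choice if some areas should count more than others.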

API Key Setup

To use this application, you need a Google API key with access to the Gemini model:

- Get your API key from Google AI Studio
- Enter it in the application when prompted
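
For scripts outside the web UI, the key can also be supplied to the `google-generativeai` package directly. The sketch below is an assumption-laden example rather than the app's own code: the `GOOGLE_API_KEY` environment variable and the helper names are hypothetical, and `gemini-2.0-flash` is assumed to be the model identifier for Gemini 2.0 Flash.

```python
import os


def load_api_key(env=None) -> str:
    """Read the key from GOOGLE_API_KEY (an assumed variable name)."""
    env = os.environ if env is None else env
    key = env.get("GOOGLE_API_KEY", "").strip()
    if not key:
        raise RuntimeError("Set GOOGLE_API_KEY to a key from Google AI Studio")
    return key


def make_model(api_key: str, model_name: str = "gemini-2.0-flash"):
    """Configure the client and return a model handle."""
    # Imported lazily; provided by the google-generativeai requirement.
    import google.generativeai as genai

    genai.configure(api_key=api_key)
    return genai.GenerativeModel(model_name)
```

Calling `make_model(load_api_key())` returns an object whose `generate_content` method sends a prompt to the model. Keeping the key in an environment variable avoids committing it to the repository.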

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

- Fork the repository
- Create your feature branch (git checkout -b feature/amazing-feature)
- Commit your changes (git commit -m 'Add some amazing feature')
- Push to the branch (git push origin feature/amazing-feature)
- Open a Pull Request

Acknowledgments

- This application uses Google's Gemini AI technology
- UI components powered by Streamlit
- Visualization created with Matplotlib
