A desktop application for performing Exploratory Data Analysis (EDA) using natural language commands.
- Upload and analyze CSV datasets
- Voice command recognition, planned for OpenAI Whisper (currently uses the Vosk library for speech recognition)
- Natural language processing of analysis commands
- Interactive visualizations and statistical analysis
- Clean, minimal GUI built with Streamlit
- Create a virtual environment (recommended):
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
- Install dependencies:
pip install -r requirements.txt
- Run the application:
streamlit run app.py
- Launch the application
- Upload your CSV dataset using the file upload widget
- Click the "Start Listening" button or press the spacebar to begin voice recognition
- Speak your analysis commands naturally
- View the results in the main panel
- "Show summary statistics"
- "Plot a histogram of age"
- "Create a boxplot for salary"
- "Display the correlation matrix"
- "Show missing value heatmap"
- "Run linear regression with price as target and area, rooms as features"
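The example commands above could be mapped to structured analysis requests along the following lines. This is only a minimal sketch using regular expressions from the standard library; the `PATTERNS` table, the action names, and `parse_command` are hypothetical and are not taken from the application's actual code, which may parse commands differently.

```python
import re

# Hypothetical (pattern, action) table; the first matching pattern wins.
PATTERNS = [
    (re.compile(r"summary statistics"), "summary"),
    (re.compile(r"histogram of (?P<column>\w+)"), "histogram"),
    (re.compile(r"boxplot for (?P<column>\w+)"), "boxplot"),
    (re.compile(r"correlation matrix"), "correlation"),
    (re.compile(r"missing value heatmap"), "missing_heatmap"),
    (
        re.compile(
            r"linear regression with (?P<target>\w+) as target"
            r" and (?P<features>.+?) as features"
        ),
        "linear_regression",
    ),
]


def parse_command(text):
    """Turn a spoken command into a dict such as {"action": ..., "column": ...}.

    Returns None when no known pattern matches.
    """
    normalized = text.lower().strip()
    for pattern, action in PATTERNS:
        match = pattern.search(normalized)
        if match is None:
            continue
        params = match.groupdict()
        if "features" in params:
            # Split "area, rooms" or "area and rooms" into a list of column names.
            parts = re.split(r",|\band\b", params["features"])
            params["features"] = [p.strip() for p in parts if p.strip()]
        return {"action": action, **params}
    return None
```

For instance, `parse_command("Plot a histogram of age")` yields a dict with the action `"histogram"` and the column `"age"`, which a dispatcher could then pass to the matching pandas or plotting routine. Column names would still need to be checked against the uploaded dataset.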
- Python 3.10 or higher
- Microphone for voice input
- Speakers for optional text-to-speech output
The application runs entirely locally, with no cloud dependencies. Voice recognition is intended to run on your machine using OpenAI's Whisper model; the current build uses Vosk instead, and known speech recognition issues are being actively fixed.
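As an illustration of local speech recognition, offline transcription with Vosk can look roughly like the sketch below. It assumes a mono 16-bit PCM WAV file and a Vosk model directory that has already been downloaded; `extract_text`, `transcribe_wav`, and both path parameters are hypothetical names, and the application's own audio handling may differ (for example, it may read from the microphone instead of a file).

```python
import json
import wave


def extract_text(result_json):
    """Return the transcript from a Vosk JSON result string."""
    return json.loads(result_json).get("text", "")


def transcribe_wav(wav_path, model_path):
    """Transcribe a mono 16-bit PCM WAV file without any network access."""
    # Imported lazily so extract_text can be used without Vosk installed.
    from vosk import KaldiRecognizer, Model

    model = Model(model_path)  # assumed: path to an unpacked Vosk model directory
    pieces = []
    with wave.open(wav_path, "rb") as wav_file:
        recognizer = KaldiRecognizer(model, wav_file.getframerate())
        while True:
            chunk = wav_file.readframes(4000)
            if not chunk:
                break
            # AcceptWaveform returns True once a full utterance has been decoded.
            if recognizer.AcceptWaveform(chunk):
                pieces.append(extract_text(recognizer.Result()))
        # Flush whatever audio is still buffered in the recognizer.
        pieces.append(extract_text(recognizer.FinalResult()))
    return " ".join(piece for piece in pieces if piece)
```

The returned text could then be handed to the command parsing step.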