A Python-based floating GUI application that works as an exam helper and personal assistant. It captures questions from on-screen text (OCR) and audio input, then answers them using OpenAI GPT, Google Gemini, or Perplexity models.
Modern, sleek interface with dark theme and intuitive controls
- 🎨 Modern Dark UI: Sleek, professional interface with smooth animations
- 🤖 Multi-AI Support: OpenAI GPT, Google Gemini, and Perplexity integration
- 👻 Stealth Mode: Invisible to screen sharing (Zoom, Teams, etc.)
- 📸 Smart Capture: OCR, audio recording, and live screen monitoring
- ⚡ Real-time Processing: Instant AI responses with progress indicators
- Screen OCR: Continuously scans the entire screen for text questions
- Audio Input: Captures questions from microphone input
- Manual Input: Text box for typing custom questions
- Smart Detection: Automatically identifies likely questions using heuristics
- Dual AI Integration: OpenAI for text, Perplexity for vision analysis
- Perplexity Vision: Advanced image analysis for screenshot-based questions
- Response Modes: Toggle between short/direct and detailed/explanatory answers
- Question Analysis: Automatically categorizes question types (math, definition, explanation, etc.)
- Context Awareness: Provides relevant and accurate answers
- Screen Sharing Protection: Hides from screen capture applications (Zoom, Teams, etc.)
- Always Visible: Remains visible to the user while hidden from recordings
- Transparency Control: Adjustable window transparency
- Click-through Mode: Optional mouse event pass-through
- Modern Dark Theme: Professional appearance with smooth gradients
- Floating Window: Always-on-top, resizable interface (toggleable)
- Model Selection: Choose between different AI models for different tasks
- Quick Toggle: Ctrl+Shift+H hotkey to hide/show instantly
- Screenshot Vision: Ctrl+Shift+C hotkey for instant screenshot capture and AI analysis
- Real-time Display: Live answer updates with timestamps and status indicators
- Settings Panel: Easy configuration of API keys and preferences
- Progress Tracking: Visual feedback for all operations
- Download the latest release from the releases page
- Extract ExamHelper.exe from the zip file
- Run ExamHelper.exe (no installation required)
- Configure your API keys in the settings
- Python 3.8+ installed on your system
- Tesseract OCR for text recognition
- OpenAI API Key for AI responses
python install_dependencies.py

Or manually install:

pip install -r requirements.txt

Windows:

winget install UB-Mannheim.TesseractOCR

Or download from: https://github.com/UB-Mannheim/tesseract/wiki

macOS:

brew install tesseract

Linux:

sudo apt-get install tesseract-ocr

- Run the application:

python exam_helper.py

- The application will create a config.json file automatically
- Use the model selection dropdowns to choose your preferred AI models:
- Image Recognition: OpenAI GPT-4o, Gemini Pro Vision, etc.
- Audio Recognition: OpenAI Whisper, Gemini models
- AI Response: OpenAI GPT models, Gemini Flash/Pro
- Enter your API keys in the configuration file or through the interface
- Adjust other preferences as needed
To create your own standalone executable:
# Method 1: Advanced build with progress tracking
python build_exe.py
# Method 2: Simple one-click build (slower build, larger executable)
build_simple.bat
# Method 3: Optimized one-click build (faster build, smaller executable)
build_optimize.bat
# Method 4: Alternative scripts with similar build time and executable size
build_executable.bat
build.bat
# Method 5: Manual PyInstaller
pyinstaller --onefile --windowed exam_helper.py

See BUILD_GUIDE.md for detailed build instructions.

python exam_helper.py

- Automatic Scanning: The app continuously scans for questions
- Manual Questions: Type questions in the input box
- Screenshot Analysis: Click "📸 Capture Screen" or press Ctrl+Shift+C
- View Answers: Responses appear in the main display area
- Toggle Modes: Switch between short and detailed answers
- Control Scanning: Start/Stop OCR and Audio scanning independently
- Hide/Show: Use Ctrl+Shift+H or the Hide button
- Always on Top: Toggle to keep window above other applications
- Automatically hides from most screen sharing applications
- Window remains visible to you but not in recordings
- Works with Zoom, Teams, Google Meet, Discord, and more
{
  "openai_api_key": "your-api-key-here",
  "scan_interval": 3,
  "audio_enabled": true,
  "ocr_enabled": true,
  "response_mode": "short"
}

- scan_interval: Seconds between screen scans (default: 3)
- audio_enabled: Enable/disable microphone listening
- ocr_enabled: Enable/disable screen text capture
- response_mode: "short" for brief answers, "detailed" for explanations
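
The options above could be loaded along the following lines. This is a minimal sketch, not the application's actual loader: the `load_config` function, the `DEFAULT_CONFIG` dictionary, and the fallback rules for invalid values are assumptions made for illustration. Only the key names and the default scan interval of 3 come from the example configuration.

```python
import json
from pathlib import Path

# Key names mirror the example config; the default values other than
# scan_interval are assumptions for this sketch.
DEFAULT_CONFIG = {
    "openai_api_key": "",
    "scan_interval": 3,
    "audio_enabled": True,
    "ocr_enabled": True,
    "response_mode": "short",
}

VALID_RESPONSE_MODES = {"short", "detailed"}


def load_config(path="config.json"):
    """Return settings from `path`, filling missing keys with defaults."""
    config = dict(DEFAULT_CONFIG)
    config_path = Path(path)
    if config_path.exists():
        with config_path.open(encoding="utf-8") as handle:
            config.update(json.load(handle))

    # Fall back to the default when a value is outside the documented range.
    if config["response_mode"] not in VALID_RESPONSE_MODES:
        config["response_mode"] = DEFAULT_CONFIG["response_mode"]

    interval = config["scan_interval"]
    is_number = isinstance(interval, (int, float)) and not isinstance(interval, bool)
    if not is_number or interval <= 0:
        config["scan_interval"] = DEFAULT_CONFIG["scan_interval"]
    return config
```

Merging the file over a dictionary of defaults means a partially filled config.json still yields a complete set of settings.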
exam_helper/
├── exam_helper.py          # Main application with modern GUI
├── ocr_module.py           # Screen text capture and OCR
├── audio_module.py         # Audio input and speech recognition
├── llm_module.py           # OpenAI API integration
├── gemini_module.py        # Google Gemini AI integration
├── perplexity_module.py    # Perplexity AI integration
├── stealth_module.py       # Stealth mode functionality
├── screenshot_module.py    # Screenshot capture utilities
├── config.json             # Configuration file (auto-generated)
├── requirements.txt        # Python dependencies
├── build_exe.py            # Advanced executable builder
├── build_simple.bat        # Simple build script
├── BUILD_GUIDE.md          # Detailed build instructions
├── preview.png             # Application screenshot
└── README.md               # This file
- Uses Tesseract OCR with OpenCV preprocessing
- Applies noise reduction and image enhancement
- Filters out non-question text automatically
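
The README does not spell out the heuristics used to filter non-question text or to categorize questions, so the following is only an illustrative sketch of how such a filter might look. The keyword list, the three-word minimum, the category names beyond those mentioned in the feature list, and both function names are assumptions.

```python
import re

# Hypothetical keyword list; the real module may use different cues.
QUESTION_WORDS = (
    "what", "why", "how", "when", "where", "which", "who",
    "define", "explain", "calculate", "solve", "describe",
)


def is_likely_question(text):
    """Guess whether a line of OCR text is a question."""
    line = text.strip()
    if len(line.split()) < 3:
        return False  # Too short to be a meaningful question.
    if line.endswith("?"):
        return True
    first_word = re.split(r"\W+", line.lower(), maxsplit=1)[0]
    return first_word in QUESTION_WORDS


def categorize_question(text):
    """Assign a coarse category: math, definition, explanation, or general."""
    lowered = text.lower().strip()
    has_arithmetic = re.search(r"\d\s*[-+*/^=]\s*\d", lowered)
    if has_arithmetic or re.search(r"\b(calculate|solve)\b", lowered):
        return "math"
    if re.search(r"\b(define|definition)\b", lowered) or lowered.startswith("what is"):
        return "definition"
    if re.search(r"\b(why|how|explain|describe)\b", lowered):
        return "explanation"
    return "general"
```

For example, `is_likely_question("File Edit View")` is rejected because the line neither ends with a question mark nor starts with a cue word, while `categorize_question("Calculate 12 * 7")` falls into the math category.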
- Real-time speech recognition using Google Speech API
- Microphone calibration for ambient noise
- Voice activity detection to reduce false positives
- Uses Windows API calls to hide from screen capture
- SetWindowDisplayAffinity with WDA_EXCLUDEFROMCAPTURE
- Layered window attributes for transparency control
- Multi-Provider Support: OpenAI GPT models, Google Gemini, Perplexity
- Model Selection: Choose optimal models for different tasks
- Smart Routing: Automatic model selection based on question type
- Rate limiting: Prevents API abuse and manages costs
- Error handling: Robust network and API error recovery
- Status Indicators: Real-time model availability and health checks
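
Rate limiting and error recovery can be sketched as below. This is a generic pattern, not the code in llm_module.py: the `MinIntervalLimiter` class, the `call_with_retry` helper, and their parameters are hypothetical names, and a real integration would catch the provider's specific network and API exceptions rather than every `Exception`.

```python
import time


class MinIntervalLimiter:
    """Allow at most one call per `min_interval` seconds."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last_call = None

    def wait(self):
        """Block until enough time has passed since the previous call."""
        now = self._clock()
        if self._last_call is not None:
            remaining = self.min_interval - (now - self._last_call)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last_call = now


def call_with_retry(func, retries=3, base_delay=1.0, sleep=time.sleep):
    """Call `func`, retrying failed attempts with exponential backoff."""
    for attempt in range(retries):
        try:
            return func()
        except Exception:
            if attempt == retries - 1:
                raise  # Out of attempts: surface the last error.
            sleep(base_delay * (2 ** attempt))
```

A caller would invoke `limiter.wait()` before each request and wrap the request itself in `call_with_retry`. The clock and sleep functions are injectable so the behavior can be tested without real delays.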
"No OpenAI API key configured"
- Add your API key in Settings
- Ensure the key has sufficient credits
"Tesseract not found"
- Install Tesseract OCR (see installation steps)
- On Windows, you may need to set the Tesseract path manually in ocr_module.py
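
To check whether Tesseract is reachable before editing any paths, a small lookup like the one below may help. It is a sketch under assumptions: the `find_tesseract` helper is hypothetical, and the fallback location is the usual default of the UB-Mannheim installer, which may differ on your machine.

```python
import shutil
from pathlib import Path

# Assumed default install location on Windows; adjust if yours differs.
WINDOWS_CANDIDATES = (
    Path(r"C:\Program Files\Tesseract-OCR\tesseract.exe"),
)


def find_tesseract(extra_candidates=WINDOWS_CANDIDATES):
    """Return the path to the tesseract executable, or None if not found."""
    found = shutil.which("tesseract")  # Search the directories on PATH first.
    if found:
        return found
    for candidate in extra_candidates:
        if Path(candidate).is_file():
            return str(candidate)
    return None


if __name__ == "__main__":
    print(find_tesseract() or "Tesseract not found")
```

If ocr_module.py relies on the pytesseract package (an assumption, since the README does not name the binding), the returned path is the value to assign to `pytesseract.pytesseract.tesseract_cmd`.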
"Microphone not working"
- Check microphone permissions
- Ensure microphone is not used by other applications
- Try running as administrator
"Stealth mode not working"
- Run as administrator for full functionality
- Some screen capture apps may still detect the window
- Try adjusting transparency settings
- Increase scan_interval for better performance
- Disable audio or OCR if not needed
- Close other resource-intensive applications
- This tool is for educational assistance only
- Respect your institution's academic integrity policies
- Use responsibly and in accordance with exam rules
- The stealth feature should only be used where permitted
- Audio is processed locally and via Google Speech API
- Questions are sent to OpenAI for processing
- No data is stored permanently by the application
- Review OpenAI's privacy policy for their data handling
Feel free to contribute improvements:
- Fork the repository
- Create a feature branch
- Make your changes
- Submit a pull request
This project is for educational purposes. Please use responsibly and in accordance with your institution's policies.
For issues or questions:
- Check the troubleshooting section
- Review the logs in exam_helper.log
- Ensure all dependencies are properly installed
- Verify your OpenAI API key is valid and has credits
