Chrome AI Assistant is a powerful browser extension that brings AI-powered content analysis directly to any webpage. Simply click the extension icon on any website, PDF, or image, and get instant AI insights without leaving your current page. The extension automatically extracts content from webpages, reads text from images via OCR, processes PDF documents, and provides intelligent responses using advanced AI models.
- Universal Content Analysis - Works on any webpage, PDF, or image
- AI-Powered OCR - Extract and understand text from images using Tesseract.js
- PDF Text Extraction - Read and analyze PDF documents directly in browser
- Real-time Content Processing - Instant analysis of webpage content
- Voice Input Support - Hands-free interaction with speech recognition
- Professional Chat Interface - Modern UI with light/dark themes
- Smart Content Prioritization - AI focuses on relevant content based on your questions
- Cross-Platform Compatibility - Works on all websites and content types
- Chrome Web Store: Install Extension
- Manifest V3 - Latest Chrome extension standards
- Vanilla JavaScript - Pure JS for maximum performance and compatibility
- CSS3 - Modern styling with responsive design
- HTML5 - Semantic markup and accessibility
- Node.js - JavaScript runtime for server-side processing
- Express.js - Fast, lightweight web framework
- Tesseract.js - Advanced OCR engine for image text extraction
- pdf-parse - PDF document text extraction
- Sharp - High-performance image processing
- OpenRouter API - Access to multiple AI models (gpt-3.5-turbo, etc.)
- Content Intelligence - Smart content filtering and prioritization
- Context-Aware Responses - AI understands webpage context
- Chrome Extensions API - Tab management and content injection
- Web Speech API - Voice input and speech recognition
- DOM Manipulation - Advanced webpage content extraction
# Clone repository
git clone https://github.com/rabumaabraham/Chrome-AI-Assistant.git
cd Chrome-AI-Assistant
# Setup environment
cp .env.example .env
# Add your OpenRouter API key to .env
# Start with Docker
docker-compose up -d
# Load Extension in Chrome
# 1. Open Chrome and go to chrome://extensions/
# 2. Enable "Developer mode"
# 3. Click "Load unpacked" and select the chrome-extension folder

# Clone repository
git clone https://github.com/rabumaabraham/Chrome-AI-Assistant.git
cd Chrome-AI-Assistant
# Setup Backend
cd backend
npm install
# Setup environment
cp env.example .env
# Add your OpenRouter API key to .env
# Start backend server
npm start
# Load Extension in Chrome
# 1. Open Chrome and go to chrome://extensions/
# 2. Enable "Developer mode"
# 3. Click "Load unpacked" and select the chrome-extension folder

Required configuration:
- OpenRouter API key (for AI responses)
- Backend server URL (for OCR and PDF processing)
- Download the extension from Chrome Web Store
- Click "Add to Chrome" to install
- No backend setup required - uses hosted services
Base URL: http://localhost:3000 (Development) or https://chrome-ai-assistant.onrender.com (Production)
- POST /api/ask-ai - Send question with webpage context to AI

      {
        "question": "What is this page about?",
        "context": {
          "url": "https://example.com",
          "title": "Page Title",
          "textContent": "Extracted page content...",
          "ocrText": "Text from images...",
          "pdfText": "PDF content..."
        }
      }

- POST /api/ocr - Extract text from images

      {
        "imageData": "data:image/png;base64,...",
        "imageInfo": { "src": "image-url", "width": 800, "height": 600 }
      }

- POST /api/pdf/extract - Extract text from PDF documents

      { "pdfData": "data:application/pdf;base64,..." }

- GET /api/health - Server status and health monitoring
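As a minimal sketch of calling the first endpoint above, the helper below builds the JSON body shown for `/api/ask-ai` and posts it with `fetch`. The names `buildAskAiRequest` and `askAi` are hypothetical, the `Content-Type` header is an assumption about what the backend expects, and the reply is returned as parsed JSON without assuming any field names, since the response shape is not documented here.

```javascript
// Development base URL as listed above; pass the production URL when deployed.
const DEV_BASE_URL = "http://localhost:3000";

// Hypothetical helper: builds the fetch arguments for POST /api/ask-ai.
function buildAskAiRequest(question, context, baseUrl = DEV_BASE_URL) {
  return {
    url: `${baseUrl}/api/ask-ai`,
    options: {
      method: "POST",
      // Assumption: the backend parses JSON request bodies.
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ question, context }),
    },
  };
}

// Hypothetical wrapper: sends the request and returns the parsed JSON reply.
async function askAi(question, context, baseUrl = DEV_BASE_URL) {
  const { url, options } = buildAskAiRequest(question, context, baseUrl);
  const response = await fetch(url, options);
  if (!response.ok) {
    throw new Error(`ask-ai request failed with status ${response.status}`);
  }
  return response.json();
}
```

Keeping the request construction separate from the network call makes the payload easy to inspect or log before anything is sent.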
User: "What are the main topics discussed on this page?"
AI: Analyzes webpage content and provides a comprehensive summary of key topics, themes, and important information.
User: "What does this image say?"
AI: Uses OCR to read text from images, screenshots, memes, or documents and provides the extracted content with context.
User: "Summarize this PDF document"
AI: Extracts and analyzes PDF content, providing detailed summaries, key points, and relevant information.
User: *Speaks* "Explain the main points from this article"
AI: Processes voice input, analyzes webpage content, and responds with comprehensive explanations.
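The voice example above relies on the Web Speech API. The sketch below shows one way such input could be wired up; it is not the extension's actual code. `createVoiceInput` is a hypothetical helper, the `en-US` language is an assumption, and the recognition constructor is passed in as a parameter so the logic can also be exercised outside a browser.

```javascript
// Hypothetical helper: wires a SpeechRecognition instance to a callback.
// In Chrome, pass window.SpeechRecognition || window.webkitSpeechRecognition.
function createVoiceInput(SpeechRecognitionCtor, onTranscript) {
  if (!SpeechRecognitionCtor) {
    throw new Error("Speech recognition is not available in this browser");
  }

  const recognition = new SpeechRecognitionCtor();
  recognition.lang = "en-US";         // assumed language
  recognition.interimResults = false; // deliver final results only
  recognition.maxAlternatives = 1;

  recognition.onresult = (event) => {
    // The first alternative of the first result holds the recognized text.
    const transcript = event.results[0][0].transcript;
    onTranscript(transcript.trim());
  };

  // The caller starts listening with recognition.start(), typically from a
  // click handler, since the browser asks for microphone permission.
  return recognition;
}
```

The transcript passed to `onTranscript` can then be sent as the question, in the same way as typed input.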
- DOM Parsing - Extracts headings, paragraphs, lists, and structured content
- Image Detection - Automatically finds and processes images with text
- PDF Recognition - Detects and processes PDF documents
- Content Prioritization - AI focuses on most relevant content for your questions
- Multi-format Support - PNG, JPEG, WebP, and other image formats
- Text Recognition - Accurate extraction from screenshots, photos, and documents
- Context Understanding - AI interprets extracted text in context
- Performance Optimized - Fast processing with timeout protection
- Context-Aware - Understands webpage content and user intent
- Multi-modal Processing - Combines text, images, and PDF content
- Personalized Responses - Tailored answers based on content type
- Real-time Processing - Instant responses with optimized performance
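The DOM parsing step listed above (headings, paragraphs, lists) can be illustrated with a small sketch. This is an assumption about how such extraction might look, not the extension's implementation: `extractStructuredContent` and the chosen selectors are hypothetical, and the function accepts any object that provides `querySelectorAll`, such as `document` in a content script.

```javascript
// Hypothetical extractor: collects headings, paragraphs, and list items from
// a document-like object that implements querySelectorAll.
function extractStructuredContent(doc) {
  // Return the trimmed, non-empty text of every element matching the selector.
  const textOf = (selector) =>
    Array.from(doc.querySelectorAll(selector))
      .map((el) => (el.textContent || "").trim())
      .filter((text) => text.length > 0);

  return {
    title: doc.title || "",
    headings: textOf("h1, h2, h3"),
    paragraphs: textOf("p"),
    listItems: textOf("li"),
  };
}
```

In a content script this would be called as `extractStructuredContent(document)`, and the result could feed the `textContent` field of the request context.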
- Package extension files
- Upload to Chrome Web Store Developer Dashboard
- Submit for review
- Publish to users worldwide
- Deploy Node.js backend to hosting service (Heroku, Railway, etc.)
- Set environment variables
- Update extension with production backend URL
- Monitor with health checks
- OCR Timeouts - Maximum 5 seconds per image processing
- Content Limits - Processes up to 3 images for optimal speed
- Smart Filtering - Skips non-text images (avatars, icons, etc.)
- Efficient Processing - Timeout protection (each task is raced against a timer)
- Memory Management - Optimized for long browsing sessions
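The limits above (5 seconds per image, at most 3 images, skipping icons and avatars) could be enforced along the following lines. This is a sketch under stated assumptions rather than the project's code: the 100-pixel size threshold, the function names, and the `recognize` callback (a stand-in for whatever OCR call returns a promise) are all hypothetical.

```javascript
const MAX_IMAGES = 3;        // "up to 3 images" from the list above
const OCR_TIMEOUT_MS = 5000; // "maximum 5 seconds per image"
const MIN_SIDE_PX = 100;     // assumed threshold for skipping icons and avatars

// Keep images large enough to plausibly contain text, capped at MAX_IMAGES.
function selectImagesForOcr(images) {
  return images
    .filter((img) => img.width >= MIN_SIDE_PX && img.height >= MIN_SIDE_PX)
    .slice(0, MAX_IMAGES);
}

// Race a task against a timer; reject if the timer wins.
function withTimeout(promise, ms) {
  let timer;
  const timeout = new Promise((_, reject) => {
    timer = setTimeout(() => reject(new Error(`Timed out after ${ms} ms`)), ms);
  });
  // Clear the timer either way so it does not keep running after the race.
  return Promise.race([promise, timeout]).finally(() => clearTimeout(timer));
}

// `recognize` is a hypothetical async OCR function: (image) => Promise<string>.
async function ocrImages(images, recognize) {
  const selected = selectImagesForOcr(images);
  const results = await Promise.allSettled(
    selected.map((img) => withTimeout(recognize(img), OCR_TIMEOUT_MS))
  );
  // Keep only the images that finished in time without an error.
  return results
    .filter((r) => r.status === "fulfilled")
    .map((r) => r.value);
}
```

Using `Promise.allSettled` means one slow or failing image does not discard the text recovered from the others.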
- Local Processing - Content extraction happens in browser
- Secure API Calls - Encrypted communication with backend
- No Data Storage - No user data or content is permanently stored
- Privacy First - Respects user privacy and website terms
Contributions are welcome! Please follow these steps:
- Fork the repository
- Create a feature branch: git checkout -b feature/example-feature
- Commit your changes: git commit -m 'Add example feature'
- Push to the branch: git push origin feature/example-feature
- Open a Pull Request
MIT License - see LICENSE file for details.
- Email: iamrabuma@gmail.com
- Issues: GitHub Issues
Transform your browsing experience with AI-powered content analysis!


