⚠️ Important: This extension requires the Chrome Dev or Canary channel (≥ 128.0.6545.0) to function properly.
ChromeSAHAYAK is a web app and Chrome extension for learning and entertainment. Built on Chrome's Prompt API and the Speech Recognition API, it offers personalized tutoring, hands-free browsing, interactive games, and mental health support.
Engage directly with the AI Learner by asking questions or seeking information. The intelligent system provides accurate, tailored responses to enhance your understanding of complex topics.

Simplify learning with AI Learner's lesson conversion feature. It breaks down complex topics into structured, digestible lessons, making it easier for users to grasp even the most challenging concepts.

Leverage the AI voice assistant's ability to detect emotions during interactions. By understanding the user's emotional state, AI Learner adapts its responses to provide a supportive and empathetic learning experience.

Experience hands-free learning with the AI voice assistant. It offers seamless voice-based navigation, personalized suggestions, and interactive support, ensuring a convenient and user-friendly educational journey.

- Customizable Answers:
  - Adjust answers by difficulty level (Simple ➡️ Advanced).
  - Control the length (Short ➡️ Detailed).
- Interactive Features:
  - Use the "Ask" button for instant explanations.
  - Save answers to PDF for future reference.
  - Create Structured Lessons: Generate a structured lesson plan for in-depth study.
  - Audio Classes: Learn with AI-generated audio explanations.
- Real-Time Interaction: Follow-up questions are supported for dynamic learning.
- Hands-Free Learning & Navigation:
  - Use voice commands to interact with the AI.
  - Perform tasks like opening tabs, searching queries, and interacting with lessons.
- Conversational AI:
  - Engage in natural conversations.
  - Includes mental health support: acts as a friendly consoler or psychiatrist to help users feel better during tough times.
- Audio to Text: Convert YouTube audio into text using `yt-dlp` and the Google Speech API.
- Multilingual Support: Understand content in multiple languages.
- Mood-Based Recommendations: Suggests resources or playlists based on context.
- Trivia & Quizzes: Fun, engaging games to test knowledge.
- Storytelling Mode: AI-generated stories based on your prompts.
- Joke Generator: Enjoy light-hearted jokes, riddles, and humor.
- Voice-Driven Doodles: Create sketches with simple verbal descriptions.
- AI-Curated Music: Get personalized playlists based on detected mood.
- AI Memes: Generate witty, captioned memes.
- Fortune Teller: Playful AI-powered horoscopes and fortunes.
- Language Games: Challenge vocabulary and translation skills.
- Chat Games: Play "20 Questions," "Would You Rather," and more.

Component | Minimum Requirement |
---|---|
Browser | Chrome Dev/Canary (≥ 128.0.6545.0), required |
Operating System | Windows 10+, macOS 13+, or Linux |
CPU | Multi-core processor (Intel/AMD) |
GPU/VRAM | GPU with 4GB+ VRAM supporting FP16 |
Storage | 24GB free space (22GB Gemini Nano, 2GB Moondream2) |
- Download Chrome Dev or Chrome Canary.
- Verify that the version is ≥ 128.0.6545.0 at `chrome://settings/help`.
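
The version comparison in the last step can be sketched in JavaScript. This is a minimal illustration, assuming the version string is copied by hand from `chrome://settings/help`; `compareVersions` and `meetsMinimum` are hypothetical helper names, not part of the project.

```javascript
// Hypothetical helper: compare two dotted version strings numerically.
// Returns a negative number, zero, or a positive number.
function compareVersions(a, b) {
  const pa = a.split(".").map(Number);
  const pb = b.split(".").map(Number);
  const len = Math.max(pa.length, pb.length);
  for (let i = 0; i < len; i++) {
    // Missing components count as 0, so "128" compares like "128.0.0.0".
    const diff = (pa[i] ?? 0) - (pb[i] ?? 0);
    if (diff !== 0) return diff;
  }
  return 0;
}

// Minimum version stated in the requirements.
const MIN_CHROME_VERSION = "128.0.6545.0";

function meetsMinimum(version) {
  return compareVersions(version, MIN_CHROME_VERSION) >= 0;
}

console.log(meetsMinimum("128.0.6545.0"));   // true
console.log(meetsMinimum("127.0.6533.120")); // false
console.log(meetsMinimum("130.0.6700.0"));   // true
```

Comparing component by component avoids the pitfall of plain string comparison, where `"99.0.0.0"` would sort after `"128.0.0.0"`.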
Enable the following Chrome flags:
Category | Flag | Setting |
---|---|---|
WebGPU | chrome://flags/#enable-webgpu-developer-features | Enabled |
Gemini Nano | chrome://flags/#optimization-guide-on-device-model | Enabled BypassPerfRequirement |
Prompt API | chrome://flags/#prompt-api-for-gemini-nano | Enabled |

Restart Chrome after enabling these flags to apply changes.
- Open the Prompt API Playground.
- Launch the DevTools Console (`F12`).
- Execute: `(await ai.languageModel.capabilities()).available;`
- If the return value is `"after-download"`:
  - Go to `chrome://components`.
  - Verify that Optimization Guide On Device Model is at version ≥ 2024.5.21.1031.
  - If it is outdated, click "Check for update".
- If the return value is `"no"`:
  - Run in the Console: `await ai.languageModel.create();`
  - If it fails (expected), relaunch Chrome and retry the earlier command.

- Execute in the DevTools Console: `(await ai.languageModel.capabilities()).available;`
- Ensure the return value is `"readily"`. A value of `"after-download"` means the model has not finished downloading yet.
- Confirm that Optimization Guide On Device Model is up to date under `chrome://components`.
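
The checks above amount to a small decision table over the value of `available`. The sketch below restates it as a function; `nextStep` is a hypothetical name, and the `"readily"` case is an assumption based on the preview Prompt API this document targets, where that value signals a usable model.

```javascript
// Hypothetical helper: map the value of
// (await ai.languageModel.capabilities()).available
// to the follow-up action described in the setup steps.
function nextStep(available) {
  switch (available) {
    case "readily":
      // Assumption: the preview API reports "readily" once the model is usable.
      return "Model is ready; no further action needed.";
    case "after-download":
      return "Open chrome://components, check that Optimization Guide On Device Model " +
        "is at version 2024.5.21.1031 or later, and click \"Check for update\" if it is outdated.";
    case "no":
      return "Run await ai.languageModel.create() in the Console, relaunch Chrome, and check again.";
    default:
      return `Unexpected value: ${String(available)}`;
  }
}

// In the DevTools Console of Chrome Dev/Canary, with the flags enabled:
//   nextStep((await ai.languageModel.capabilities()).available);
console.log(nextStep("after-download"));
```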
- Open your terminal or command prompt.
- Run: `git clone https://github.com/vinayak1729-web/Chrome-Sahayaka.git`
- Navigate to the project directory: `cd Chrome-Sahayaka`
- Open `index.html` in your browser:
  - Double-click the `index.html` file.
  - Or right-click > "Open with..." > select Chrome.

Enjoy exploring the interactive features like AI tutoring, Fun Zone, and mental health support.
Important: After modifying Chrome flags, restart Chrome to ensure settings are activated correctly.
- Chrome AI APIs:
  - Prompt API: Enables content generation.
  - Speech Recognition API: Supports voice commands.
- Google Cloud Speech API: Transcribes audio.
- yt-dlp: Downloads YouTube videos and extracts audio.
- Flask Backend: Powers server-side operations.

- Type your question in the input field.
- Adjust the difficulty and length settings.
- Use:
  - "Ask" for instant answers.
  - "Save to PDF" to save the answer for future reference.
  - "Create Lesson" for a structured AI-generated lesson plan.
  - "Audio Class" to listen to explanations in an AI-generated voice.
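
One way the difficulty and length settings could shape a request is by prefixing the question with instructions before it is sent to the Prompt API. This is a sketch under stated assumptions, not the project's actual implementation: `buildPrompt`, the hint texts, and the setting keys are all hypothetical.

```javascript
// Hypothetical instruction snippets for each setting value.
const DIFFICULTY_HINTS = {
  simple: "Explain in plain language suitable for a beginner.",
  advanced: "Explain with technical depth suitable for an expert.",
};
const LENGTH_HINTS = {
  short: "Keep the answer to a few sentences.",
  detailed: "Give a detailed, step-by-step answer.",
};

// Look up a key on the object itself, ignoring inherited properties.
function lookup(table, key, label) {
  if (!Object.prototype.hasOwnProperty.call(table, key)) {
    throw new Error(`Unknown ${label}: ${key}`);
  }
  return table[key];
}

// Hypothetical helper: combine the settings and the question into one prompt.
function buildPrompt(question, { difficulty = "simple", length = "short" } = {}) {
  const d = lookup(DIFFICULTY_HINTS, difficulty, "difficulty");
  const l = lookup(LENGTH_HINTS, length, "length");
  return `${d} ${l}\n\nQuestion: ${question.trim()}`;
}

console.log(buildPrompt("  What is photosynthesis? ", { difficulty: "advanced", length: "detailed" }));

// In Chrome Dev/Canary with the flags enabled, the result could then be sent with:
//   const session = await ai.languageModel.create();
//   const answer = await session.prompt(buildPrompt("What is photosynthesis?"));
```

Keeping the prompt construction in a pure function makes it easy to test outside the browser, where the `ai` global is not available.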
- Activate the microphone icon.
- Speak naturally for queries or navigation.
- Let AI open tabs, search, and interact seamlessly.
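
To illustrate how a spoken phrase might be turned into one of the actions above, here is a minimal sketch of a command parser. The command grammar, the action names, and `parseVoiceCommand` itself are hypothetical; the phrases the project actually recognizes may differ.

```javascript
// Hypothetical helper: classify a speech transcript as a navigation
// command, a search, or a plain question for the AI.
function parseVoiceCommand(transcript) {
  const text = transcript.trim().toLowerCase();

  // "open tab", "open new tab", "open a new tab"
  if (/^open (?:a )?(?:new )?tab$/.test(text)) {
    return { action: "openTab" };
  }

  // "search cats", "search for black holes"
  const search = text.match(/^search (?:for )?(.+)$/);
  if (search) {
    return { action: "search", query: search[1] };
  }

  // Anything else is treated as a question.
  return { action: "ask", query: text };
}

console.log(parseVoiceCommand("Open a new tab"));
console.log(parseVoiceCommand("Search for black holes"));

// In the browser, the transcript could come from the Web Speech API:
//   const recognition = new webkitSpeechRecognition();
//   recognition.onresult = (e) =>
//     console.log(parseVoiceCommand(e.results[0][0].transcript));
//   recognition.start();
```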
- Paste a YouTube video URL.
- Click "Transcribe" to get detailed, multilingual text.
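
Before a pasted URL is handed to the transcription backend, it can be useful to confirm that it points at a YouTube video. The sketch below is an assumption about how such a check might look, not code from the project; `extractYouTubeId` is a hypothetical helper, and it only covers the common `watch?v=` and `youtu.be` URL shapes.

```javascript
// Hypothetical helper: return the 11-character video ID from a YouTube URL,
// or null if the input is not a recognizable YouTube video link.
function extractYouTubeId(input) {
  let url;
  try {
    url = new URL(input.trim());
  } catch {
    return null; // Not a parseable URL at all.
  }

  const host = url.hostname.replace(/^www\./, "");
  let id = null;
  if (host === "youtu.be") {
    id = url.pathname.slice(1); // Short links carry the ID in the path.
  } else if (host === "youtube.com" || host === "m.youtube.com") {
    id = url.searchParams.get("v"); // Watch links carry it in ?v=
  }

  // Video IDs are 11 characters from [A-Za-z0-9_-].
  return id && /^[\w-]{11}$/.test(id) ? id : null;
}

console.log(extractYouTubeId("https://www.youtube.com/watch?v=abcdefghijk")); // "abcdefghijk"
console.log(extractYouTubeId("https://youtu.be/abcdefghijk"));                // "abcdefghijk"
console.log(extractYouTubeId("not a url"));                                   // null
```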
- Explore engaging games and creative tools.
- Use voice commands to create doodles, jokes, and more.
- AR-powered immersive learning.
- Offline functionality for uninterrupted learning.
- Enhanced interactivity with advanced AI models.

We ❤️ contributions! Help us improve by:
- Suggesting new features
- Reporting bugs
- Submitting pull requests

Developed by Vinayak Shinde, Vishal Mishra, Chelsea Singla, and Swasthika Devadiga.

Contact: Have questions? Open an issue in the repo!

Star this repo if you find it helpful.

This project is licensed under the MIT License.
MIT License
Copyright (c) 2024 Vinayak Shinde
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.