Elarova — A smart, multimodal research assistant designed to help students by combining speech, text, and other input modes for efficient academic research and study support. Powered by state-of-the-art speech recognition, text-to-speech, and AI models, including meta-llama/llama-4-scout-17b-16e-instruct, with an easy-to-use Gradio web interface.


🎓 Elarova 2.0 – Multimodal Medical Virtual Learning Chatbot for Medicine Students (Version 2 of Elarova)

Elarova is an intelligent, voice-interactive multimodal medical learning chatbot for medicine students that helps them explore and understand visual academic content such as diagrams, charts, handwritten notes, or academic papers. Just speak your query, upload an image, and Elarova will answer both visually and audibly.

📸 Demo image


🧠 Model Used

  • Multimodal Model: meta-llama/llama-4-scout-17b-16e-instruct via Groq API
  • Voice Recognition: Whisper
  • TTS Engines: Google gTTS & ElevenLabs
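The Groq chat-completions API accepts multimodal messages that mix text and an image. As a rough sketch (not the repository's actual code), a request for this model might be assembled like this, with the uploaded image embedded as a base64 data URL:

```python
import base64

MODEL = "meta-llama/llama-4-scout-17b-16e-instruct"

def build_vision_message(question: str, image_bytes: bytes, mime: str = "image/png") -> dict:
    """Pack a text question and an image into one multimodal user message."""
    b64 = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": question},
            {"type": "image_url", "image_url": {"url": f"data:{mime};base64,{b64}"}},
        ],
    }

msg = build_vision_message("Explain this diagram.", b"\x89PNG...")
# With the groq Python client, this message would then be sent as:
# client.chat.completions.create(model=MODEL, messages=[msg])
```

The network call is left commented out so the snippet stays runnable without an API key; only the payload shape is shown.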

Tech Stack

  • Groq API – ultra-fast LLM inference
  • LLaMA-4 Vision Model – meta-llama/llama-4-scout-17b-16e-instruct
  • Whisper – speech recognition
  • gTTS & ElevenLabs – voice output
  • Gradio – web interface

🔍 Features

  • 🎙️ Voice Input: Speak your question naturally.
  • 🧠 Multimodal AI: Combines your voice query with an uploaded image to give smart, context-aware answers.
  • 🖼️ Image Understanding: Upload diagrams, charts, handwritten pages, or screenshots — Elarova understands them.
  • 💬 LLM-Powered Responses: Powered by meta-llama/llama-4-scout-17b-16e-instruct via Groq API.
  • 🔊 Dual TTS Engines: Replies are spoken aloud using both gTTS and ElevenLabs.
  • 🌐 Gradio Web Interface: Clean, easy-to-use interface accessible from your browser.
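Conceptually, one turn of the assistant chains the pieces above: transcribe the spoken question, pair it with the uploaded image for the vision model, then speak the reply. A minimal sketch of that flow (all function names here are illustrative stand-ins, not the repository's actual API):

```python
def answer_turn(audio_path, image_path, transcribe, ask_llm, speak):
    """One chat turn: speech -> text -> multimodal LLM -> text + speech.

    transcribe/ask_llm/speak are injected so the flow can be shown
    without Whisper, Groq, or a TTS engine installed.
    """
    question = transcribe(audio_path)       # e.g. Whisper STT
    reply = ask_llm(question, image_path)   # e.g. LLaMA-4 Scout via Groq
    audio_reply = speak(reply)              # e.g. gTTS or ElevenLabs
    return reply, audio_reply

# Wiring with stand-in functions:
text, audio = answer_turn(
    "question.wav", "diagram.png",
    transcribe=lambda p: "Explain this diagram.",
    ask_llm=lambda q, img: f"Answer to: {q}",
    speak=lambda t: b"<mp3 bytes>",
)
```

Injecting the three stages as parameters is just for the sketch; in the app they live in voice_of_the_user.py, brain_of_the_Elarova.py, and voice_of_the_doctor.py respectively.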

📁 Project Structure

Elarova/
├── gradio_app.py            # Main app with Gradio interface
├── brain_of_the_Elarova.py  # Core logic for image + query processing
├── .env
├── voice_of_the_doctor.py
└── voice_of_the_user.py


⚙️ Setup Instructions

1. Clone the Repo

git clone https://github.com/iamafridi/Elarova2.0.git
cd Elarova2.0

2. Set Environment Variables

Create a .env file in the project root with your API keys:

GROQ_API_KEY=your_groq_api_key
ELEVENLABS_API_KEY=your_elevenlabs_api_key

3. Run the App

python gradio_app.py

🎓 Example Use Case

Upload a diagram and ask:

🗣️ "Explain this process in simple terms."

📢 Elarova will generate a voice and text response explaining the diagram based on your question.

📜 License

MIT License

👤 Author

Afridi Akbar Ifty
GitHub: https://github.com/iamafridi
Portfolio: https://iamafrididev.netlify.app
LinkedIn: your-linkedin-profile
