NotesBot- Gemini-Powered Multi-File Processor

This project is a Python-based Streamlit application that processes and analyzes unstructured data from PDFs and images using Google Gemini’s Large Language Model (LLM) API. It extracts text from PDFs and images and answers user queries based on the extracted data.

Features

Multiple File Uploads: Users can upload multiple PDFs and images.
Text Extraction: Extract text from PDFs and images using PyMuPDF (fitz) and Tesseract.
Query Processing: Use Google Gemini’s generative AI to answer user queries based on the extracted text.
User-Friendly Interface: Streamlit-powered UI for easy file upload and query submission.

Installation

Prerequisites

Before you begin, ensure you have the following installed:

Clone the Repository

git clone https://github.com/your-username/gemini-multi-file-processor.git
cd gemini-multi-file-processor
Here's the given content rewritten in Markdown (`.md`) format:

```markdown
## Environment Variables

To use Google Gemini’s API, you'll need to set up environment variables. The project uses `dotenv` to manage these.

### Create a `.env` file in the project directory:

```bash
touch .env

Add your Gemini API key:

GEMINI_API_KEY=your-google-gemini-api-key

Usage

Start the Streamlit application:

streamlit run app.py

Open your browser and go to:

http://localhost:8501

Upload your PDF and image files using the interface.
Input your query (e.g., "What is the main topic of the handwritten notes?").
The application will process the files, extract the text, and provide answers based on the extracted data.

How It Works

File Upload: Users upload PDFs and images through the Streamlit interface.
Text Extraction:
- PDFs: The application uses PyMuPDF (fitz) to extract images embedded in PDFs and Tesseract for Optical Character Recognition (OCR) to extract text from the images and PDFs.
Knowledge Base Creation: All extracted text is compiled into a centralized knowledge base.
Gemini API Querying: The knowledge base and user query are sent to Google Gemini’s LLM API, which generates answers based on the data.
Response Display: The generated response is shown in the Streamlit interface.


This `.md` content is properly formatted for inclusion in a `README.md` or documentation file.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.idea		.idea
AIppt.pptx		AIppt.pptx
FLOW-10 2.jpg		FLOW-10 2.jpg
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

NotesBot- Gemini-Powered Multi-File Processor

Table of Contents

Features

Installation

Prerequisites

Clone the Repository

Add your Gemini API key:

Usage

Start the Streamlit application:

Open your browser and go to:

How It Works

About

Uh oh!

Releases

Packages

Languages

ShiroYasha18/NotesBot--Gemini-Powered-Multi-File-Processor

Folders and files

Latest commit

History

Repository files navigation

NotesBot- Gemini-Powered Multi-File Processor

Table of Contents

Features

Installation

Prerequisites

Clone the Repository

Add your Gemini API key:

Usage

Start the Streamlit application:

Open your browser and go to:

How It Works

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages