Git Insight Orchestrator Agent

Screenshots: Chat Interface (laptop) and Repository Analysis (mobile)

Git Insight Orchestrator Agent is an AI-powered chatbot that lets you query any public GitHub repository by analyzing its source code. It clones the repository, chunks and embeds the code files, stores the embeddings in ChromaDB for semantic search, and passes the most relevant code segments to a language model (Gemini) to give deep, contextual answers about the codebase.

Features

  • Repository Analysis: Clone and analyze any public GitHub repository
  • Semantic Search: Find relevant code sections using natural language queries
  • AI-Powered Answers: Get explanations and insights about the codebase
  • Vector Database: Efficient storage and retrieval of code embeddings
  • Modern UI: Clean, futuristic interface for optimal user experience

Technology Stack

  • Backend: Python, Flask
  • Vector Database: ChromaDB
  • Embeddings: Google Embedding 001
  • LLM: Gemini 2.5 Flash
  • Frontend: HTML, CSS, JavaScript
  • Repository Handling: GitPython

Workflow

graph TD
    A[GitHub Repository URL] --> B[Clone Repository]
    B --> C[Extract Code Files]
    C --> D[Generate Embeddings]
    D --> E[Store in ChromaDB]
    E --> F[User Query]
    F --> G[Similarity Search]
    G --> H[Generate Response with Gemini 2.5 Flash]
    H --> I[Display Results]
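A minimal end-to-end sketch of this pipeline, assuming a LangChain-based implementation. The file-extension filter, paths, and function names below are illustrative, not the repository's actual code:

# Illustrative sketch of the workflow above (not the repository's actual modules).
from pathlib import Path

from git import Repo                                   # GitPython
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_google_genai import GoogleGenerativeAIEmbeddings, ChatGoogleGenerativeAI
from langchain_community.vectorstores import Chroma

CODE_EXTENSIONS = {".py", ".js", ".ts", ".java", ".go", ".md"}  # assumed filter

def analyze_repository(repo_url: str, clone_dir: str = "workspace/repo") -> Chroma:
    """Clone a repository, chunk its code files, and store embeddings in ChromaDB."""
    Repo.clone_from(repo_url, clone_dir)

    # Extract code files and split them into overlapping chunks.
    splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=200)
    chunks, metadatas = [], []
    for path in Path(clone_dir).rglob("*"):
        if path.is_file() and path.suffix in CODE_EXTENSIONS:
            text = path.read_text(errors="ignore")
            for chunk in splitter.split_text(text):
                chunks.append(chunk)
                metadatas.append({"source": str(path)})

    # Embed the chunks and persist them in ChromaDB.
    embeddings = GoogleGenerativeAIEmbeddings(model="models/embedding-001")
    return Chroma.from_texts(
        chunks, embeddings, metadatas=metadatas,
        persist_directory="db", collection_name="code_embeddings",
    )

def answer(db: Chroma, question: str) -> str:
    """Run a similarity search and ask Gemini to answer with the retrieved context."""
    docs = db.similarity_search(question, k=5)
    context = "\n\n".join(d.page_content for d in docs)
    llm = ChatGoogleGenerativeAI(model="gemini-1.5-flash")
    prompt = f"Answer using this repository context:\n{context}\n\nQuestion: {question}"
    return llm.invoke(prompt).content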

Installation

Prerequisites

  • Python 3.9+
  • Git
  • Google Cloud API key (for Gemini and Embeddings)

Setup

  1. Clone the repository:
git clone https://github.com/jasjeev013/Git-Insight-Orchestrator-Agent.git
cd Git-Insight-Orchestrator-Agent
  2. Create and activate a virtual environment:
python -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`
  3. Install dependencies:
pip install -r requirements.txt
  4. Create a .env file and add your API key (see the loading sketch below):
GOOGLE_API_KEY=your_google_api_key
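The application needs this key at startup to call the Gemini and embedding APIs. A minimal sketch of how the .env file is typically loaded, assuming python-dotenv is used (not necessarily how app.py actually does it):

# Sketch of loading the API key from .env (assumes python-dotenv).
import os
from dotenv import load_dotenv

load_dotenv()                              # reads GOOGLE_API_KEY from .env
api_key = os.environ["GOOGLE_API_KEY"]     # raises KeyError if the key is missing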

Usage

  1. Start the Flask server:
python app.py
  2. Open your browser to http://localhost:5000

  3. Enter a GitHub repository URL and click "Analyze"

  4. Once processed, you can ask questions about the repository

API Endpoints

  • POST /analyze - Submit a GitHub repository for analysis
  • POST /chat - Submit a query about the analyzed repository
  • GET /status - Check processing status
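A hypothetical client sketch for these endpoints using the requests library; the JSON field names (repo_url, query) are assumptions, not a documented request schema:

# Hypothetical client for the endpoints above (field names are assumptions).
import requests

BASE = "http://localhost:5000"

# Submit a repository for analysis.
requests.post(f"{BASE}/analyze", json={"repo_url": "https://github.com/pallets/flask"})

# Check the processing status.
print(requests.get(f"{BASE}/status").json())

# Ask a question about the analyzed repository.
resp = requests.post(f"{BASE}/chat", json={"query": "Where is the request routing implemented?"})
print(resp.json())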

Configuration

Modify config.py to adjust these settings:

# Chunking parameters
CHUNK_SIZE = 1000
CHUNK_OVERLAP = 200

# Database settings
PERSIST_DIRECTORY = "db"
COLLECTION_NAME = "code_embeddings"

# Model settings
EMBEDDING_MODEL = "models/embedding-001"
LLM_MODEL = "gemini-1.5-flash"
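For illustration, this is how the chunking parameters would typically feed a text splitter; a sketch only, assuming config.py is importable as a module and a LangChain splitter is used:

# Sketch of consuming the chunking settings (assumes `import config` works).
from langchain_text_splitters import RecursiveCharacterTextSplitter
import config

splitter = RecursiveCharacterTextSplitter(
    chunk_size=config.CHUNK_SIZE,        # max characters per chunk
    chunk_overlap=config.CHUNK_OVERLAP,  # characters shared between adjacent chunks
)
chunks = splitter.split_text(open("app.py").read())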

Development

To contribute to the project:

  1. Fork the repository
  2. Create a new branch (git checkout -b feature-branch)
  3. Commit your changes (git commit -am 'Add new feature')
  4. Push to the branch (git push origin feature-branch)
  5. Create a new Pull Request

Testing

Run the test suite with:

python -m pytest tests/
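An example test along these lines, as a sketch that assumes app.py exposes a Flask application object named app:

# tests/test_status.py (illustrative; assumes `app` is the Flask instance in app.py)
from app import app

def test_status_endpoint_returns_ok():
    client = app.test_client()
    response = client.get("/status")
    assert response.status_code == 200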

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • Google for the Gemini models and embeddings
  • ChromaDB team for the vector database
  • LangChain for the LLM integration framework

Support

For issues or questions, please open an issue on GitHub or contact jasjeev99@gmail.com
