Maestro is an intelligent code review assistant built with DSPy-Go that performs comprehensive, file-level analysis of GitHub pull requests. It combines AST parsing, semantic analysis, and LLM-powered reasoning to produce actionable review feedback, and it integrates with the Claude Code and Gemini CLI tools for AI-assisted development workflows.
```mermaid
graph TB
    subgraph "User Interface Layer"
        CLI[Command Line Interface]
        Interactive[Interactive Mode]
        GitHub[GitHub Integration]
        ClaudeCLI[Claude Code CLI]
        GeminiCLI[Gemini CLI]
        Sessions[AI Session Management]
    end

    subgraph "Core Analysis Engine"
        Agent[PR Review Agent]
        Context[Context Extractor]
        Processor[Enhanced Review Processor]
        Aggregator[Result Aggregator]
    end

    subgraph "Intelligence Layer"
        AST[AST Parser]
        RAG[RAG Store & Retrieval]
        LLM[LLM Orchestration]
        Rules[Review Rules Engine]
    end

    subgraph "Data & Storage"
        Vector[SQLite + Vector DB]
        Embedding[Embedding Models]
        Guidelines[Code Guidelines]
        Patterns[Code Patterns]
    end

    CLI --> Agent
    Interactive --> Agent
    GitHub --> Agent
    ClaudeCLI --> Agent
    GeminiCLI --> Agent
    Sessions --> Agent
    Agent --> Context
    Agent --> Processor
    Processor --> Aggregator
    Context --> AST
    Processor --> RAG
    Processor --> LLM
    Agent --> Rules
    RAG --> Vector
    RAG --> Embedding
    Vector --> Guidelines
    Vector --> Patterns
```
- AST-Based Parsing: Deep Go code structure analysis including packages, imports, types, and functions
- File-Level Context: Comprehensive understanding of code relationships and dependencies
- Semantic Purpose Detection: Automatic identification of code chunk functionality
- Enhanced Chunk Metadata: Rich context with 15+ lines of surrounding code
- Multi-Stage Processing: Context preparation → Analysis → Validation → Aggregation
- File-Level Aggregation: Groups related issues by file with intelligent deduplication
- Advanced Debugging: Comprehensive logging and performance metrics
- Configurable Processing: Environment variables for fine-tuning behavior
- Seamless PR Workflow: Direct integration with GitHub pull request comments
- Bulk Processing: Efficient handling of large PRs with parallel processing
- Interactive Mode: Guided setup and configuration
- Real-time Feedback: Live processing status and progress indicators
- Claude Code CLI: Direct integration with Anthropic's official Claude Code CLI
- Gemini CLI: Integration with Google's Gemini CLI for web search and general queries
- Session Management: Create and manage multiple isolated AI sessions
- Smart Routing: Automatic delegation of tasks to the most appropriate AI tool
- Tool Auto-Setup: Automatic installation and configuration of CLI tools
- Multiple Backends: Anthropic Claude, Google Gemini, Local models (Ollama, LLaMA.cpp)
- Unified Embedding: Consistent vector representations for code and guidelines
- Performance Optimization: Intelligent model selection and caching
- Code Defects: Logic flaws, error handling issues, resource management
- Security Vulnerabilities: Injection attacks, insecure data handling, authentication issues
- Maintainability: Code organization, documentation, naming conventions, complexity
- Performance: Algorithmic efficiency, data structures, resource utilization
- Vector-Based Similarity: SQLite with sqlite-vec for efficient code pattern matching
- Deduplication Engine: Levenshtein distance-based issue consolidation
- Context Extraction: Go AST parsing with semantic analysis (see the sketch after this list)
- Background Indexing: Non-blocking repository analysis
- Parallel Processing: Concurrent chunk analysis for performance
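
As a concrete illustration of the AST-based context extraction described above, here is a minimal sketch using only Go's standard `go/parser` and `go/ast` packages to pull a file's package name, imports, and function declarations. It is illustrative only: `extractContext` is a hypothetical name, and Maestro's real extractor layers semantic purpose detection and dependency analysis on top of this kind of traversal.

```go
package main

import (
	"fmt"
	"go/ast"
	"go/parser"
	"go/token"
)

// extractContext is a hypothetical helper sketching the first step of
// AST-based context extraction: parse one file and list its package
// name, imports, and top-level functions with their line positions.
func extractContext(path string) error {
	fset := token.NewFileSet()
	file, err := parser.ParseFile(fset, path, nil, parser.ParseComments)
	if err != nil {
		return err
	}

	fmt.Println("package:", file.Name.Name)
	for _, imp := range file.Imports {
		fmt.Println("import:", imp.Path.Value)
	}

	// Knowing each declaration's position is what makes it possible to
	// slice out the surrounding lines of context for a chunk.
	ast.Inspect(file, func(n ast.Node) bool {
		if fn, ok := n.(*ast.FuncDecl); ok {
			pos := fset.Position(fn.Pos())
			fmt.Printf("func %s at line %d\n", fn.Name.Name, pos.Line)
		}
		return true
	})
	return nil
}

func main() {
	if err := extractContext("main.go"); err != nil {
		fmt.Println("parse error:", err)
	}
}
```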
- Go 1.24.1 or higher
- SQLite with sqlite-vec extension
- GitHub API access token
- Supported LLM backend (Claude, Gemini, or local model)
```bash
# Clone the repository
git clone https://github.com/XiaoConstantine/maestro.git
cd maestro

# Install dependencies
go mod download

# Build the binary
go build -o maestro

# Set up GitHub token
export MAESTRO_GITHUB_TOKEN=your_github_token
```
```bash
# Interactive mode with guided setup and AI tool integration
./maestro -i

# Direct CLI usage for a specific PR
./maestro --owner=username --repo=repository --pr=123

# Use integrated Claude Code CLI
maestro> /claude help
maestro> /claude create a react component

# Use integrated Gemini CLI for web search
maestro> /gemini search for latest Go best practices

# Create and manage AI sessions
maestro> /sessions create frontend "react development"
maestro> /enter frontend

# With enhanced debugging
export MAESTRO_LOG_LEVEL=debug
export MAESTRO_RAG_DEBUG_ENABLED=true
./maestro --owner=username --repo=repository --pr=123 --verbose
```
```bash
MAESTRO_GITHUB_TOKEN=your_token              # GitHub API access
MAESTRO_LOG_LEVEL=debug                      # Logging level (debug, info, warn, error)

# File-level aggregation
MAESTRO_FILE_AGGREGATION_ENABLED=true        # Enable file-level result aggregation
MAESTRO_DEDUPLICATION_THRESHOLD=0.8          # Issue similarity threshold (0.0-1.0)

# AST context extraction
MAESTRO_CONTEXT_EXTRACTION_ENABLED=true      # Enable AST-based context extraction
MAESTRO_CHUNK_CONTEXT_LINES=15               # Lines of context around chunks
MAESTRO_ENABLE_SEMANTIC_CONTEXT=true         # Enable semantic purpose detection
MAESTRO_ENABLE_DEPENDENCY_ANALYSIS=true      # Enable dependency tracking

# Advanced debugging
MAESTRO_RAG_DEBUG_ENABLED=true               # RAG retrieval debugging
MAESTRO_LLM_RESPONSE_DEBUG=true              # LLM response debugging
MAESTRO_SIMILARITY_LOGGING=true              # Similarity score logging
MAESTRO_ENHANCED_REASONING=true              # Enhanced reasoning capabilities
MAESTRO_COMMENT_REFINEMENT=true              # Comment refinement processing
MAESTRO_CONSENSUS_VALIDATION=true            # Consensus validation
```
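
To make the deduplication threshold concrete: two reported issues are treated as duplicates when their normalized Levenshtein similarity meets the `MAESTRO_DEDUPLICATION_THRESHOLD` value above. The sketch below shows that idea with hypothetical names and plain string issues; it is not Maestro's actual implementation, which deduplicates richer issue records.

```go
package main

import "fmt"

// levenshtein computes edit distance with a two-row DP table.
// It compares bytes, which is adequate for ASCII issue text.
func levenshtein(a, b string) int {
	prev := make([]int, len(b)+1)
	curr := make([]int, len(b)+1)
	for j := range prev {
		prev[j] = j
	}
	for i := 1; i <= len(a); i++ {
		curr[0] = i
		for j := 1; j <= len(b); j++ {
			cost := 1
			if a[i-1] == b[j-1] {
				cost = 0
			}
			curr[j] = min(prev[j]+1, curr[j-1]+1, prev[j-1]+cost)
		}
		prev, curr = curr, prev
	}
	return prev[len(b)]
}

// similarity maps edit distance into [0, 1]; 1 means identical.
func similarity(a, b string) float64 {
	if len(a) == 0 && len(b) == 0 {
		return 1
	}
	return 1 - float64(levenshtein(a, b))/float64(max(len(a), len(b)))
}

// dedupe keeps the first of any pair of issues whose similarity
// meets or exceeds the threshold (0.8 in the config above).
func dedupe(issues []string, threshold float64) []string {
	var kept []string
	for _, issue := range issues {
		dup := false
		for _, k := range kept {
			if similarity(issue, k) >= threshold {
				dup = true
				break
			}
		}
		if !dup {
			kept = append(kept, issue)
		}
	}
	return kept
}

func main() {
	issues := []string{
		"missing error check on file close",
		"missing error check on file Close",
		"unbounded goroutine spawn in loop",
	}
	// The first two issues are ~0.97 similar, so only one survives.
	fmt.Println(dedupe(issues, 0.8))
}
```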
- `--github-token`: GitHub personal access token
- `--owner`: Repository owner
- `--repo`: Repository name
- `--pr`: Pull request number to review
- `--model`: LLM backend selection
- `--index-workers`: Concurrent indexing workers
- `--review-workers`: Concurrent review workers
- `--verbose`: Enable detailed logging
- `-i`: Interactive mode with AI CLI integration

- `/help`: Show available commands
- `/claude [args]`: Access Claude Code CLI directly
- `/gemini [args]`: Access Gemini CLI for web search
- `/sessions create <name> <purpose>`: Create new AI session
- `/enter <session>`: Enter interactive Claude session
- `/list`: List all available sessions
- `/tools setup`: Install and configure CLI tools
- `/ask <question>`: Ask questions about the repository
```bash
# Use local LLaMA.cpp
./maestro --model="llamacpp:" --pr=123

# Use Ollama with specific model
./maestro --model="ollama:codellama" --pr=123

# Use Anthropic Claude
./maestro --model="anthropic:claude-3-sonnet" --api-key=your_key --pr=123

# Use Google Gemini
./maestro --model="google:gemini-pro" --api-key=your_key --pr=123
```
- Codebase: 19,552+ lines across 27 Go modules
- Dependencies: DSPy-Go v0.36.0, SQLite-vec, GitHub API v68, Claude Code CLI, Gemini CLI
- Processing: Handles 300+ chunks per PR with file-level aggregation
- Performance: ~500ms average per chunk with parallel processing (see the worker-pool sketch after this list)
- AI Integration: Seamless switching between multiple AI tools and sessions
- AI CLI Integration: Direct Claude Code and Gemini CLI integration
- Session Management: Multi-session AI workflow support
- File Aggregation: 371 chunks → 21 files (proper grouping)
- Context Enhancement: 5 → 15+ lines of chunk context
- Debug Visibility: Comprehensive RAG and processing metrics
- Lint Compliance: Zero golangci-lint issues
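
The concurrency behind those numbers can be pictured as a standard Go worker pool: chunks fan out to a fixed number of goroutines, bounded by the `--review-workers` flag. A minimal sketch under that assumption (`Chunk` and `reviewChunk` are hypothetical stand-ins, not Maestro's API):

```go
package main

import (
	"fmt"
	"sync"
)

// Chunk is a hypothetical unit of code under review.
type Chunk struct{ ID int }

// reviewChunk stands in for per-chunk analysis (~500ms each in practice).
func reviewChunk(c Chunk) string {
	return fmt.Sprintf("reviewed chunk %d", c.ID)
}

// reviewAll fans chunks out to `workers` goroutines and collects one
// result per chunk, mirroring the --review-workers setting.
func reviewAll(chunks []Chunk, workers int) []string {
	jobs := make(chan Chunk)
	results := make(chan string, len(chunks))

	var wg sync.WaitGroup
	for range workers {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for c := range jobs {
				results <- reviewChunk(c)
			}
		}()
	}

	for _, c := range chunks {
		jobs <- c
	}
	close(jobs)
	wg.Wait()
	close(results)

	out := make([]string, 0, len(chunks))
	for r := range results {
		out = append(out, r)
	}
	return out
}

func main() {
	chunks := make([]Chunk, 8)
	for i := range chunks {
		chunks[i] = Chunk{ID: i}
	}
	for _, r := range reviewAll(chunks, 4) {
		fmt.Println(r)
	}
}
```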
```bash
# Full debug mode with all logging enabled
export MAESTRO_LOG_LEVEL=debug
export MAESTRO_RAG_DEBUG_ENABLED=true
export MAESTRO_LLM_RESPONSE_DEBUG=true
export MAESTRO_SIMILARITY_LOGGING=true
export MAESTRO_CONTEXT_EXTRACTION_ENABLED=true
./maestro --owner=user --repo=project --pr=123 --verbose
```

```bash
# Optimize for speed
export MAESTRO_CHUNK_CONTEXT_LINES=10
export MAESTRO_DEDUPLICATION_THRESHOLD=0.9

# Optimize for accuracy
export MAESTRO_CHUNK_CONTEXT_LINES=20
export MAESTRO_DEDUPLICATION_THRESHOLD=0.7
export MAESTRO_ENABLE_DEPENDENCY_ANALYSIS=true
```
```bash
# Multi-session development workflow
./maestro -i
maestro> /sessions create backend "API development"
maestro> /sessions create frontend "React components"
maestro> /enter backend
# Work in Claude Code for backend
maestro> /exit
maestro> /enter frontend
# Work in Claude Code for frontend

# Mixed AI tool usage
maestro> /gemini search for Go error handling best practices
maestro> /claude implement error handling based on search results
maestro> /ask how does this compare to our current codebase?
```
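
The mixed-tool flow above leans on the smart routing listed earlier: each task is delegated to the tool that fits it best. The following is a deliberately simplified, hypothetical sketch of that dispatch (keyword-based; Maestro's real routing logic is certainly more nuanced):

```go
package main

import (
	"fmt"
	"strings"
)

// route is an illustrative stand-in for smart routing: code-generation
// tasks go to Claude Code, search-style requests to the Gemini CLI,
// and anything else to the repository Q&A path (/ask).
func route(task string) string {
	t := strings.ToLower(task)
	switch {
	case strings.Contains(t, "implement"), strings.Contains(t, "create"):
		return "claude"
	case strings.Contains(t, "search"):
		return "gemini"
	default:
		return "ask"
	}
}

func main() {
	tasks := []string{
		"search for Go error handling best practices",
		"implement error handling based on search results",
		"how does this compare to our current codebase?",
	}
	for _, task := range tasks {
		fmt.Printf("/%s <- %q\n", route(task), task)
	}
}
```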
Maestro is released under the MIT License. See the LICENSE file for details.