AI Assistant API

A powerful AI assistant API that combines document search, database querying, and natural language processing capabilities.

Features

Dual LLM Support:
- Local LLM (default: deepseek-r1:7b via Ollama)
- Cloud LLM (via OpenRouter API)
- Dynamic switching between providers
OpenRouter Integration:
- Access to multiple LLM providers through a single API
- Supported models:
  - Google Gemini Pro
  - Anthropic Claude
  - Meta Llama
  - Mistral
- Cost-effective API usage
- Automatic fallback handling
Document Processing:
- Supports PDF, DOCX, TXT, and HTML files
- Maximum file size: 10MB
- Vector storage using Weaviate
- Semantic search with hybrid mode support
- Document chunking with overlap
- Document status management (active/inactive)
- Multi-user document access
- Document sharing capabilities
- Metadata-based filtering
Database Integration:
- PostgreSQL database support
- Natural language to SQL conversion
- Schema-aware query generation
- Automatic database initialization
URL Processing:
- Automatic URL content extraction
- Content caching with Redis
- 5MB max URL content size
- 10-second fetch timeout

Technical Stack

Backend Framework: FastAPI
Vector Store: Weaviate
Cache: Redis
Database: PostgreSQL
Embedding Model: BAAI/bge-small-en
Authentication: JWT
Local LLM: Ollama
Cloud LLM: OpenRouter API

Configuration

Key settings (configurable via environment variables):

# LLM Settings
LLM_API_KEY=[your-openrouter-api-key]  # Get from https://openrouter.ai/keys
LLM_MODEL=google/gemini-2.0-flash-001  # OpenRouter model identifier
LLM_LOCAL_MODEL=deepseek-r1:7b  # Ollama model name
LLM_PROVIDER=local  # 'local' for Ollama or 'cloud' for OpenRouter
TEMPERATURE=0.7

# Infrastructure
DOCKER_BUILDKIT=1
COMPOSE_DOCKER_CLI_BUILD=1

# Database
POSTGRES_HOST=postgres
POSTGRES_PORT=5432
POSTGRES_DB=[your-database-name]
POSTGRES_USER=[your-username]
POSTGRES_PASSWORD=[your-password]

# Vector Store
WEAVIATE_URL=http://weaviate:8080

API Endpoints

/api/chat: Main chat endpoint
/api/documents: Document management
- POST /api/documents/upload: Upload new document
- GET /api/documents/list: List user's documents
- DELETE /api/documents/{doc_id}: Delete document
- PATCH /api/documents/{doc_id}: Update document status
- DELETE /api/documents/clear: Clear all user documents
/api/system: System settings and model switching
- GET /api/system/models: List available models
- POST /api/system/switch-provider: Switch between local/cloud
/api/auth: Authentication endpoints

Security

JWT-based authentication
File type validation
MIME type checking
Request rate limiting
Input sanitization

Development

Clone the repository

Install dependencies:

pip install -r requirements.txt
pip install -r requirements-test.txt  # Install test dependencies

Set up environment variables:
- Copy .env.example to .env
- Add your OpenRouter API key
Run services:
```
docker-compose up -d
```

Testing

The project includes comprehensive test suites:

Unit tests

Running Tests

Use pytest to run the tests:

pytest

Test Options

Test Structure

backend/tests/
└── unit/               # Unit tests
    ├── api/            # API tests
    └── services/       # Service tests

Production Considerations

Change default admin password
Set proper JWT secret key
Configure appropriate rate limits
Adjust token limits based on usage
Monitor vector store performance
Set up proper logging
Configure CORS settings
Secure your OpenRouter API key
Monitor OpenRouter API usage and costs

License

[Your License Here]

Vector Store Schema

{
  "Documents": {
    "properties": [
      {"name": "text", "dataType": "text"},
      {"name": "filename", "dataType": "text"},
      {"name": "doc_id", "dataType": "text"},
      {"name": "chunk_id", "dataType": "int"},
      {"name": "active", "dataType": "text"},
      {"name": "users", "dataType": "text[]"},
      {"name": "file_size", "dataType": "int"},
      {"name": "total_chunks", "dataType": "int"}
    ]
  }
}

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
backend		backend
docker/ollama		docker/ollama
frontend		frontend
.dockerignore		.dockerignore
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
SECURITY.md		SECURITY.md
clean.sh		clean.sh
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI Assistant API

Features

Technical Stack

Configuration

API Endpoints

Security

Development

Testing

Running Tests

Test Options

Test Structure

Production Considerations

License

Vector Store Schema

About

Uh oh!

Releases

Packages

Uh oh!

Languages

W1neSkin/AI-Assistant

Folders and files

Latest commit

History

Repository files navigation

AI Assistant API

Features

Technical Stack

Configuration

API Endpoints

Security

Development

Testing

Running Tests

Test Options

Test Structure

Production Considerations

License

Vector Store Schema

About

Resources

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages