AI WebHub is a high-performance, fully self-hosted AI platform for managing, running, and interacting with large language models (LLMs). Designed to work entirely offline, it supports Ollama and OpenAI-compatible APIs and provides a built-in inference engine for Retrieval-Augmented Generation (RAG), enabling enterprise-grade AI deployments.
- Seamless Deployment: Docker, Docker Compose, Kubernetes (Helm/Kustomize), and native Python installation for flexible deployment.
- Multi-LLM Support: Integrate Ollama, OpenAI APIs, LMStudio, GroqCloud, Mistral, OpenRouter, and more.
- Offline & Secure: Fully functional offline mode; supports granular RBAC and SCIM 2.0 provisioning.
- Modern UI/UX: Responsive, PWA-ready Web UI with Markdown, LaTeX, and live multimedia support.
- Python Function Integration: Bring Your Own Function (BYOF) for custom Python tools inside the LLM workspace (see the sketch after this list).
- RAG & Web Integration: Local document RAG, web searches (SearXNG, Google, DuckDuckGo, Bing), and live web content injection.
- Model Builder & Management: Create, import, and manage Ollama models via a clean Web UI.
- Image Generation: Integrates AUTOMATIC1111, ComfyUI (local), or DALL-E for rich visual AI content.
- Pipelines & Plugins: Extend AI WebHub with Python plugins and custom pipelines.
- Multilingual Support: Full i18n support for global accessibility.
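As a taste of BYOF, here is a minimal sketch of a custom Python tool. The `Tools` class convention, and the idea that each public method is exposed to the model as a callable tool, are illustrative assumptions rather than AI WebHub's documented API:

```python
# Hypothetical BYOF tool sketch. The `Tools` class convention and automatic
# registration are assumptions for illustration, not a documented API.
import datetime


class Tools:
    def get_current_time(self, timezone_offset_hours: int = 0) -> str:
        """Return the current UTC time shifted by the given offset.

        :param timezone_offset_hours: hours to add to UTC (e.g. -5 for EST).
        """
        now = datetime.datetime.now(datetime.timezone.utc)
        shifted = now + datetime.timedelta(hours=timezone_offset_hours)
        return shifted.strftime("%Y-%m-%d %H:%M:%S")
```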
Requires Python 3.11+:

```bash
pip install ai-webhub
```

Start the server:

```bash
ai-webhub serve
```

Then open the Web UI at http://localhost:8080.
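Once the server is running, you can confirm it is reachable from a script. The snippet below is a plain HTTP check against the default address shown above and assumes nothing about AI WebHub's API surface:

```python
# Quick reachability check for a local AI WebHub instance (default port 8080).
import urllib.request

try:
    with urllib.request.urlopen("http://localhost:8080", timeout=5) as resp:
        print(f"AI WebHub is up (HTTP {resp.status})")
except OSError as exc:
    print(f"Server not reachable: {exc}")
```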
Alternatively, deploy with Docker. To connect to an Ollama instance running on your host machine (the `--add-host` flag lets the container reach it via `host.docker.internal`):

```bash
docker run -d -p 3000:8080 \
  --add-host=host.docker.internal:host-gateway \
  -v ai-webhub:/app/backend/data \
  --name ai-webhub \
  --restart always \
  ghcr.io/karianne50m/ai-webhub:main
```
For a single-container setup with Ollama bundled in, with GPU support via `--gpus all`:

```bash
docker run -d -p 3000:8080 --gpus all \
  -v ollama:/root/.ollama \
  -v ai-webhub:/app/backend/data \
  --name ai-webhub \
  --restart always \
  ghcr.io/karianne50m/ai-webhub:ollama
```
To use OpenAI API models only, pass your API key:

```bash
docker run -d -p 3000:8080 \
  -e OPENAI_API_KEY=your_secret_key \
  -v ai-webhub:/app/backend/data \
  --name ai-webhub \
  --restart always \
  ghcr.io/karianne50m/ai-webhub:main
```

With any of these commands, the Web UI is served at http://localhost:3000.
To run fully offline, also set `HF_HUB_OFFLINE=1` so the Hugging Face Hub client never attempts network access:

```bash
export HF_HUB_OFFLINE=1
```
- Server Connection Errors: If the container cannot reach Ollama on localhost, run it with `--network=host` (the UI is then served directly on port 8080 rather than a mapped port).
- Docker Updates: Use Watchtower for automatic container updates.
- Join our Discord community for real-time support.
- RAG Document Integration: Load documents and query them via `#doc_name`.
- Web & Image Content Injection: Integrate live web content and images dynamically.
- Multi-Model Conversations: Simultaneously interact with multiple LLMs.
- Role-Based Access Control (RBAC): Restrict model creation and access to specific users.
- Pipelines & Plugins: Automate workflows, rate limiting, monitoring, and real-time translation (a rate-limiting sketch follows below).
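To illustrate the kind of plugin the pipeline system is meant for, here is a hypothetical rate-limiting filter. The class shape and the `inlet` hook name are assumptions for illustration only, not AI WebHub's documented plugin interface:

```python
# Hypothetical rate-limiting pipeline sketch. The class shape and the `inlet`
# hook are illustrative assumptions, not a documented plugin API.
import time
from collections import defaultdict, deque


class RateLimitPipeline:
    """Reject requests from a user exceeding `max_requests` per `window_s`."""

    def __init__(self, max_requests: int = 10, window_s: float = 60.0):
        self.max_requests = max_requests
        self.window_s = window_s
        self.history: dict[str, deque] = defaultdict(deque)

    def inlet(self, body: dict, user_id: str) -> dict:
        now = time.monotonic()
        window = self.history[user_id]
        # Drop timestamps that have aged out of the sliding window.
        while window and now - window[0] > self.window_s:
            window.popleft()
        if len(window) >= self.max_requests:
            raise RuntimeError("Rate limit exceeded; try again later.")
        window.append(now)
        return body  # Pass the request through unchanged.
```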
Check the roadmap to explore upcoming features. Contributions are welcome: report issues or submit PRs on GitHub.
This repository uses a BSD-3-Clause-style license with an additional clause preserving the AI WebHub branding. See LICENSE for full details.
Connect, collaborate, and contribute via Discord.