Easy to set up, self-hosted, multi-model AI chatbot & API server.
- Ollama - https://ollama.com/
- Open Web UI - https://openwebui.com/
- Alexis - ./alexis, an Alexa-like chatbot.
- Hardware: see https://github.com/olafrv/nvidia-docker-wsl
Clone the repository and set up the environment:
git clone https://github.com/olafrv/shollama.git
cd shollama
# Create the .env files as described below
mv docker-compose.example.yml docker-compose.yml # adjust if needed
Create env.global.env with the following content:
OLLAMA_BASE_URL=http://ollama:11434
OLLAMA_API_KEY=FOR_GOD_SAKE_CHANGE_ME
Create env.ui.env with the following content:
HF_HUB_OFFLINE=0
WEBUI_NAME="Self Hosted Ollama"
WEBUI_URL="http://localhost:8080"
PORT=8080
ENABLE_SIGNUP=true
ENABLE_OAUTH_SIGNUP=false
ENABLE_REALTIME_CHAT_SAVE=true
ENABLE_ADMIN_EXPORT=false
USE_CUDA_DOCKER=true
ENABLE_EVALUATION_ARENA_MODELS=false
Enable Docker NVIDIA GPU support: install the NVIDIA CUDA drivers and verify them, install the NVIDIA Container Toolkit and verify it, then restart Docker.
See https://github.com/olafrv/nvidia-docker-wsl
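The linked guide is authoritative; as a rough sketch for an apt-based setup (the package source setup, and the CUDA image tag below, are assumptions for Ubuntu/WSL):

nvidia-smi                                           # check the CUDA driver sees the GPU
sudo apt-get install -y nvidia-container-toolkit     # NVIDIA Container Toolkit
sudo nvidia-ctk runtime configure --runtime=docker   # register the NVIDIA runtime with Docker
sudo systemctl restart docker
docker run --rm --gpus all nvidia/cuda:12.4.1-base-ubuntu22.04 nvidia-smi   # GPU visible inside a container?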
Then run the following commands to start the services:
./start.sh
./models.sh
# Ollama requires a GPU (CUDA) to run, but
# this can be disabled in docker-compose.yml.
# Go to http://localhost:8080
# Or run any of the tests: ./test*.sh
./stop.sh
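To confirm both services are up before opening the UI, you can probe the standard endpoints:

curl http://localhost:11434/api/tags   # lists pulled models if Ollama is reachable
curl -I http://localhost:8080          # Open WebUI should answer with an HTTP status line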
If you see: docker: (...) could not select device driver "nvidia" (...) [[gpu]]
Comment out the 'deploy' section of the ollama service in docker-compose.yml.
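That section is the standard Compose GPU reservation; the exact block in this repo's docker-compose.yml may differ, but it typically looks like this:

services:
  ollama:
    deploy:               # comment out this whole block to run on CPU only
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]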
If you see: Cannot find local snapshot (HF_HUB_OFFLINE=1)
Set HF_HUB_OFFLINE=0 in env.ui.env; you can set it back to 1 after the first run.
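For example, as a one-liner (assuming env.ui.env sits in the repo root):

sed -i 's/^HF_HUB_OFFLINE=1/HF_HUB_OFFLINE=0/' env.ui.env   # flip back to 1 after the first successful run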
# Test Ollama API
curl http://localhost:11434/api/generate -d '{ "model": "llama3.2", "prompt": "How are you today?"}'
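By default /api/generate streams one JSON object per chunk; Ollama's standard "stream" option turns that off if you want a single response:

curl http://localhost:11434/api/generate -d '{ "model": "llama3.2", "prompt": "How are you today?", "stream": false }'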
As of 18.07.2025:
Currently, it only works in Chat mode and has limitations: https://code.visualstudio.com/docs/copilot/language-models
You need to define the following setting (and restart VSCode): GitHub › Copilot › Chat › Byok: Ollama Endpoint
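In settings.json this corresponds to roughly the following entry (the setting ID is inferred from the UI label above, so verify it in your VSCode version):

{
  // inferred from the UI label; verify in your VSCode version
  "github.copilot.chat.byok.ollamaEndpoint": "http://localhost:11434"
}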
As of 17.08.2025:
Currently, it works, but not as flawlessly as GitHub Copilot:
https://github.com/feiskyer/chatgpt-copilot
After installing, go to Settings, filter by "ChatGPT", and set the following parameters (ChatGPT > Gpt3):
- Provider LLM: Ollama
- API Key: FOR_GOD_SAKE_CHANGE_ME
- API Base Url: Leave it blank
- Model: custom
- Custom Model: qwen2.5-coder:7b
This model is not downloaded by default, so:
# Edit ./models.sh, or pull it manually by running:
docker exec -it ollama ollama pull qwen2.5-coder:7b
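Afterwards you can confirm the model is available:

docker exec -it ollama ollama list   # the new model should appear here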
- Continue requires three models running to operate (too many): https://docs.continue.dev/customize/model-providers/top-level/ollama