Text Generation WebUI for ClassroomCopilot

This directory contains the configuration for running Text Generation WebUI with TensorRT-LLM in a Docker container as part of the ClassroomCopilot project. Note that on Apple Silicon the container falls back to CPU-only inference, since TensorRT-LLM requires NVIDIA hardware (see Apple Silicon Compatibility below).

Setup

Prerequisites

  1. Make sure you have Docker installed and running.
  2. If you want to use GPU acceleration, ensure you have NVIDIA drivers and the NVIDIA Container Toolkit installed.
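
To confirm the prerequisites before starting the container, a quick check like the following should work (the GPU check only applies to NVIDIA hardware, not Apple Silicon):

    # Check that Docker is installed and the daemon is running
    docker info

    # On NVIDIA hardware, confirm the Container Toolkit is working
    # (the image tag is illustrative; substitute any CUDA base image)
    docker run --rm --gpus all nvidia/cuda:12.3.2-base-ubuntu22.04 nvidia-smi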

Models

Place your language models in the following directory:

cc-volumes/text-generation/models/

The container supports several model formats, including:

  • GGUF models (for CPU inference)
  • HuggingFace models
  • TensorRT-LLM optimized models
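
After placing a model, you can sanity-check that both the host and the container see it. The in-container path below is an assumption; verify the actual mount point in docker-compose.yml:

    # List models on the host side
    ls cc-volumes/text-generation/models/

    # Confirm the container sees the same files
    # (/app/models is an assumed mount point; check docker-compose.yml)
    docker-compose exec text-generation-webui ls /app/models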

Running

The container is configured to start automatically with the rest of the ClassroomCopilot services:

docker-compose up -d text-generation-webui

Or you can start all services:

docker-compose up -d

Accessing the WebUI

Once the container is running, you can access the WebUI at:

http://localhost:7861

Or through the Nginx reverse proxy at:

http://textgen.localhost

The API is available at:

http://localhost:5010

Or through the Nginx reverse proxy at:

http://textgen.localhost/api
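
As a quick smoke test, assuming this build enables text-generation-webui's OpenAI-compatible API extension, you can query the API with curl (the request body is a minimal illustration):

    # List the models the API currently knows about
    curl http://localhost:5010/v1/models

    # Minimal chat completion request against the loaded model
    curl http://localhost:5010/v1/chat/completions \
      -H "Content-Type: application/json" \
      -d '{"messages": [{"role": "user", "content": "Hello"}], "max_tokens": 50}'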

Configuration

The container is configured with the following settings:

  • Uses CPU-only inference for compatibility with Apple Silicon
  • Exposes both the web interface (port 7861) and API (port 5010)
  • Mounts volumes for models, LoRAs, presets, characters, and extensions
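
To see which ports and volumes are actually in effect, docker-compose can print the resolved configuration and the status of the running service:

    # Show the fully resolved configuration (ports, volumes, environment)
    docker-compose config

    # Show the running container and its published ports
    docker-compose ps text-generation-webui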

Apple Silicon Compatibility

This container is specifically configured for Apple Silicon (M1/M2/M3) Macs. It uses CPU-only inference, since TensorRT-LLM is not compatible with Apple Silicon. For the best performance:

  1. Use GGUF models, which are well suited to CPU inference
  2. Prefer smaller models (7B parameters or fewer); they will perform noticeably better
  3. Use quantized models (such as Q4_K_M) for faster inference

Recommended Models for Apple Silicon

  • Mistral 7B Instruct GGUF (Q4_K_M)
  • Llama 2 7B Chat GGUF (Q4_K_M)
  • Phi-2 GGUF (Q4_K_M)

You can download these models using the setup script, or place them manually in the models directory.
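
As an example, a quantized Mistral 7B Instruct GGUF can be fetched directly into the models directory. The repository and filename below are illustrative; confirm them on Hugging Face before downloading:

    # Download a Q4_K_M quantized Mistral 7B Instruct GGUF
    curl -L -o cc-volumes/text-generation/models/mistral-7b-instruct-v0.2.Q4_K_M.gguf \
      https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GGUF/resolve/main/mistral-7b-instruct-v0.2.Q4_K_M.gguf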

Troubleshooting

If you encounter issues:

  1. Model loading errors: Ensure your models are in the correct format and location.

  2. GPU issues: Check that your NVIDIA drivers and CUDA are properly installed and that the NVIDIA Container Toolkit is configured.

  3. Container logs: Check the container logs for more detailed error messages:

     docker-compose logs text-generation-webui

  4. Restart the container: Sometimes simply restarting the container can resolve issues:

     docker-compose restart text-generation-webui
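
If a restart does not help, following the logs while reproducing the problem, and then rebuilding the image from scratch, are reasonable next steps (the --build flag assumes the service defines a build context in docker-compose.yml):

    # Follow the logs live while reproducing the problem
    docker-compose logs -f --tail=100 text-generation-webui

    # Rebuild the image and recreate the container
    docker-compose up -d --build --force-recreate text-generation-webui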
