This module provides natural language capability for specific features within VA LEAF, such as automatic categorization of IT issue tickets.
It leverages llama.cpp to run self-hosted inference with open models such as Gemma 3.
Prerequisites:
- 6 GB of free RAM
- OCI-compliant container engine such as Docker or Podman
Setup:
- Download a compatible model, such as `gemma-3-4b-it-q4_0.gguf` from Gemma 3
- Navigate to `./docker/cpu` or `./docker/cuda`, depending on CUDA-compatible hardware availability
- Update `./docker/*/docker-compose.yml` with the path to the model
- Start the container (see the sketch below)
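
Starting the container might look like the following; this is a minimal sketch assuming the Docker Compose v2 CLI (`podman compose` behaves the same way):

```sh
# Choose the directory that matches the hardware (CPU-only shown here;
# use ./docker/cuda on hosts with CUDA-compatible GPUs).
cd ./docker/cpu

# docker-compose.yml should already point at the downloaded model
# (e.g. gemma-3-4b-it-q4_0.gguf) from the previous step.
docker compose up -d      # Podman users: podman compose up -d

# Watch the logs until the server reports it is listening.
docker compose logs -f
```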
To quickly check functionality, navigate to http://localhost:8012 (or the relevant hostname) in a browser and send a message.
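
Alternatively, the endpoint can be exercised from the command line. A minimal sketch using curl against the OpenAI-compatible API described below:

```sh
# Send a single chat message to the OpenAI-compatible endpoint.
# If an API key has already been configured (next step), also pass:
#   -H "Authorization: Bearer $LLM_API_KEY"
curl http://localhost:8012/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages": [{"role": "user", "content": "Hello, can you respond?"}]}'
```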
- Generate a secure random key and set it as the environment variable `LLM_API_KEY` (as sketched below)
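
One way to produce a suitable value, sketched with `openssl` (any cryptographically secure random string works; how the value reaches the container depends on the compose setup):

```sh
# Generate a 32-byte random hex string to serve as the shared API key.
export LLM_API_KEY=$(openssl rand -hex 32)

# Keep a copy of this value: the LEAF Agent must be configured with the
# exact same key (see the environment variables below).
echo "$LLM_API_KEY"
```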
LEAF Agent environment variables:
- `LLM_API_KEY` must match the key generated in this configuration
- `LLM_CATEGORIZATION_URL` must match the URL of the llama.cpp OpenAI-compatible Chat Completions API endpoint (e.g. `http://localhost:8012/v1/chat/completions`)
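
For example, the two variables could be exported in the LEAF Agent's environment; the exact mechanism (shell profile, `.env` file, service unit) depends on how the agent is deployed:

```sh
# Must match the key generated for the llama.cpp container above.
export LLM_API_KEY="<generated key>"

# Must point at the llama.cpp OpenAI-compatible Chat Completions endpoint.
export LLM_CATEGORIZATION_URL="http://localhost:8012/v1/chat/completions"
```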