GOLLuM – Gaussian Process Optimized LLMs are here!
One representation to rule them all!
🎯 GOLLuM addresses the challenge of harnessing LLMs for optimization under uncertainty by introducing:
- LLM-based deep kernels, jointly optimized with GPs to preserve the benefits of both
- LLMs to provide a rich and flexible input space for Bayesian optimization
- GPs to model this space with predictive uncertainty for more efficient sampling (see the acquisition sketch below)
🌌 The framework enables a bidirectional feedback loop, sketched in code after this list:
- The GP guides updates to LLM weights to produce more effective embeddings
- These embeddings enhance the GP's probabilistic modeling
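The core idea fits in a few lines: the LLM encoder acts as the feature extractor of a deep kernel, and the GP hyperparameters and LLM weights are trained jointly by maximizing the GP marginal likelihood. Below is a minimal, illustrative sketch using GPyTorch and Hugging Face Transformers; the backbone (`bert-base-uncased`), the mean pooling, the kernel choice, and all names (e.g. `LLMDeepKernelGP`) are stand-ins, not the repo's actual API:

```python
# Illustrative sketch only (not the repo's actual API). Assumes GPyTorch and
# Hugging Face Transformers; backbone, pooling, kernel, and names are stand-ins.
import torch
import gpytorch
from transformers import AutoModel, AutoTokenizer

class LLMDeepKernelGP(gpytorch.models.ExactGP):
    """Deep kernel GP whose feature extractor is a finetunable LLM."""
    def __init__(self, train_ids, train_y, likelihood, encoder):
        super().__init__(train_ids, train_y, likelihood)
        self.encoder = encoder
        self.mean_module = gpytorch.means.ConstantMean()
        self.covar_module = gpytorch.kernels.ScaleKernel(
            gpytorch.kernels.MaternKernel(nu=2.5))

    def forward(self, token_ids):
        # The LLM maps textual templates to embeddings; the GP models that space.
        hidden = self.encoder(input_ids=token_ids).last_hidden_state
        z = hidden.mean(dim=1)  # naive mean pooling, for brevity
        return gpytorch.distributions.MultivariateNormal(
            self.mean_module(z), self.covar_module(z))

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")  # stand-in backbone
encoder = AutoModel.from_pretrained("bert-base-uncased")

# Heterogeneous parameters rendered as textual templates (made-up examples):
texts = ["Ligand: XPhos | Base: DBU | Solvent: THF | Temperature: 60 C",
         "Ligand: SPhos | Base: TEA | Solvent: MeCN | Temperature: 25 C"]
train_ids = tokenizer(texts, padding="max_length", truncation=True,
                      max_length=32, return_tensors="pt").input_ids
train_y = torch.tensor([0.62, 0.18])  # e.g. observed reaction yields

likelihood = gpytorch.likelihoods.GaussianLikelihood()
model = LLMDeepKernelGP(train_ids, train_y, likelihood, encoder)
mll = gpytorch.mlls.ExactMarginalLogLikelihood(likelihood, model)

model.train(); likelihood.train()
opt = torch.optim.Adam(model.parameters(), lr=1e-4)  # LLM weights included
for _ in range(50):
    opt.zero_grad()
    loss = -mll(model(train_ids), train_y)  # negative GP marginal log-likelihood
    loss.backward()  # gradients flow through the GP back into the encoder
    opt.step()
```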
✨ Key features:
- Unified Representation Learning: Uses textual templates to represent heterogeneous parameter types (categorical, numerical, structural)
- GP-Guided LLM Finetuning: Optimizes LLM embeddings through GP marginal likelihood
- Implicit Contrastive Learning: Automatically organizes the latent space into distinct performance regions
- Chemical Reasoning in the Latent Space: Uncovers chemical patterns in extremely low-data regimes
- Architecture Agnostic: Works with various LLM architectures (encoder, decoder, encoder-decoder)
- Domain Agnostic: No requirement for domain-specialized models or pretraining
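Continuing the illustrative sketch above, the GP posterior over the learned embedding space supplies the predictive uncertainty that drives sampling. Here a simple UCB rule stands in for the acquisition function (one of several common choices, not necessarily the one used in the repo):

```python
# Continues the sketch above; UCB is a stand-in acquisition function.
model.eval(); likelihood.eval()
candidates = ["Ligand: XPhos | Base: TEA | Solvent: MeCN | Temperature: 25 C",
              "Ligand: SPhos | Base: DBU | Solvent: THF | Temperature: 60 C"]
cand_ids = tokenizer(candidates, padding="max_length", truncation=True,
                     max_length=32, return_tensors="pt").input_ids
with torch.no_grad():
    post = likelihood(model(cand_ids))           # predictive posterior
    ucb = post.mean + 2.0 * post.stddev          # exploit + explore
next_experiment = candidates[int(ucb.argmax())]  # propose the next run
```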
You can install the environment from a file:
# Recommended (Conda)
conda env create -f environment.yaml
conda activate gollum
# OR (pip-only)
pip install -r requirements.txt
For manual setup or more details, see docs/DEPENDENCIES.md.
Then install the GOLLuM package in editable mode:
pip install -e .
All configuration files for reproducing experiments are included in the configs/ directory. You can launch an experiment with:
python train.py --config=configs/pllm_phi.yaml
Replace pllm_phi.yaml with other config files, such as llm_phi.yaml or pllm.yaml, to run the corresponding variants.
@inproceedings{
rankovic2025gollum,
title={{GOLL}uM: Gaussian Process Optimized {LLM}s {\textemdash} Reframing {LLM} Finetuning through Bayesian Optimization},
author={Bojana Rankovi{\'c} and Philippe Schwaller},
booktitle={ICLR 2025 Workshop on World Models: Understanding, Modelling and Scaling},
year={2025},
url={https://openreview.net/forum?id=2ORViHAUbf}
}
This project is licensed under the Apache 2.0 License. See the LICENSE
file for details.
This work was supported by NCCR Catalysis (grant number 225147), a National Centre of Competence in Research funded by the Swiss National Science Foundation.