
Calibrating LLMs with Label Smoothing

Introduction

This is the official repository containing the supporting code for our paper Calibrated Language Models and How to Find Them with Label Smoothing, presented at ICML 2025.

This repository is built on top of two public repositories.

Each is included in its own folder and ships its own requirements.txt file. If you run into issues while installing or running the code, please refer to those original repositories.
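For context, the technique the paper studies is label smoothing applied to the next-token cross-entropy objective. The snippet below is only a minimal sketch using PyTorch's built-in label_smoothing argument, assuming a standard causal-LM setup; it is not the exact objective, hyperparameters, or code used in the paper.

```python
# Minimal sketch of a label-smoothed next-token loss (illustration only,
# not the paper's training recipe).
import torch.nn.functional as F

def smoothed_lm_loss(logits, labels, smoothing=0.1, ignore_index=-100):
    """Cross-entropy over the vocabulary with label smoothing.

    logits: (batch, seq_len, vocab_size); labels: (batch, seq_len).
    `smoothing` spreads that much probability mass uniformly over the vocabulary,
    which tends to soften overconfident predictions.
    """
    vocab_size = logits.size(-1)
    return F.cross_entropy(
        logits.reshape(-1, vocab_size),
        labels.reshape(-1),
        label_smoothing=smoothing,   # 0.0 recovers standard cross-entropy
        ignore_index=ignore_index,   # skip padding / masked prompt tokens
    )
```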

FAQ

Here are some issues that we ran into, along with notes that may be helpful.

Installing flash-attn is slow.

We ran into this as well; our solution was to install the package directly from the prebuilt wheels here, matching your torch, gcc, and CUDA versions. We generally use the abiFalse version of any wheel.

A specific tokenizer does not work when training models.

You may have to add some handling code in open-instruct/open_instruct/dataset_transformation.py if your tokenizer isn't directly supported.
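As a rough, hypothetical example of the kind of handling code we mean (the actual entry points in dataset_transformation.py depend on your tokenizer and may differ), you might need to ensure the tokenizer defines a pad token and a chat template before tokenization:

```python
# Hypothetical sketch; adapt to the actual hooks in
# open-instruct/open_instruct/dataset_transformation.py.
from transformers import AutoTokenizer

def prepare_tokenizer(name_or_path: str):
    tokenizer = AutoTokenizer.from_pretrained(name_or_path)
    # Some tokenizers ship without a pad token, which breaks batched training.
    if tokenizer.pad_token is None:
        tokenizer.pad_token = tokenizer.eos_token
    # Some tokenizers lack a chat template; fall back to a simple one if needed.
    if tokenizer.chat_template is None:
        tokenizer.chat_template = (
            "{% for message in messages %}"
            "<|{{ message['role'] }}|>\n{{ message['content'] }}\n"
            "{% endfor %}"
        )
    return tokenizer
```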

CUDA Out of Memory

We generally suggest using at least 4 NVIDIA A100 80GB GPUs for training models. For testing/benchmarking, a single 80GB GPU is sufficient, but this can vary depending on the model (Gemma2 does not use flash-attention and may therefore require more resources).

Citation

@inproceedings{huang2025calibrated,
    title={Calibrated Language Models and How to Find Them with Label Smoothing},
    author={Jerry Huang and Peng Lu and Qiuhao Zeng},
    booktitle={Forty-second International Conference on Machine Learning},
    year={2025},
    url={https://openreview.net/forum?id=soLNj4l2EL}
}
