Heads up! Large Language Models Can Perform Tasks Without Your Instruction via Selective Attention Head Masking

This repository provides code for training attention head masks and for plotting some of the figures presented in our paper Heads up! Large Language Models Can Perform Tasks Without Your Instruction via Selective Attention Head Masking (ICML'25).

Environment

conda create -n headsup python=3.10 -y
conda activate headsup
pip install -r requirements.txt

We use FlashAttention for efficient training. You may install it as needed, or disable FlashAttention in train_mask.py.
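
If FlashAttention is not available in your environment, the usual workaround when loading a model through Hugging Face Transformers is to request a different attention backend. The snippet below is a minimal sketch of that idea; the exact switch inside train_mask.py may differ.

from transformers import AutoModelForCausalLM

# Fall back to the default eager attention instead of FlashAttention.
# Use attn_implementation="flash_attention_2" once flash-attn is installed.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3.1-8B-Instruct",
    attn_implementation="eager",
)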

Download Attention Head Masks

Trained head masks for Meta-Llama-3.1-8B-Instruct on the XNLI and FV datasets are available here (Google Drive). Put the output folder under this directory; you can then directly run the cells in eval.ipynb and some of the cells in playground.ipynb.
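
As a rough illustration of how a trained head mask can be applied (this is not the repository's exact loading code; the file path, the assumed mask shape [num_layers, num_heads], and the hook-based application below are all assumptions for illustration), a binary per-head mask can be loaded and used to zero out the masked heads just before each layer's attention output projection:

import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3.1-8B-Instruct")
# Hypothetical path; assumed shape [num_layers, num_heads] with 0 = masked head.
head_mask = torch.load("output/xnli_head_mask.pt")

head_dim = model.config.hidden_size // model.config.num_attention_heads

def make_hook(layer_mask):
    def hook(module, args):
        hidden = args[0]                             # [batch, seq, num_heads * head_dim]
        b, s, _ = hidden.shape
        hidden = hidden.view(b, s, -1, head_dim)     # split into per-head slices
        hidden = hidden * layer_mask.view(1, 1, -1, 1).to(hidden)
        return (hidden.view(b, s, -1),)
    return hook

# Zero masked heads right before each layer's attention output projection.
for i, layer in enumerate(model.model.layers):
    layer.self_attn.o_proj.register_forward_pre_hook(make_hook(head_mask[i]))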

Train Attention Head Masks

We provide the training scripts under the scripts/ directory. You may modify them to match your own training settings.

bash scripts/llama_xnli.sh      # Train Llama-3.1 on the XNLI dataset

Citation

@inproceedings{han2025heads,
    title={Heads up! Large Language Models Can Perform Tasks Without Your Instruction via Selective Attention Head Masking},
    author={Senyu Han and Hongchuan Zeng and Kai Yu and Lu Chen},
    booktitle={Forty-second International Conference on Machine Learning},
    year={2025},
    url={https://openreview.net/forum?id=x2Dw9aNbvw}
}
