LoRAM is a memory-efficient LoRA training scheme: it trains low-rank matrices on a pruned model, then merges them into the original model for inference.
Jun Zhang¹, Jue Wang¹, Huan Li¹, Lidan Shou¹, Ke Chen¹, Yang You², Guiming Xie³, Xuejian Gong³, Kunlong Zhou³

¹ Zhejiang University, ² National University of Singapore, ³ OPPO AI Center
✅ Train LoRA on a pruned model to reduce memory footprint
✅ Recover LoRA for high-quality full model inference
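The two steps above can be sketched in a few lines of NumPy. This is a minimal illustration of the idea, not the repository's implementation: the dimensions, the kept-index sets, and the randomly initialized "trained" factors are all hypothetical stand-ins.

```python
import numpy as np

rng = np.random.default_rng(0)

d_in, d_out, r = 8, 8, 2            # toy full-model dims and LoRA rank
W = rng.normal(size=(d_out, d_in))  # frozen full-model weight

# Structured pruning: keep only a subset of rows/columns (indices illustrative).
keep_out = np.array([0, 1, 2, 3, 5, 6])
keep_in = np.array([0, 2, 3, 4, 6, 7])
W_pruned = W[np.ix_(keep_out, keep_in)]  # the smaller weight LoRA trains against

# "Train" LoRA factors at the pruned shape (random stand-ins for trained values).
A_p = rng.normal(size=(r, keep_in.size)) * 0.01   # (r, pruned_in)
B_p = rng.normal(size=(keep_out.size, r)) * 0.01  # (pruned_out, r)

# Recovery: zero-fill the pruned coordinates so the low-rank update
# aligns with the original weight's dimensions.
A = np.zeros((r, d_in))
A[:, keep_in] = A_p
B = np.zeros((d_out, r))
B[keep_out, :] = B_p

# Merge for inference on the ORIGINAL (unpruned) model.
W_merged = W + B @ A

# The recovered update only touches coordinates that survived pruning.
delta = W_merged - W
assert np.allclose(delta[np.ix_(keep_out, keep_in)], B_p @ A_p)
```

The zero-filled rows and columns mean the merged update leaves pruned coordinates of `W` untouched, which is what lets low-rank matrices trained on the small model be applied to the large one.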
Clone the repository and install dependencies:
```shell
git clone https://github.com/your-repo/LoRAM.git
cd LoRAM/loram
```
This project builds on the excellent work of LLM-Pruner and SparseGPT. LoRAM leverages these tools, and we appreciate their contributions to the research community.
If you find the resources in this repository useful, please cite our paper:
```bibtex
@inproceedings{zhang2025train,
  title={Train Small, Infer Large: Memory-Efficient Lo{RA} Training for Large Language Models},
  author={Jun Zhang and Jue Wang and Huan Li and Lidan Shou and Ke Chen and Yang You and Guiming Xie and Xuejian Gong and Kunlong Zhou},
  booktitle={The Thirteenth International Conference on Learning Representations},
  year={2025},
  url={https://openreview.net/forum?id=s7DkcgpRxL}
}
```