Skip to content

evseevgrv/ZO_LLM

Repository files navigation

Leveraging Coordinate Momentum in SignSGD and Muon: Memory-Optimized Zero-Order LLM Fine-Tuning

This repository contains the code for experiments applying Jaguar SignSGD, Jaguar Muon and ZO-Muon methods for different LLM Fine-Tuning tasks.

The code is based on the benchmark

Requirements

To install requirements:

pip install -r requirements.txt

Training and Evaluation

To train and evaluate the model in the paper, run this command:

./run_script.sh

Methods

  • zo_ns_jaguar is Jaguar Muon
  • zo_jaguar is Jaguar SignSGD
  • zo_muon is ZO-Muon

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •