Cycle Consistent Model Merging

Merging models in a cycle-consistent fashion.

Development installation

This project requires a wandb account for storing checkpoints and logs.

Setup the development environment:

git clone git@github.com:crisostomi/cycle-consistent-model-merging.git
cd cycle-consistent-model-merging
uv sync

Run the tests:

pytest -v

Usage

All the scripts can be found under src/scripts/. Each script has a corresponding configuration file in conf/matching where you can change stuff as dataset and model to use.

Training

You can train models using train.py with a dataset and model of your choice. These must be selected in conf/train.yaml.

Matching two models

get the permutations to align the two models (identified by their seed in the config) by running match_two_models.py. The config is conf/matching.yaml (see inside the config to see the subconfigs).
evaluate the interpolation of the models using evaluate_matched_models.py and the same config used for the previous step. Be sure to have matching.yaml as config in the script itself.

To change the matching technique, you have to change the matcher in conf/matching/match_two_models.yaml. Each matcher has its own config file in conf/matching/matcher/.

To run all the pairs of models with different seeds, run shell_scripts/run_all_seeds.sh.

Matching multiple models

get the permutations to align the models (identified by their seed in the config) by running match_n_models.py. The config is conf/matching_n_models.yaml (see inside the config to see the subconfigs).
evaluate the interpolation of the models using evaluate_matched_models.py and the same config used for the previous step. Be sure to have matching_n_models.yaml as config in the script itself.

To change the matching technique, you have to change the matcher in conf/matching/match_n_models.yaml. Each matcher has its own config file in conf/matching/matcher/.

Merging models

get the merged model by running merge_n_models.py.
evaluate the merged model using evaluate_merged_model.py and the same config used for the previous step.

To change the merging technique, you have to change the merger in conf/matching/merge_n_models.yaml. Each merger has its own config file in conf/matching/merger/.

Reproducing the paper

Merging an increasing number of models

To reproduce the scaling experiment in Figure 7, use script scripts/scaling_merging.py.

Name		Name	Last commit message	Last commit date
Latest commit History 133 Commits
.github/workflows		.github/workflows
conf		conf
data		data
meta		meta
misc		misc
notebooks		notebooks
output		output
shell_scripts		shell_scripts
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg
setup.py		setup.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Cycle Consistent Model Merging

Development installation

Usage

Training

Matching two models

Matching multiple models

Merging models

Reproducing the paper

Merging an increasing number of models

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

crisostomi/cycle-consistent-model-merging

Folders and files

Latest commit

History

Repository files navigation

Cycle Consistent Model Merging

Development installation

Usage

Training

Matching two models

Matching multiple models

Merging models

Reproducing the paper

Merging an increasing number of models

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages