Wire up CLI for scan and host offloading #156

Merged
merged 1 commit into from
Mar 19, 2025

Conversation

tengyifei
Collaborator

@tengyifei tengyifei commented Mar 17, 2025

In the previous PR (#157) we added the ability to replace a sequence of layers with scan, and in an earlier PR (#151) we added the ability to offload tensors by transforming a graph. This PR changes the CLI so that it can select a for-loop, scan, or scan + host offloading.

To mix and match sharding configs with scan/offloading configs without a combinatorial explosion, I added a new config group called remat and put scan and offloading there. Since activation_checkpointing_layers and the optimization barrier are also part of remat, I moved them there too. That leaves sharding as the only config in the scaling group, so I've hoisted sharding to the top level. This moves a bunch of files, but it's mostly s/scaling/sharding/.
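
To illustrate why a separate remat group avoids the combinatorial explosion, here is a minimal sketch in plain Python. It is not the code from this PR: the option names (`fsdp`, `tensor_parallel`, `for_loop`, `scan`, `scan_offload`), the keys (`strategy`, `scan_layers`, `offload`), and the `compose` helper are all hypothetical, chosen only to mirror the three modes mentioned above. The point is that two independent groups need one entry per option, while a single merged group would need one entry per combination.

```python
from itertools import product

# Hypothetical option names for illustration; not the repository's actual configs.
SHARDING = {
    "fsdp": {"strategy": "fsdp"},
    "tensor_parallel": {"strategy": "tensor_parallel"},
}

# Hypothetical remat group: for-loop, scan, and scan + host offloading.
REMAT = {
    "for_loop": {"scan_layers": False, "offload": False},
    "scan": {"scan_layers": True, "offload": False},
    "scan_offload": {"scan_layers": True, "offload": True},
}


def compose(sharding: str, remat: str) -> dict:
    """Pick one option from each independent group and merge them."""
    return {"sharding": SHARDING[sharding], "remat": REMAT[remat]}


# Entries to maintain with two independent groups: one per option.
entries_with_groups = len(SHARDING) + len(REMAT)

# Entries to maintain if every combination had its own config: one per pair.
all_configs = [compose(s, r) for s, r in product(SHARDING, REMAT)]
entries_without_groups = len(all_configs)

print(entries_with_groups, entries_without_groups)
```

With two sharding options and three remat options, the grouped layout maintains five entries while still reaching all six combinations, and the gap widens as either group grows.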

@tengyifei tengyifei force-pushed the yifeit/scan branch 6 times, most recently from b782cb4 to 88a04d4 Compare March 18, 2025 07:52
@tengyifei tengyifei changed the title scan pr to be decomposed Wire up CLI for scan and host offloading Mar 18, 2025
@tengyifei tengyifei marked this pull request as ready for review March 18, 2025 16:25
@tengyifei tengyifei requested review from zpcore, bhavya01 and qihqi March 18, 2025 17:43
Collaborator

@zpcore zpcore left a comment

LGTM! Left one question.

@tengyifei tengyifei enabled auto-merge (squash) March 19, 2025 00:35
@tengyifei tengyifei merged commit b0bd47e into main Mar 19, 2025
11 checks passed
@tengyifei tengyifei deleted the yifeit/scan branch March 19, 2025 01:11
3 participants