
Conversation

yingtongxiong (Collaborator)

Motivation

Support selective-checkpoint-offload for 2D attention, which offloads the attention activations to CPU when selective activation checkpointing is enabled.
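
For intuition, here is a minimal Python sketch of the offload/reload idea, assuming fa_output_mapping is a plain dict keyed by layer index. The helper functions and the detach/non_blocking details are illustrative assumptions, not the PR's actual code:

```python
import torch

# fa_output_mapping: layer_idx -> CPU copy of that layer's attention output.
fa_output_mapping = {}

def offload_attn_output(layer_idx: int, attn_out: torch.Tensor) -> None:
    # Hypothetical helper: device-to-host copy during the checkpointed
    # forward; with pinned memory, non_blocking=True lets the transfer
    # overlap with the remaining forward compute.
    fa_output_mapping[layer_idx] = attn_out.detach().to("cpu", non_blocking=True)

def reload_attn_output(layer_idx: int, device: torch.device) -> torch.Tensor:
    # Hypothetical helper: bring the activation back for the backward
    # recomputation; pop() frees the CPU copy once it has been consumed.
    return fa_output_mapping.pop(layer_idx).to(device, non_blocking=True)
```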

Modification

internlm/model/ops/ring_flash_attn/zigzag_ring_flash_attn_with_sliding_window.py

Use cases (Optional)

[screenshot: use-case illustration attached in the original PR]

TODO

loss testing

_ckpt_block_num = int(gpc.config.model.checkpoint * gpc.config.isp_num_layers)

if gpc.is_forward is False and gpc.config.selective_checkpoint:
    assert layer_idx in fa_output_mapping
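
To unpack the quoted lines: _ckpt_block_num is the number of layers under activation checkpointing, and the assert encodes the invariant that by the time the backward recomputation runs (gpc.is_forward is False) with selective_checkpoint enabled, the forward pass must already have offloaded this layer's attention output into fa_output_mapping. A small worked example with made-up numbers (the real values come from gpc.config, and model.checkpoint is assumed to be a fraction in [0, 1]):

```python
# Hypothetical config values; in InternEvo they come from gpc.config.
checkpoint_ratio = 0.5   # gpc.config.model.checkpoint, assumed a fraction of layers
isp_num_layers = 32      # gpc.config.isp_num_layers

# Number of blocks whose activations are checkpointed (and hence offloaded).
_ckpt_block_num = int(checkpoint_ratio * isp_num_layers)  # -> 16
assert 0 <= _ckpt_block_num <= isp_num_layers
```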
Collaborator
The initialization of fa_output_mapping at line 11 can now be removed.

Collaborator Author

Oh right, okay.

Collaborator Author

done

huangting4201 merged commit caf30d8 into InternLM:develop on Feb 25, 2025
25 checks passed
