[OpenSora-hpcai] add support for MS 2.7 and OSv1.2 performance optimization #687

hadipash · 2024-10-09T09:53:33Z

Add:

Support for MS2.7
Improve performance

Tests were conducted in dynamic DVM mode, on MS daily from 09.04 with CANN 8.0 RC2. Results include training step average time only (no data loading time):

Changes	Shape (res x frames x batch)	Time (s)	Change (s)	Comment
Original	720p x 51 x 2	30.409
	144p x 204 x 10	19.934
~~Switch to `repeat_interleave_ext_v2`~~	720p x 51 x 2	28.913	-1.496 (-4.9%)
	144p x 204 x 10	19.872	-0.062 (-0.3%)
Remove SiLU & GELU FP32 upcast	720p x 51 x 2	30.346	-0.062 (-0.2%)	No performance improvement,
	144p x 204 x 10	20.506	+0.572 (+2.9%)	will consult with the MS team.
Convert parameters to BF16	720p x 51 x 2	28.957	-1.452 (-4.8%)
	144p x 204 x 10	18.747	-1.187 (-3.9%)
Remove redundant `ops.transpose` in VAE	720p x 51 x 2	30.448	+0.040 (+0.1%)	No changes due to the kernel fusion.
	144p x 204 x 10	20.103	+0.168 (+0.8%)	Beneficial in KBK & PyNative modes.

Final improvement	720p x 51 x 2	27.896	-2.512 (-8.3%)
	144p x 204 x 10	18.804	-1.130 (-5.7%)

zhtmike

seems no code change for Convert parameters to BF16 ?

hadipash · 2024-10-10T02:20:32Z

seems no code change for Convert parameters to BF16 ?

This refers to the network parameters that are explicitly defined with nn.Parameter(), such as self.scale_shift_table. For some reason, any calculations performed on self.scale_shift_table are upcast to the parameter type (i.e. fp32) and the new type is propagated in the network, even with AMP enabled.

# Conflicts: # examples/opensora_hpcai/opensora/models/layers/blocks.py # examples/opensora_hpcai/opensora/utils/model_utils.py

# Conflicts: # examples/opensora_hpcai/scripts/inference.py

# Conflicts: # examples/opensora_hpcai/opensora/utils/model_utils.py

- Added PR links to model components where specific PRs exist (#1288, #1148) - Added PR links to examples models that have individual PRs (#1378, #1233, #1363, #1243, #687, #1362, #1227, #1346, #1200, #1369) - Noted that some components were added as part of broader pipeline implementations - Improved traceability for specific model additions

performance optimization

de01058

hadipash requested review from CaitinZhao, SamitHuang, vigo999 and zhanghuiyao as code owners October 9, 2024 09:53

hadipash requested a review from zhtmike October 9, 2024 09:55

linting

37fd542

zhtmike approved these changes Oct 9, 2024

View reviewed changes

hadipash added 3 commits October 10, 2024 16:01

fix

8d6d25b

linting

39a15fc

Merge branch 'master' into perf_op

4a83b34

# Conflicts: # examples/opensora_hpcai/opensora/models/layers/blocks.py # examples/opensora_hpcai/opensora/utils/model_utils.py

SamitHuang approved these changes Feb 28, 2025

View reviewed changes

hadipash added 3 commits March 7, 2025 11:28

Merge branch 'master' into perf_op

5c01279

# Conflicts: # examples/opensora_hpcai/scripts/inference.py

Merge branch 'master' into perf_op

8bffac6

# Conflicts: # examples/opensora_hpcai/opensora/utils/model_utils.py

drop custom repeat_interleave

7e5b4ef

This was referenced Jun 12, 2025

[OpenSora-hpcai] OSv1.2 performance optimization hadipash/mindone#15

Merged

[OpenSora HPC-AI] OpenSora v2.0 train + OpenSora v1.2 optimization #1075

Open

zhtmike approved these changes Jul 18, 2025

View reviewed changes

hadipash added 2 commits October 15, 2025 15:44

Merge branch 'master' into perf_op

3fca449

update code to add support for MS2.7

83bfe94

hadipash changed the title ~~[OpenSora-hpcai] OSv1.2 performance optimization~~ [OpenSora-hpcai] add support for MS 2.7 and OSv1.2 performance optimization Oct 17, 2025

vigo999 approved these changes Oct 18, 2025

View reviewed changes

vigo999 added this pull request to the merge queue Oct 18, 2025

Merged via the queue into mindspore-lab:master with commit 48a8dea Oct 18, 2025
3 checks passed

hadipash deleted the perf_op branch October 31, 2025 06:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[OpenSora-hpcai] add support for MS 2.7 and OSv1.2 performance optimization #687

[OpenSora-hpcai] add support for MS 2.7 and OSv1.2 performance optimization #687

Uh oh!

hadipash commented Oct 9, 2024 •

edited

Loading

Uh oh!

zhtmike left a comment

Uh oh!

hadipash commented Oct 10, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[OpenSora-hpcai] add support for MS 2.7 and OSv1.2 performance optimization #687

[OpenSora-hpcai] add support for MS 2.7 and OSv1.2 performance optimization #687

Uh oh!

Conversation

hadipash commented Oct 9, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zhtmike left a comment

Choose a reason for hiding this comment

Uh oh!

hadipash commented Oct 10, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

hadipash commented Oct 9, 2024 •

edited

Loading