[Bugfix] Fix deepseek V0 precision issue and add acc ci for it #905

Open
wants to merge 1 commit into base: main

MengqingCao
Collaborator

What this PR does / why we need it?

Fix the DeepSeek precision issue on V0 and add an accuracy CI test for it.

How was this patch tested?

CI passed with the newly added test.

@MengqingCao MengqingCao marked this pull request as ready for review May 20, 2025 06:29
@@ -198,6 +198,7 @@ def fused_experts(
     num_experts = w1.shape[0]
     dtype = hidden_states.dtype
     device = hidden_states.device
+    topk_weights = topk_weights.to(dtype)
Collaborator Author

This fix was actually made by @whx-sjtu as part of #863, but I think it should be merged quickly rather than staying blocked in #863.
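
For readers unfamiliar with the change, here is a minimal sketch of what the cast does. The PR does not state the root cause of the precision issue, so this is not a diagnosis. It only shows that mixing half-precision activations with single-precision routing weights changes the result dtype, while casting first keeps everything in the activation dtype. NumPy is used as a stand-in for torch, and `weight_expert_output` is a hypothetical helper, not the PR's `fused_experts`.

```python
import numpy as np


def weight_expert_output(expert_out, topk_weights, cast_weights=True):
    """Scale each token's expert output by its routing weight.

    Hypothetical helper for illustration only; it is not the
    fused_experts function touched by this PR.
    """
    dtype = expert_out.dtype
    if cast_weights:
        # NumPy analogue of the added line:
        #   topk_weights = topk_weights.to(dtype)
        topk_weights = topk_weights.astype(dtype)
    # Broadcast one weight per token across the hidden dimension.
    return expert_out * topk_weights[:, None]


# Assumed shapes and dtypes: 2 tokens, hidden size 4, float16 activations,
# float32 routing weights.
hidden = np.ones((2, 4), dtype=np.float16)
weights = np.array([0.25, 0.75], dtype=np.float32)

mixed = weight_expert_output(hidden, weights, cast_weights=False)
aligned = weight_expert_output(hidden, weights, cast_weights=True)

print(mixed.dtype)    # promoted to the wider dtype
print(aligned.dtype)  # stays in the activation dtype
```

Whether the promoted or the aligned dtype is the right one depends on what the downstream kernels expect; the sketch only makes the difference visible.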

Signed-off-by: MengqingCao <cmq0113@163.com>