Skip to content

[V0.9.1][BugFix] Fix load weight error and add new e2e case #1651

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jul 8, 2025

Conversation

shikang-hangzhou
Copy link

@shikang-hangzhou shikang-hangzhou commented Jul 7, 2025

What this PR does / why we need it?

  1. Quant parameters has been modified but DBO file didn't match it.
  2. remove useless init code, mostly reuse v2 init code.
  3. add DBO e2e case and remove case skip.

Does this PR introduce any user-facing change?

None

How was this patch tested?

‘tests/multicard/test_offline_inference_distributed.py’

Signed-off-by: shikang-hangzhou <459956190@qq.com>
@ganyi1996ppo ganyi1996ppo merged commit 5559443 into vllm-project:v0.9.1-dev Jul 8, 2025
16 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants