Add Eagle-3 Qwen support (follow-up to #20436) #2

rahul-tuli · 2025-07-16T03:01:47Z

Summary

This PR adds support for Eagle-3 speculative decoding with Qwen models, as a follow-up to vllm-project#20436.

The changes enable Qwen models to work with Eagle-3 draft models in speculators format.

Related Issues

Follows up on vllm-project/vllm#20436 (feat: Add support for speculators Eagle checkpoints), which added initial Eagle speculators support.
This PR is based on the feat/speculators-eagle-support branch.

Changes

Added Qwen support to Eagle-3 implementation
Proper handling of Qwen model architecture in Eagle-3 speculative decoding

Testing

Tested with the following verification script:

#!/usr/bin/env python3
"""
Verification script for Qwen Eagle-3 support.
"""

from vllm import LLM, SamplingParams

# Initialize vLLM with Qwen model and Eagle-3 draft model
llm = LLM(
    model="Qwen/Qwen2.5-7B-Instruct",
    speculative_config={
        "method": "eagle",
        "model": "nm-testing/Qwen3-8B-Eagle3-speculators-converted",
        "num_speculative_tokens": 5
    },
    max_model_len=1024,
    enforce_eager=True
)

# Test generation
outputs = llm.generate(
    ["The future of AI is"], 
    SamplingParams(max_tokens=20, temperature=0)
)

print(f"Generated: {outputs[0].outputs[0].text}")
print("✅ Eagle-3 with Qwen works!")

Output confirms that Eagle-3 speculative decoding works correctly with Qwen models.
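
For readers unfamiliar with what num_speculative_tokens controls, here is a toy sketch of one greedy verification step in speculative decoding: the draft model proposes several tokens, and the target model keeps the longest prefix it agrees with, plus one token of its own. This is an illustration only, not vLLM's implementation. The names verify_draft and toy_target are hypothetical, and a real engine scores all draft tokens in a single batched forward pass rather than one call per token.

```python
from typing import Callable, List


def verify_draft(
    target_next: Callable[[List[int]], int],
    context: List[int],
    draft_tokens: List[int],
) -> List[int]:
    """Return the tokens accepted in one greedy speculative-decoding step.

    Hypothetical helper for illustration; a real engine verifies all draft
    tokens in one target forward pass instead of calling the target per token.
    """
    accepted: List[int] = []
    for proposed in draft_tokens:
        expected = target_next(context + accepted)
        if proposed != expected:
            # First mismatch: keep the target's own token and stop.
            accepted.append(expected)
            return accepted
        accepted.append(proposed)
    # Every draft token matched, so the target contributes one bonus token.
    accepted.append(target_next(context + accepted))
    return accepted


def toy_target(tokens: List[int]) -> int:
    """Toy stand-in for the target model: predicts (last token + 1) mod 10."""
    return (tokens[-1] + 1) % 10


# Five draft tokens, as in num_speculative_tokens=5.
full_match = verify_draft(toy_target, [1], [2, 3, 4, 5, 6])
partial_match = verify_draft(toy_target, [1], [2, 3, 9, 9, 9])
print(full_match)
print(partial_match)
```

With temperature=0, as in the script above, the accepted tokens are exactly those the target would have produced on its own, so the draft model affects speed rather than output.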

github-actions · 2025-07-16T03:01:55Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to catch errors quickly. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

gemini-code-assist

Code Review

This PR adds support for Eagle-3 speculative decoding with Qwen models. The return type hint in Qwen2Model.forward is incorrect and should be addressed.
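
The review summary does not say what the correct annotation is. As a hypothetical illustration of how this kind of mismatch can arise: Eagle-3 draft models consume hidden states from intermediate target layers, so a forward method that optionally returns those alongside the final hidden states needs a union return type. The sketch below uses plain Python lists in place of tensors. The names forward, aux_layers, and HiddenStates are assumptions for illustration and are not taken from qwen2.py.

```python
from typing import List, Tuple, Union

# Stand-in for a tensor; assumption for illustration only.
HiddenStates = List[float]


def forward(
    hidden: HiddenStates,
    aux_layers: Tuple[int, ...] = (),
) -> Union[HiddenStates, Tuple[HiddenStates, List[HiddenStates]]]:
    """Toy three-layer forward pass with optional auxiliary outputs.

    Returns only the final hidden states when no auxiliary layers are
    requested; otherwise returns (final, [captured intermediate states]).
    """
    aux: List[HiddenStates] = []
    for layer_idx in range(3):
        if layer_idx in aux_layers:
            # Capture the input to this layer for the draft model.
            aux.append(list(hidden))
        # Toy "layer": add 1.0 to every element.
        hidden = [h + 1.0 for h in hidden]
    if aux:
        return hidden, aux
    return hidden


plain = forward([0.0])
with_aux = forward([0.0], aux_layers=(0, 2))
print(plain)
print(with_aux)
```

Annotating such a function as returning only the single-tensor type would be inaccurate whenever auxiliary layers are requested, which is the sort of issue a type-hint review comment typically targets.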

vllm/model_executor/models/qwen3.py

vllm/model_executor/models/qwen2.py

vllm/config.py

Add: preliminary qwen support

8724f40

gemini-code-assist bot reviewed Jul 16, 2025


rahul-tuli added 2 commits July 15, 2025 23:06

backward compatibility

df0a336

Enable: vllm serve <speculators_model>

26a74b8

dsikka reviewed Jul 16, 2025


vllm/model_executor/models/qwen3.py (resolved)

vllm/model_executor/models/qwen2.py (resolved)

vllm/config.py (resolved)

rahul-tuli added 2 commits July 17, 2025 14:17

Remove support to disable autodetection

984ba42

Update: condition to reflect real support

a3dd2d0

rahul-tuli merged commit a3dd2d0 into feat/speculators-eagle-support Jul 17, 2025
1 of 2 checks passed
