
[Model] Re-add the implicit conversion feature for as_seq_cls_model #21103


Open
wants to merge 5 commits into main

Conversation

noooop
Contributor

@noooop noooop commented Jul 17, 2025

Essential Elements of an Effective PR Description Checklist

  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

Purpose

ForSequenceClassification models using the TRANSFORMERS implementation are currently served as TransformersForCausalLM + as_classification_model, rather than directly as TransformersForSequenceClassification.

That is, implicit conversion is essential.

  • We should implicitly convert ForSequenceClassification models instead of adding them one by one to registry.py (see the sketch after this list).

  • Hitchhiked fixes included in this PR:

    • is_matryoshka

      When matryoshka_dimensions is [] or None, is_matryoshka should be False (see the short sketch after this list).

    • get_and_verify_max_len

      Only consider max_model_len from tokenizer_config for pooling models that use absolute position embeddings.

    • Fix the tensor-parallel (TP) weight loader used by load_weights_no_post_processing.
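
To make the intent concrete, here is a minimal sketch of what architecture resolution with implicit conversion could look like. It is illustrative only: resolve_model_cls, the registry dict, and the way as_seq_cls_model is passed in are hypothetical stand-ins for the real code in vllm/model_executor/models/registry.py and the as_seq_cls_model adapter.

```python
# Illustrative sketch only, not the actual vLLM registry code.
# `registry` maps architecture names to model classes, and `as_seq_cls_model`
# stands in for the adapter that wraps a *ForCausalLM class with a
# sequence-classification head.

_SEQ_CLS_SUFFIX = "ForSequenceClassification"

def resolve_model_cls(arch: str, registry: dict, as_seq_cls_model):
    """Return the model class for `arch`, converting implicitly if needed."""
    if arch in registry:
        # Explicitly registered architectures keep their dedicated class.
        return registry[arch]

    if arch.endswith(_SEQ_CLS_SUFFIX):
        # e.g. "Qwen2ForSequenceClassification" -> "Qwen2ForCausalLM"
        causal_arch = arch[: -len(_SEQ_CLS_SUFFIX)] + "ForCausalLM"
        if causal_arch in registry:
            # Implicit conversion: reuse the causal-LM implementation and
            # attach a classification head instead of registering every
            # *ForSequenceClassification variant by hand.
            return as_seq_cls_model(registry[causal_arch])

    raise ValueError(f"Model architecture {arch!r} is not supported.")
```

With something along these lines in place, a newly registered *ForCausalLM model gets its sequence-classification variant for free, which is the behaviour this PR restores.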
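
And a tiny sketch of the intended is_matryoshka behaviour from the hitchhiked fix, written as a standalone helper rather than the actual config property in vLLM:

```python
# Hypothetical helper showing the intended semantics of the is_matryoshka fix:
# an empty matryoshka_dimensions list should behave like None, i.e. the model
# is not treated as a Matryoshka embedding model.

def is_matryoshka(matryoshka_dimensions) -> bool:
    # bool([]) and bool(None) are both False, so an empty list no longer
    # counts as "Matryoshka enabled".
    return bool(matryoshka_dimensions)

assert is_matryoshka([256, 512]) is True
assert is_matryoshka([]) is False
assert is_matryoshka(None) is False
```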

cc @DarkLight1337 @maxdebayser

Test Plan

pytest -s -vvv tests/models/test_transformers.py::test_classify
pytest -s -vvv tests/models/test_initialization.py::test_implicit_converted_models

Test Result

passed

(Optional) Documentation Update

Known Issues

ForSequenceClassification models using the TRANSFORMERS implementation are still served as TransformersForCausalLM + as_classification_model, rather than directly as TransformersForSequenceClassification.

as_classification_model only supports a GPT-style score head and does not support a BERT-style classifier head, which prevents vLLM from supporting models like DebertaV2ForSequenceClassification.
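
As a rough illustration of the gap (simplified modules, not the actual vLLM or Transformers implementations): the GPT-style score head that as_classification_model can attach is a single linear projection on a hidden state, while the BERT/DeBERTa-style classifier head adds a pooler and dropout before the classifier layer.

```python
import torch
import torch.nn as nn

class GPTStyleScoreHead(nn.Module):
    """GPT-style: a single linear score layer, usually applied to the
    hidden state of the last non-padding token."""

    def __init__(self, hidden_size: int, num_labels: int):
        super().__init__()
        self.score = nn.Linear(hidden_size, num_labels, bias=False)

    def forward(self, last_token_hidden: torch.Tensor) -> torch.Tensor:
        return self.score(last_token_hidden)

class BertStyleClassifierHead(nn.Module):
    """BERT/DeBERTa-style: pooler (dense + tanh), dropout, then a classifier,
    usually applied to the [CLS]/first-token hidden state."""

    def __init__(self, hidden_size: int, num_labels: int, dropout: float = 0.1):
        super().__init__()
        self.dense = nn.Linear(hidden_size, hidden_size)
        self.dropout = nn.Dropout(dropout)
        self.classifier = nn.Linear(hidden_size, num_labels)

    def forward(self, first_token_hidden: torch.Tensor) -> torch.Tensor:
        pooled = torch.tanh(self.dense(first_token_hidden))
        return self.classifier(self.dropout(pooled))
```

Supporting heads of the second shape is roughly what would be needed to cover models like DebertaV2ForSequenceClassification.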

noooop added 2 commits July 17, 2025 15:55
Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: wang.yuqi <noooop@126.com>
@mergify mergify bot added the llama (Related to Llama models), new-model (Requests to new models), and qwen (Related to Qwen models) labels on Jul 17, 2025
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request re-introduces implicit model conversion for sequence classification models, which is a great simplification. The changes involve refactoring the model registry and configuration logic to automatically handle these conversions. My review found a critical issue in the model registry where a cached object is being mutated, which could lead to incorrect behavior for other models. I've provided a detailed explanation and a suggested fix for this issue.

noooop added 2 commits July 17, 2025 16:25
Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: wang.yuqi <noooop@126.com>

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only fastcheck CI runs, covering a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

Signed-off-by: wang.yuqi <noooop@126.com>
Member

@hmellor hmellor left a comment


This is a super cool PR! Thanks for enabling this for the Transformers backend!

Would it be better if the Transformers backend directly supported ForSequenceClassification instead of the substitution you're doing in vllm/model_executor/models/registry.py?

@noooop
Contributor Author

noooop commented Jul 17, 2025

This is a super cool PR! Thanks for enabling this for the Transformers backend!

Would it be better if the Transformers backend directly supported ForSequenceClassification instead of the substitution you're doing in vllm/model_executor/models/registry.py?

Having the Transformers backend directly support ForSequenceClassification would definitely be better, since it would let users run models like DebertaV2ForSequenceClassification, which vLLM's as_classification_model does not support.

I am not familiar with the Transformers backend and don't know what difficulties might arise.

@hmellor
Member

hmellor commented Jul 17, 2025

Once #20543 is merged I can look at adding explicit support for ForSequenceClassification models to the Transformers backend.

To clarify, is this PR specifically about enabling the Transformers backend or is it useful for other reasons too?

@noooop
Contributor Author

noooop commented Jul 17, 2025

This PR should wait for #21058 to land.

cc @DarkLight1337 @maxdebayser

Labels
llama (Related to Llama models), new-model (Requests to new models), qwen (Related to Qwen models)