[Bug]: Qwen3 Rerank model has accuracy problems #20478

@MrBlue-1996

Your current environment

Installed the latest vLLM pip package following the official website instructions (the daily build, version 0.9.2rc, main branch). The GPU is an H20.

🐛 Describe the bug

Launch command:

python3 -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --port 8181 --served-model-name Qwen3-Rerank --model /mnt/data/.t1/dianjin-0701/Qwen3-Reranker-0.6B --task score --hf_overrides '{"architectures": ["Qwen3ForSequenceClassification"],"classifier_from_token": ["no", "yes"],"is_original_qwen3_reranker": true}' &> start.log &

curl command:

curl http://127.0.0.1:8181/v1/rerank \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "Qwen3-Rerank",
    "query": "什么是机器学习?",
    "documents": [
      "机器学习是人工智能的一个分支,通过算法让计算机从数据中学习模式",
      "机器学习是一种编程语言,用于开发网站",
      "机器学习是数据库管理系统的一种",
      "机器学习是操作系统的一种类型"
    ],
    "top_n": 2
  }'

(The query asks "What is machine learning?". Only the first document is a correct answer; the other three describe machine learning as a programming language, a database management system, and an operating system.)

Result:

[Image: screenshot of the Qwen3-Rerank result]
Running the same test with the bge-reranker-v2-m3 model.
Launch command:
python3 -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --port 8181 --served-model-name bge-rerank --model /mnt/data/.t1/dianjin-0701/bge-reranker-v2-m3/ --task score &> start.log &
curl result:

[Image: screenshot of the bge-rerank result]

Clearly, the Qwen3-Reranker-0.6B model is not working correctly as currently used. Can anyone help me resolve this?
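Since the results above are only available as screenshots, a small script may make the comparison easier to reproduce as text. The following is a minimal sketch, not part of vLLM: the helper names (`build_rerank_payload`, `rerank`, `rank_results`, `main`) are hypothetical, and it assumes the `/v1/rerank` response contains a `results` list whose entries carry `index` and `relevance_score` fields. `top_n` is left out by default so that the scores of all four documents are returned.

```python
import json
import urllib.request

# Query and documents copied verbatim from the curl command in this report.
QUERY = "什么是机器学习?"
DOCUMENTS = [
    "机器学习是人工智能的一个分支,通过算法让计算机从数据中学习模式",
    "机器学习是一种编程语言,用于开发网站",
    "机器学习是数据库管理系统的一种",
    "机器学习是操作系统的一种类型",
]


def build_rerank_payload(model, query, documents, top_n=None):
    """Build the JSON body for a /v1/rerank request."""
    payload = {"model": model, "query": query, "documents": documents}
    if top_n is not None:
        payload["top_n"] = top_n
    return payload


def rerank(base_url, payload, timeout=30):
    """POST the payload to <base_url>/v1/rerank and return the parsed JSON."""
    request = urllib.request.Request(
        base_url.rstrip("/") + "/v1/rerank",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


def rank_results(response, documents):
    """Return (score, index, text) tuples sorted by descending score.

    Assumption: each entry of response["results"] has "index" and
    "relevance_score" keys.
    """
    ranked = [
        (item["relevance_score"], item["index"], documents[item["index"]])
        for item in response["results"]
    ]
    return sorted(ranked, key=lambda entry: entry[0], reverse=True)


def main(base_url="http://127.0.0.1:8181", model="Qwen3-Rerank"):
    """Send the request from this report and print one line per document."""
    payload = build_rerank_payload(model, QUERY, DOCUMENTS)
    response = rerank(base_url, payload)
    for score, index, text in rank_results(response, DOCUMENTS):
        print(f"{score:.4f}  [{index}]  {text}")
```

With a server running, `main()` prints the Qwen3-Rerank ranking and `main(model="bge-rerank")` prints the bge-rerank one. The first document (index 0) would be expected to rank highest in both cases.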

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

Labels: bug
