Closed
Labels
bug (Something isn't working)
Description
Your current environment
Installed the latest vllm pip package following the official website instructions: the daily build, version 0.9.2rc, main branch.
GPU: H20
🐛 Describe the bug
Launch command:
python3 -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --port 8181 --served-model-name Qwen3-Rerank --model /mnt/data/.t1/dianjin-0701/Qwen3-Reranker-0.6B --task score --hf_overrides '{"architectures": ["Qwen3ForSequenceClassification"],"classifier_from_token": ["no", "yes"],"is_original_qwen3_reranker": true}' &> start.log &
curl command:
curl http://127.0.0.1:8181/v1/rerank \
-H 'Content-Type: application/json' \
-d '{
"model": "Qwen3-Rerank",
"query": "什么是机器学习?",
"documents": [
"机器学习是人工智能的一个分支,通过算法让计算机从数据中学习模式",
"机器学习是一种编程语言,用于开发网站",
"机器学习是数据库管理系统的一种",
"机器学习是操作系统的一种类型"
],
"top_n": 2
}'
Result:
Running the same test with the bge-reranker-v2-m3 model.
Launch command:
python3 -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --port 8181 --served-model-name bge-rerank --model /mnt/data/.t1/dianjin-0701/bge-reranker-v2-m3/ --task score &> start.log &
curl result:
Clearly, the Qwen3-Reranker-0.6B model is not working correctly as currently used. Can anyone help me resolve this?
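To make the comparison easier to repeat, the curl request above can be scripted. The following is a minimal Python sketch using only the standard library. The helper names (`build_rerank_request`, `rerank`, `ranked_indices`) are hypothetical, and the response shape (a `results` list whose items carry `index` and `relevance_score`) is an assumption, since the actual output is not shown in this issue.

```python
import json
import urllib.request

# Same query and documents as in the curl command above.
QUERY = "什么是机器学习?"
DOCUMENTS = [
    "机器学习是人工智能的一个分支,通过算法让计算机从数据中学习模式",
    "机器学习是一种编程语言,用于开发网站",
    "机器学习是数据库管理系统的一种",
    "机器学习是操作系统的一种类型",
]


def build_rerank_request(model, query, documents, top_n,
                         base_url="http://127.0.0.1:8181"):
    """Build the POST request for /v1/rerank without sending it."""
    payload = {
        "model": model,
        "query": query,
        "documents": documents,
        "top_n": top_n,
    }
    return urllib.request.Request(
        f"{base_url}/v1/rerank",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def rerank(model, query, documents, top_n=2):
    """Send the request to the running server and return the parsed JSON body."""
    request = build_rerank_request(model, query, documents, top_n)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))


def ranked_indices(response_body):
    """Return document indices ordered by descending score.

    Assumption: the body contains a "results" list whose items have
    "index" and "relevance_score" keys.
    """
    results = response_body.get("results", [])
    ordered = sorted(results, key=lambda r: r["relevance_score"], reverse=True)
    return [r["index"] for r in ordered]
```

Calling `ranked_indices(rerank("Qwen3-Rerank", QUERY, DOCUMENTS))` against the first server, and the same with `"bge-rerank"` after restarting with the second launch command, would give two index lists to compare directly. For this query, document index 0 is the only relevant one, so it would be expected to rank first.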
Before submitting a new issue...
- Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
Renpf2022