-
Notifications
You must be signed in to change notification settings - Fork 151
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[perf][WIP]: using NZ optimization for quantized GMM
module:quantization
#906
opened May 20, 2025 by
linfeng-yuan
Loading…
[Bugfix] Fix deepseek V0 percision issue and add acc ci for it
module:ops
module:tests
#905
opened May 20, 2025 by
MengqingCao
Loading…
[BugFix] Fix accuracy bugs for unquantized deepseekv3 models
module:ops
module:quantization
#897
opened May 19, 2025 by
Angazenn
Loading…
[V1][LoRA][Test] V1 Engine LoRA support & e2e test
module:tests
ready
read for review
#893
opened May 19, 2025 by
paulyu12
Loading…
[1/N][UT][v1 MTP] add basic v1 mtp features
module:tests
#890
opened May 17, 2025 by
XWFAlone
Loading…
[CI/UT][PD Disaggreate] Initialize PD Disaggreate UT
module:pd
PD disaggregation related
module:tests
#889
opened May 17, 2025 by
MengqingCao
Loading…
Fix the device error when using ray as vllm-acend backend
module:core
module:ops
module:tests
#884
opened May 16, 2025 by
zhuo97
Loading…
Revert the modifications of cache engine for npu graph mode
#875
opened May 15, 2025 by
linfeng-yuan
Loading…
perf: set weights of mla_v1 contiguous to avoid doing it while running
#869
opened May 15, 2025 by
NeverRaR
Loading…
[BugFix][WIP] Fix accuray problems with deepseek in situation of ep=1, etp>1
module:ops
module:quantization
#863
opened May 14, 2025 by
whx-sjtu
Loading…
[WIP] Fixed: v0 style Scheduler broken issue in vllm main branch.
#862
opened May 14, 2025 by
gawainx
Loading…
[Performance] Add EPLB expert map import capabilities
module:ops
#860
opened May 14, 2025 by
songshanhu07
Loading…
[Feature] Impl v1 disaggregated prefill in ascend scheduler
#852
opened May 14, 2025 by
jianzs
Loading…
[BugFix] Fix chunked prefill bugs in engine v1
module:core
#849
opened May 14, 2025 by
rjg-lyh
Loading…
[BugFix] Fix chunked prefill bugs in engine v1
module:core
#844
opened May 14, 2025 by
rjg-lyh
Loading…
[aclgraph] implentment NPUPiecewiseBackend to enable aclgraph
module:core
#836
opened May 13, 2025 by
MengqingCao
•
Draft
Previous Next
ProTip!
Updated in the last three days: updated:>2025-05-17.