Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add LLGuidance Support for PyTorch Backend
#5214 opened Jun 14, 2025 by jellysnack Loading…
Add MTP support for Online EPLB
#5213 opened Jun 14, 2025 by dongxuy04 Draft
Overlap: Skip last iter on length
#5211 opened Jun 13, 2025 by IzzyPutterman Loading…
[fix] Fix Llama4 min-latency import error
#5209 opened Jun 13, 2025 by nv-yilinf Loading…
[feat] Add EAGLE3 support for Qwen3
#5206 opened Jun 13, 2025 by nv-yilinf Loading…
feat: Enable EPLB to existing MoE models
#5203 opened Jun 13, 2025 by syuoni Loading…
[fix][test] Speedup Nemotron NAS unittests
#5202 opened Jun 13, 2025 by omera-nv Loading…
Test
#5199 opened Jun 13, 2025 by ZhanruiSunCh Draft
Merge current waive list with the ToT waive list
#5198 opened Jun 13, 2025 by yiqingy0 Loading…
tests: add ds r1 tp4 test
#5197 opened Jun 13, 2025 by xinhe-nv Draft
tests: add multi nodes tests
#5196 opened Jun 13, 2025 by xinhe-nv Draft
test: add deepseek rcca cases
#5195 opened Jun 13, 2025 by ruodil Loading…
refactor: dummy request creation
#5192 opened Jun 13, 2025 by lfr-0531 Loading…
[chore] Linking fixes to NVRTC wrapper Community want to contribute PRs initiated from Community
#5189 opened Jun 13, 2025 by AlessioNetti Loading…
test: add llama4 models for perf test
#5187 opened Jun 13, 2025 by ruodil Loading…
add dgx b200 8gpu test case in post merge
#5185 opened Jun 13, 2025 by yuanjingx87 Loading…
feat: MoE trtllm backend kernel update
#5183 opened Jun 13, 2025 by rosenrodt Loading…
ProTip! no:milestone will show everything without a milestone.