-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][doc] Created Deployment Guide for SGLang DeepSeek-R1 FP8 and NVFP4
Community want to contribute
PRs initiated from Community
#6610
opened Aug 4, 2025 by
jamieliNVIDIA
Loading…
Update CMakeLists.txt extend find_library names
Community want to contribute
PRs initiated from Community
#6609
opened Aug 4, 2025 by
mc-nv
Loading…
feat: Enable nanobind as the default binding library
#6608
opened Aug 4, 2025 by
Linda-Stadter
•
Draft
[https://nvbugs/5433581][infra] Update install docs and CI script for SBSA deep_gemm workaround
#6607
opened Aug 4, 2025 by
chzblych
Loading…
[TRTLLM-5633][infra] Change the TOT repo to default-llm-repo
#6605
opened Aug 4, 2025 by
yiqingy0
Loading…
[TRTLLM-6864][feat] add CONTEXT_ONLY benchmark flag in disagg-server
#6593
opened Aug 4, 2025 by
reasonsolo
•
Draft
[None][fix] fix kimi k2 serving and add test for Kimi-K2
#6589
opened Aug 4, 2025 by
pengbowang-nv
Loading…
[https://nvbugs/5433581][fix] DeepGEMM installation on SBSA
#6588
opened Aug 4, 2025 by
zongfeijing
Loading…
[None][feat] Support CancelRequest for Disaggregated Serving
#6587
opened Aug 4, 2025 by
Shunkangz
Loading…
[TRTLLM-6334][feat] Add runtime swap AB for SM100 FP8 blockwise GEMM
#6586
opened Aug 4, 2025 by
Barry-Delaney
Loading…
[https://nvbugs/5409420][fix] Fix test_ptp_star_attention_example
#6584
opened Aug 4, 2025 by
Superjomn
Loading…
[https://nvbugs/5355007][fix] Set
enable_chunked_context
as True by default in trtllm bench
#6582
opened Aug 4, 2025 by
Wanli-Jiang
Loading…
doc: Add link to the examples for deploying Dynamo with TRT-LLM on K8s
#6580
opened Aug 4, 2025 by
Tabrizian
Loading…
doc: [TRTLLM-6089] Add long sequence document for Feature section
#6575
opened Aug 3, 2025 by
lfr-0531
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-07-04.