Pull requests: NVIDIA/TensorRT-LLM
Fix: Double build time limit since #5027 halves NUM_JOBS
#5212
opened Jun 14, 2025 by
yuantailing
[TRTLLM-5770] feat: Integrate TRT-LLM Gen FP8 block scale MoE with Pytorch workflow kernel autotuner
#5207
opened Jun 13, 2025 by
DomBrown
[draft][fix] rewrite completion API to avoid repetitive tokens
#5201
opened Jun 13, 2025 by
LinPoly
[chore] Linking fixes to NVRTC wrapper
Label: Community want to contribute (PRs initiated from Community)
#5189
opened Jun 13, 2025 by
AlessioNetti
[TRTLLM-5653][infra] Run docs build only if PR contains only doc changes
#5184
opened Jun 13, 2025 by
zhanga5
Add debug hook to support dump tensor data and add new debug functions easily
#5182
opened Jun 13, 2025 by
HuiGao-NV
Removed <think> on head of reasoning_content for DeepSeek-R1 model
#5181
opened Jun 13, 2025 by
k-l-lambda
test: Add json_mode_eval for guided decoding evaluation
#5179
opened Jun 13, 2025 by
syuoni