Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP]: Masked layout fp4 gemm using cute-dsl
#1331 opened Jul 25, 2025 by yzh119 Draft
5 tasks
refactor: Improved metainfo for trtllm-gen kernels
#1328 opened Jul 25, 2025 by cyx-6 Loading…
5 tasks
Add moe benchmark routine
#1327 opened Jul 25, 2025 by aleozlx Draft
3 of 5 tasks
Add k_scale and v_scale to persistent attention
#1322 opened Jul 24, 2025 by Edenzzzz Loading…
5 tasks
Add blockwise-scaled FP8 GEMM via TRTLLM-Gen.
#1320 opened Jul 24, 2025 by sergachev Loading…
5 tasks done
feat: support output nvfp4 in trtllm-gen function call.
#1318 opened Jul 24, 2025 by weireweire Loading…
2 of 5 tasks
Allow cudnn prefill kernels to be called natively
#1317 opened Jul 24, 2025 by Anerudhan Draft
5 tasks done
minor: add trtllm_gen_mla benchmark
#1316 opened Jul 24, 2025 by yyihuang Loading…
5 tasks done
Wrap cudnn backend to unified interface
#1312 opened Jul 23, 2025 by cyx-6 Loading…
5 tasks
Refactor Fused Moe Module
#1309 opened Jul 23, 2025 by wenscarl Loading…
5 tasks
Api regression test for trtllmgen fp8 moe
#1308 opened Jul 23, 2025 by aleozlx Loading…
5 tasks done
fix: a workaround to make fp8 kv-cache work for prefill
#1304 opened Jul 22, 2025 by chenyang78 Loading…
2 tasks
3rparty: upgrade cutlass dependency to v4.1.0
#1299 opened Jul 22, 2025 by yzh119 Loading…
5 tasks
add mm_fp4 use cutlass backend for large bs
#1296 opened Jul 21, 2025 by ttyio Loading…
5 tasks done
Add native cudnn_decode for improved cudnn decode performance
#1283 opened Jul 18, 2025 by Anerudhan Loading…
5 tasks done
ci: add github actions to upload sdist to pypi
#1270 opened Jul 16, 2025 by yzh119 Loading…
5 tasks
Bug fix: fix duplicate launch in POD
#1267 opened Jul 16, 2025 by Edenzzzz Loading…
5 tasks
feat(aot): add nvshmem module for aot compilation
#1261 opened Jul 15, 2025 by EmilienM Loading…
3 of 5 tasks
refactor: separate SM100 and legacy TRT-LLM comm modules
#1259 opened Jul 15, 2025 by EmilienM Loading…
3 of 5 tasks
ProTip! Filter pull requests by the default branch with base:main.