Repositories list
24 repositories
LeetCUDA (Public): 📚 LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners, 200+ CUDA & Tensor Cores Kernels, HGEMM, FA-2 MMA etc. 🔥
Awesome-LLM-Inference (Public): 📚 A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, Parallelism, MLA, etc.
Awesome-DiT-Inference (Public)
torchlm (Public): 💎 A high level pipeline for face landmarks detection: train, eval, inference (Python/C++) and 100+ data augmentations.
lite.ai.toolkit (Public)
SpargeAttn (Public)
SageAttention (Public)
lihang-notes (Public): 📚 "Statistical Learning Methods" by Li Hang: notes, from principles to implementation. A very detailed set of study notes: a 200-page PDF with step-by-step hand derivations of the formulas and implementations in R. 🎉
.github (Public)
HGEMM (Public)
ffpa-attn (Public): 📚 FFPA(Split-D): Extend FlashAttention with Split-D for large headdim, O(1) GPU SRAM complexity, 1.8x~3x↑ 🎉 faster than SDPA EA.
xlite-cli (Public)
flashinfer (Public)
tutorial-template (Public template)
RVM-Inference (Public)
netron-vscode-extension (Public)
yolov5face-toolkit (Public): YOLO5Face 2021 with MNN/NCNN/TNN/ONNXRuntime
ssrnet-toolkit (Public)
fsanet-toolkit (Public)
mgmatting-toolkit (Public)
scrfd-toolkit (Public)
nanodet-toolkit (Public)
yolox-toolkit (Public)
yolop-toolkit (Public)