-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Issues: NVIDIA/cutlass
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[FEA] cute hopper conv example
CuTe
CuTe Functionality
feature request
New feature or request
help wanted
Interested in support from contributors
inactive-30d
inactive-90d
[BUG] Implicitly generate unexpected LDGSTS instructions for A100
bug
Something isn't working
inactive-30d
inactive-90d
#1231
opened Dec 4, 2023 by
cctry
updated Apr 3, 2024
[QST] How does local_tile work? Could you provide an detailed explanation?
CuTe
CuTe Functionality
inactive-30d
inactive-90d
question
Question
#1243
opened Dec 6, 2023 by
ziyuhuang123
updated May 22, 2024
[QST] thread num assert in sm70_epilogue_vectorized
? - Needs Triage
inactive-30d
inactive-90d
question
Question
#1334
opened Feb 5, 2024 by
mammoth831
updated Jun 6, 2024
[BUG] Illegal CUDA shared memory access in SM90 GEMM TMA Warpspecialized at ClusterBarrier::init
? - Needs Triage
bug
Something isn't working
inactive-30d
inactive-90d
#1247
opened Dec 6, 2023 by
kadeng
updated Jun 21, 2024
[QST] Epilogue Swizzle
inactive-30d
inactive-90d
question
Question
#1383
opened Mar 6, 2024 by
jeromeku
updated Jul 6, 2024
[QST] How to Use Gemv in a Cuda Kernel
inactive-30d
inactive-90d
question
Question
#1401
opened Mar 14, 2024 by
StarrickLiu
updated Jul 23, 2024
[QST] Is there any fp16xfp16 GEMM sample using CUTE with a performance comparable to cublas?
feature request
New feature or request
good first issue
Good first issue for contributors
help wanted
Interested in support from contributors
question
Question
#1686
opened Aug 6, 2024 by
xiaonans
updated Aug 6, 2024
[BUG] Disordered header files: detail::is_prefetch used before declaration.
bug
Something isn't working
inactive-30d
inactive-90d
#1484
opened Apr 15, 2024 by
rchardx
updated Aug 14, 2024
[QST] How to use swizzle to avoid bank conflict?
? - Needs Triage
inactive-30d
inactive-90d
question
Question
#1234
opened Dec 5, 2023 by
ziyuhuang123
updated Aug 16, 2024
[QST] How local_partition works?
? - Needs Triage
CuTe
CuTe Functionality
inactive-30d
inactive-90d
question
Question
#1244
opened Dec 6, 2023 by
ziyuhuang123
updated Aug 16, 2024
[QST] 2D Convolution for NCHW Row-Major images, kernels and output
? - Needs Triage
inactive-30d
inactive-90d
question
Question
#1496
opened Apr 19, 2024 by
chart21
updated Aug 17, 2024
[QST]iterator store interface[store(Fragment &frag,TensorCoord const &tile_offset)] not offset in units of whole tiles
inactive-30d
inactive-90d
question
Question
#1314
opened Jan 20, 2024 by
zhiyu-deep
updated Aug 20, 2024
[QST] make error caused by glibc 2.32 in ubuntu 18.04 "//usr/local/lib/libpthread.so.0: undefined reference to"
? - Needs Triage
inactive-30d
inactive-90d
question
Question
#1490
opened Apr 17, 2024 by
Zmy6
updated Aug 22, 2024
[QST] How to do concurrent GEMMs ?
inactive-30d
inactive-90d
question
Question
#1418
opened Mar 23, 2024 by
jomivaan
updated Aug 23, 2024
[BUG] Broken copy.hpp
bug
Something isn't working
inactive-30d
inactive-90d
#1508
opened Apr 28, 2024 by
kroburg
updated Aug 27, 2024
[QST] StreamK ReductionStrategy: "Atomic" or "Mixed"
inactive-30d
inactive-90d
question
Question
#1488
opened Apr 16, 2024 by
HanGuo97
updated Aug 27, 2024
[BUG] Python Something isn't working
inactive-30d
inactive-90d
EVT
Pytorch
Emitter Broken
bug
#1462
opened Apr 8, 2024 by
jeromeku
updated Aug 27, 2024
[QST] use FastLinearCombinationClamp to convert half accumulator to int8_t output?
? - Needs Triage
inactive-30d
inactive-90d
question
Question
#1516
opened Apr 30, 2024 by
hychiang-git
updated Aug 28, 2024
[QST] Epilogue Reduction
? - Needs Triage
inactive-30d
inactive-90d
question
Question
#1518
opened Apr 30, 2024 by
jeromeku
updated Aug 28, 2024
[QST] Question
cutlass::Array
and cute::Tensor
--- using CUTLASS utility structs/classes with CUTE (such as NumericArrayConverter
)
? - Needs Triage
inactive-30d
inactive-90d
question
#1532
opened May 10, 2024 by
HanGuo97
updated Sep 7, 2024
[QST] Epilogue Broadcast: Question
Adapter
vs GemmUniversal
inactive-30d
question
#1459
opened Apr 7, 2024 by
jeromeku
updated Sep 20, 2024
[QST] Hopper mixed precision gemm always worse than FP8
? - Needs Triage
inactive-30d
inactive-90d
question
Question
#1549
opened May 24, 2024 by
divchenko
updated Sep 23, 2024
Previous Next
ProTip!
Updated in the last three days: updated:>2025-05-07.