What is your question?
The following code (which is essentially an empty kernel) will trigger DSLCudaRuntimeError: Unknown CUDA error.
import cutlass
import cutlass.cute as cute

@cute.kernel
def kernel():
    layout = cute.make_layout((16, 16))
    smem = cutlass.utils.SmemAllocator()
    smem.allocate_tensor(element_type=cute.Float32, layout=layout)

@cute.jit
def device():
    kernel().launch(grid=(1,1,1), block=(1,1,1))

device()

The above code will lead to
DSLCudaRuntimeError: DSLCudaRuntimeError: Unknown CUDA error
Error Code: 201
🔍 Additional Context:
- Error name: CUDA_ERROR_INVALID_CONTEXT
- CUDA_TOOLKIT_PATH: not set
- Target SM ARCH: not set
📊 GPU Information:
- CUDA devices available: 8 (current: 0)
- Architecture: Ampere (sm_80)
- Compatible SM archs: sm_80
Compatibility Check:
❌ Error: Target SM ARCH unknown is not compatible
💡 Please use one of SM ARCHs: sm_80
Interestingly, if I first run another working kernel, the same code then works.
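For anyone reading the log: although the headline says "Unknown CUDA error", code 201 is a defined CUDA driver result, CUDA_ERROR_INVALID_CONTEXT, which the driver documentation describes as most often meaning no context is bound to the current thread. That would be consistent with the code working after another kernel has already run, though I have not confirmed the cause. Below is a minimal sketch for decoding such codes by hand. The table is a small, non-exhaustive subset of CUresult values from the driver API, and `describe_cu_result` is a hypothetical helper, not part of cutlass.

```python
# Subset of CUresult values from the CUDA driver API (cuda.h); not exhaustive.
CU_RESULT_NAMES = {
    0: "CUDA_SUCCESS",
    1: "CUDA_ERROR_INVALID_VALUE",
    2: "CUDA_ERROR_OUT_OF_MEMORY",
    3: "CUDA_ERROR_NOT_INITIALIZED",
    100: "CUDA_ERROR_NO_DEVICE",
    101: "CUDA_ERROR_INVALID_DEVICE",
    201: "CUDA_ERROR_INVALID_CONTEXT",
    999: "CUDA_ERROR_UNKNOWN",
}


def describe_cu_result(code: int) -> str:
    """Hypothetical helper: map a driver error code to its CUresult name."""
    return CU_RESULT_NAMES.get(code, f"unrecognized CUresult ({code})")


# The error code reported in the traceback above.
print(describe_cu_result(201))
```

Codes outside the table fall through to the "unrecognized" string rather than raising, so the helper is safe to call on whatever number the log prints.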