Skip to content

Commit 3be8d31

Browse files
authored
[Kernel][Bugfix] Fixup some warnings in nvfp4_blockwise_moe when CUDA < 12.8 (#20324)
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
1 parent 3abfe22 commit 3be8d31

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

csrc/quantization/fp4/nvfp4_blockwise_moe_kernel.cu

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -335,8 +335,10 @@ void run_fp4_blockwise_scaled_group_mm(
335335
TORCH_CHECK(status == cutlass::Status::kSuccess, "Failed to run GEMM");
336336
}
337337

338+
#if defined ENABLE_NVFP4 && ENABLE_NVFP4
338339
constexpr auto FLOAT4_E2M1X2 = at::ScalarType::Byte;
339340
constexpr auto SF_DTYPE = at::ScalarType::Float8_e4m3fn;
341+
#endif
340342

341343
#define CHECK_TYPE(x, st, m) \
342344
TORCH_CHECK(x.scalar_type() == st, ": Inconsistency of Tensor type:", m)

0 commit comments

Comments
 (0)