Skip to content

Commit 4aefa90

Browse files
committed
fix tf32 mma llvm intrinsic
1 parent 3bce1ab commit 4aefa90

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

src/device/intrinsics/wmma.jl

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -324,7 +324,7 @@ for ops in all_wmma_ops,
324324

325325
# Name of the LLVM intrinsic
326326
# If integer/sub-byte/bit A/B types, name is determined by A/B types
327-
if d_elem_type == "s32"
327+
if d_elem_type == "s32" || a_elem_type == "tf32"
328328
llvm_intr = "llvm.nvvm.wmma.$shape.mma.$a_layout.$b_layout.$a_elem_type"
329329
# Name of the Julia wrapper function
330330
func_name = Symbol(join(filter(!isempty, ["llvm", "wmma", "mma", a_layout, b_layout, shape, a_elem_type]), "_"))

0 commit comments

Comments
 (0)