You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
For MoE inference, mixed type group-gemm is helpful. But now cutlass seems only support mixed type matmul, and group-gemm of non-mixed type.
Describe the solution you'd like
group-gemm of fp16_int4, fp16_int8, e4m3_int4