You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Typo and small fixes to CK fp8 rowwise grouped (pytorch#4550)
Summary:
Pull Request resolved: pytorch#4550
X-link: facebookresearch/FBGEMM#1593
- Typos+grammar fix found by LLM
- Validation bug found by LLM
- Small logic simplification I missed in my prior PR.
Differential Revision: D78827450
Copy file name to clipboardExpand all lines: fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/fp8_rowwise_grouped_gemm.hip
+4-4Lines changed: 4 additions & 4 deletions
Original file line number
Diff line number
Diff line change
@@ -109,7 +109,7 @@ void set_static_kernel_args(
109
109
int64_t output_offset = 0;
110
110
// When group count is large, we can more efficiently initialize
111
111
// by doing host setup and a memcpy. This is only viable if cuda
Copy file name to clipboardExpand all lines: fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/fp8_rowwise_grouped_common.h
0 commit comments