Skip to content

Commit 467a2bb

Browse files
June release changelog (#451)
This PR updates the changelog to reflect the changes and new features included in the June release --------- Co-authored-by: Mehdi Goli <mehdi.goli@codeplay.com>
1 parent 3da91e1 commit 467a2bb

File tree

1 file changed

+16
-0
lines changed

1 file changed

+16
-0
lines changed

CHANGELOG-SYCL.md

Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,21 @@
11
# SYCL CUTLASS Changelog
22

3+
## [Cutlass 3.9.2 SYCL backend Version 0.3](https://github.com/codeplay/cutlass-fork/releases/tag/v3.9.2-0.3) (2025-06-30)
4+
- Add support for GEMM FP8 (E5M2 and E4M3)
5+
- Add example for GEMM FP8 with support for channel-wise and group-wise quantization
6+
- Add support for Grouped GEMM FP8
7+
- Improve performance for FP8 to FP16 conversion
8+
- Add support for epilogue data conversion
9+
- Add support for FP16 GEMM with FP16 accumulator
10+
- Add support for BF16 GEMM with BF16 accumulator
11+
- Add support for mixed dtype GEMM with support for tensor-wise, channel-wise and group-wise quantization
12+
- Add example of mixed dtype BF16 + INT8 using channel-wise and group-wise quantization
13+
- Add example of mixed dtype FP16 + INT8 using tensor-wise quantization
14+
- Add example of mixed dtype FP16 + INT4 using channel-wise and group-wise quantization
15+
- Add support for zero-point quantization in INT4 and INT8 data types
16+
- Add support for Flash Attention prefill FP8 with and without KV cache
17+
- Add support for Flash Attention decode FP8 with and without KV cache
18+
319
## [Cutlass 3.9.2 SYCL backend Version 0.2](https://github.com/codeplay/cutlass-fork/releases/tag/v3.9.2-0.2) (2025-05-30)
420
- GEMM/StreamK/SplitK with support for FP16 data type
521
- Flash attention prefill with Paged KV cache with support for FP16 data type

0 commit comments

Comments
 (0)