Skip to content

Conversation

jiejanezhang
Copy link
Contributor

@jiejanezhang jiejanezhang commented Jul 31, 2025

mma.sync.aligned.m16n8k16.row.col.f16.f16.f16.f16.
LIT test and E2E test (SYCLomatic-test:#935) can pass.

Support mma.sync.aligned.m16n8k16.row.col.f16.f16.f16.f16
more enhancement
@jiejanezhang jiejanezhang requested a review from a team as a code owner July 31, 2025 06:22
@jiejanezhang jiejanezhang changed the title Support m16n8k16.row.col.f16.f16.f16.f16 Support mma.sync.aligned.m16n8k16.row.col.f16.f16.f16.f16 Jul 31, 2025
Copy link
Contributor

@tomflinda tomflinda left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Comment on lines 2727 to 2735
// for (int j = 0; j < 4; j++) {
// *d[0] +=
// static_cast<CDType>(ra[j]) * static_cast<CDType>(rb[j]);
// *d[1] += static_cast<CDType>(ra[j]) *
// static_cast<CDType>(rb[j + 4]);
// *d[2] += static_cast<CDType>(ra[j + 4]) *
// static_cast<CDType>(rb[j]);
// *d[3] += static_cast<CDType>(ra[j + 4]) *
// static_cast<CDType>(rb[j + 4]);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we remove those line?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed.

Comment on lines 2722 to 2726
// fragments and adds it to the corresponding D matrix fragment d0
// += row0{ a0, a1, a2, a3 } * col0{ b0, b1, b2, b3 } d1 += row0{
// a0, a1, a2, a3 } * col1{ b0, b1, b2, b3 } d2 += row1{ a0, a1,
// a2, a3 } * col0{ b0, b1, b2, b3 } d3 += row1{ a0, a1, a2, a3 } *
// col1{ b0, b1, b2, b3 }
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we format those lines? like line 2662

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed.

Minor fixings.
Fix LIT test and refine the comments
@jiejanezhang jiejanezhang merged commit 3fe420f into oneapi-src:SYCLomatic Aug 26, 2025
2 of 6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants