Skip to content

[QST] How to use Grouped_gemm when MNK is not a regular shape #2263

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
chenhongyu2048 opened this issue Apr 25, 2025 · 3 comments
Closed

[QST] How to use Grouped_gemm when MNK is not a regular shape #2263

chenhongyu2048 opened this issue Apr 25, 2025 · 3 comments

Comments

@chenhongyu2048
Copy link

What is your question?

Hi all, I'm having some issues with the 57_cutlass_grouped_gemm.
I need to do some gemm calculations where there may be a matrix form like 1024xNx1024, but N may not be a multiple of 8.
I believe this is common, but how should I modify my code to accommodate this kind of problem?

@chenhongyu2048
Copy link
Author

Supplement:
I tried tweaking alignment, but it went against the needs of the TMA.

@thakkarV
Copy link
Collaborator

You'll have to either pad your allocations to be 16B aligned or use the cpasync based grouped GEMM kernel instead

@chenhongyu2048
Copy link
Author

Thanks, I'll try it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants