Skip to content

Add small block shapes to warpspec matmul configs #299

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

njriasan
Copy link
Contributor

Summary: Many of the shapes performed much worse, which may be due to the tutorial assuming large configs. This adds smaller configs as an option so we can test warp spec vs just the impact of blackwell configs.

Reviewed By: PaulZhang12

Differential Revision: D78059342

njriasan added 2 commits July 22, 2025 12:19
Summary:

Fixes a couple tutorial assumptions, most notably:

1. It only worked with fp8 and fp16. Now it works with all dtypes for OmniFm.
2. The shapes were not compatible due to layout mismatches. Since every shape will have a layout mismatch this adds an explicit tranpose to allow benchmarking a "best case" although this may not be accurate.
3. Some shapes will never be compatible with TMA as the strides are not divisble by 16. I added an explicit check in the code to simplify this issue, but I will be skipping these.

Differential Revision: D77950060
Summary: Many of the shapes performed much worse, which may be due to the tutorial assuming large configs. This adds smaller configs as an option so we can test warp spec vs just the impact of blackwell configs.

Reviewed By: PaulZhang12

Differential Revision: D78059342
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D78059342

facebook-github-bot pushed a commit that referenced this pull request Jul 22, 2025
Summary:

Many of the shapes performed much worse, which may be due to the tutorial assuming large configs. This adds smaller configs as an option so we can test warp spec vs just the impact of blackwell configs.

Reviewed By: PaulZhang12

Differential Revision: D78059342
njriasan added a commit that referenced this pull request Jul 22, 2025
Summary:

Many of the shapes performed much worse, which may be due to the tutorial assuming large configs. This adds smaller configs as an option so we can test warp spec vs just the impact of blackwell configs.

Reviewed By: PaulZhang12

Differential Revision: D78059342
njriasan added a commit that referenced this pull request Jul 22, 2025
Summary:

Many of the shapes performed much worse, which may be due to the tutorial assuming large configs. This adds smaller configs as an option so we can test warp spec vs just the impact of blackwell configs.

Reviewed By: PaulZhang12

Differential Revision: D78059342
@njriasan njriasan requested a review from PaulZhang12 July 22, 2025 22:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants