tensor[tile] when tile size is 1 returns a 1D tensor, instead of a scalar #275
Conversation
stack-info: PR: #275, branch: joydddd/stack/15
Add a test for the fix?
```diff
-if fake_value.size(i) != 1:
-    stride = state.device_function.tensor_stride(fake_value, i).name
-    index_expr.append(f"{idx} * {stride}")
+stride = state.device_function.tensor_stride(fake_value, i).name
+index_expr.append(f"{idx} * {stride}")
```
What is the reason for this change?
something like this:

```python
N = x.size(0)
for tile in hl.tile(N):
    x_tile = x[tile]
```
When block_size=1, the if statement evaluates to False, so the indexing ignores the N dimension and generates:

```python
x_tile = tl.load(tile + tl.zeros([1], ...))
```
I'll add a test case for this.
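For reference, the intended semantics mirror NumPy/PyTorch indexing, where a length-1 slice keeps its dimension while an integer index drops it; a minimal NumPy sketch (analogy only, not Helion/Triton code):

```python
import numpy as np

x = np.arange(8, dtype=np.float32)

# A length-1 slice keeps the dimension: the result is 1-D with shape (1,) ...
t = x[0:1]
assert t.shape == (1,)

# ... while an integer index drops it, yielding a 0-d scalar.
s = x[0]
assert np.ndim(s) == 0
```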
Isn't this checking the tensor size, not the block size?
```python
if block_size == 1:
    extra_body.append(
        statement_from_string(
            f"{index_var} = {offset_var} + tl.zeros([1], {dtype})"
```
Doesn't this do the same thing as arange? I'd expect we would need shape=[], or even just offset_var directly?
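As a NumPy analogy for the shapes involved (a sketch of my understanding, not Triton itself): `zeros([1]) + offset` and `arange(0, 1) + offset` both produce a length-1 vector, whereas shape `[]` would behave like a scalar:

```python
import numpy as np

offset = 5

# zeros([1]) + offset gives a length-1 vector, the same shape as arange(0, 1) + offset ...
z = np.zeros([1], dtype=np.int32) + offset
a = np.arange(0, 1, dtype=np.int32) + offset
assert z.shape == (1,) and a.shape == (1,)
assert z[0] == a[0] == offset

# ... while a 0-d array (shape []) behaves like a scalar.
s = np.zeros([], dtype=np.int32) + offset
assert s.shape == ()
```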
Yes, this does the same thing. We don't need to make this change to fix tile indexing when block_size=1.

However, why does grid_codegen handle block_size == 1 differently, with tl.zeros instead of tl.arange?
The broadcasting behavior for size==1 tensors is intentional. We match numpy/pytorch broadcasting rules: https://numpy.org/devdocs/user/basics.broadcasting.html I added some more tests for this in #285, which we should make sure this doesn't break.
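A minimal NumPy sketch of the broadcasting rule being referenced, where size-1 dimensions stretch to match the other operand:

```python
import numpy as np

a = np.ones((4, 1))  # trailing size-1 dimension broadcasts
b = np.ones(3)

# Shapes (4, 1) and (3,) broadcast together to (4, 3).
c = a + b
assert c.shape == (4, 3)
```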
Stacked PRs:
tensor[tile] when tile size is 1 returns a 1D tensor, instead of a scalar