
Check the scan op input for requires_grad #9083


Open · wants to merge 6 commits into master
Conversation

@haifeng-jin (Collaborator) commented May 2, 2025

Resolves #8783

Added a test for LongTensor inputs, which would fail without this PR.
Only set carry.requires_grad to True when the dtype is floating point.

Return None, None from the backward() function if none of the outputs has a gradient. (A sketch of both changes follows below.)
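
A minimal sketch of the two changes, assuming a simplified scan written as a torch.autograd.Function; the real torch_xla implementation differs, and `Scan` and the add-based combine step here are illustrative only:

```python
import torch

class Scan(torch.autograd.Function):
    @staticmethod
    def forward(ctx, init, xs):
        # Let backward() receive None (rather than zero tensors) for
        # outputs that got no gradient, so it can short-circuit.
        ctx.set_materialize_grads(False)
        carry = init.clone()
        # Only floating-point tensors may require grad; calling
        # requires_grad_(True) on a LongTensor carry would raise
        # "only Tensors of floating point dtype can require gradients".
        if carry.is_floating_point():
            carry.requires_grad_(True)
        ys = []
        for x in xs:
            carry = carry + x  # stand-in for the user's combine fn
            ys.append(carry)
        return carry, torch.stack(ys)

    @staticmethod
    def backward(ctx, grad_carry, grad_ys):
        # If neither output received a gradient, there is nothing to
        # propagate: return None for both inputs (init, xs).
        if grad_carry is None and grad_ys is None:
            return None, None
        # Full gradient computation elided; this sketch only shows the
        # short-circuit path described above.
        raise NotImplementedError("sketch only")
```

With a LongTensor init, the guard simply skips the requires_grad_ call, so e.g. `Scan.apply(torch.zeros(3, dtype=torch.long), torch.ones(4, 3, dtype=torch.long))` runs instead of raising.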

Questions:

  1. Why don't we just set carry.requires_grad to match the user-provided init? (I ran into some errors when I tried that.)
  2. Do we also need to return None for init.grad in backward() when only x requires grad?
  3. It fails when init is a LongTensor and x is float32, because scan requires carry.dtype == init.dtype while the carry is promoted to float (it is produced by int + float). Do we need to support this case? (See the sketch after this list.)
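
A minimal repro of the promotion in question 3, assuming the user's scan function adds carry and x (names here are illustrative only):

```python
import torch

init = torch.zeros(3, dtype=torch.long)  # LongTensor init
x = torch.ones(3, dtype=torch.float32)   # float32 slice of xs

carry = init + x                  # int64 + float32 promotes to float32
print(carry.dtype)                # torch.float32
print(carry.dtype == init.dtype)  # False: the carry.dtype == init.dtype check fails
```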

@haifeng-jin marked this pull request as ready for review May 6, 2025 21:44
@haifeng-jin requested a review from tengyifei May 6, 2025 21:44
@miladm requested a review from bhavya01 May 7, 2025 19:38
Successfully merging this pull request may close these issues.

torch_xla scan forces inputs to be differentiable