-
Couldn't load subscription status.
- Fork 109
Fix the behavior of extent(r) for r > rank in TeamGEMM #2667
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
Conversation
b49ade1 to
3ee86a4
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't know the precedence of || and ternary offhand, could you please insert parens (even if not technically necessary)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd rather see this conditional nesting inverted with a constexpr check on the rank, but I won't block on it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we add a small unit-test that exhibit the current issue and shows that the proposed PR fixes it?
Sure |
If this is related, it means that there is a GEMM operation on rank 0 view. |
|
I'm building the Ifpack2 tests to check the impact of this PR, thanks for tracking this down @yasahi-hpc |
|
Unfortunately I still see the failures in |
3ee86a4 to
285d5ef
Compare
Thanks a lot for testing and reporting. |
Signed-off-by: Yuuichi Asahi <y.asahi@nr.titech.ac.jp>
Signed-off-by: Yuuichi Asahi <y.asahi@nr.titech.ac.jp>
Signed-off-by: Yuuichi Asahi <y.asahi@nr.titech.ac.jp>
285d5ef to
9f1b00a
Compare
|
Hi @yasahi-hpc , I retested with 9f1b00a in a Cuda build but I am still seeing failures with the |
Thank you for testing. I will investigate on my side. |
May partially resolve #2622
After looking at the bahaviors of older versions, I found that
extent(r)for r > rank should return1This PR fixes the behavior.