Is your feature request related to a problem? Please describe.
Currently, converting from f32 to tf32 with round-to-nearest dispatches to a PTX `cvt` instruction only on sm90.
Describe the solution you'd like
If we allow `rna` rounding (round to nearest, ties away from zero), we can dispatch to `cvt.rna.tf32.f32`, which is available from sm80 onward.
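
A rough sketch of what such a dispatch could look like is below. The helper name `f32_to_tf32_rna` and the `TF32_HD` macro are hypothetical and not part of any existing API. The software fallback is an assumption for illustration only: it passes Inf/NaN bit patterns through unchanged and clears the low 13 bits, which may not match the hardware instruction in every case.

```cpp
#include <cstdint>
#include <cstring>

// Builds with nvcc or a plain C++ compiler; the qualifiers vanish on host-only builds.
#if defined(__CUDACC__)
#  define TF32_HD __host__ __device__
#else
#  define TF32_HD
#endif

// Hypothetical helper: converts an f32 value to a tf32 bit pattern stored in a
// 32-bit word, rounding to nearest with ties away from zero (rna).
TF32_HD inline std::uint32_t f32_to_tf32_rna(float x) {
#if defined(__CUDA_ARCH__) && (__CUDA_ARCH__ >= 800)
    // Device path for sm80 and newer: a single PTX conversion instruction.
    std::uint32_t out;
    asm("cvt.rna.tf32.f32 %0, %1;" : "=r"(out) : "f"(x));
    return out;
#else
    // Software fallback (assumed behavior, sketch only).
    std::uint32_t bits;
    std::memcpy(&bits, &x, sizeof(bits));
    if ((bits & 0x7F800000u) == 0x7F800000u) {
        return bits;  // Inf/NaN: passed through unchanged in this sketch
    }
    bits += 0x00001000u;        // add half a tf32 ULP (bit 12) to the magnitude
    return bits & 0xFFFFE000u;  // drop the 13 mantissa bits tf32 does not keep
#endif
}
```

Because the fallback operates on the sign-magnitude bit pattern, adding half a ULP before truncating rounds ties away from zero for both positive and negative finite inputs.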
Describe alternatives you've considered
N/A
Additional context
A simple code sample is given below: