You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Apr 28, 2023. It is now read-only.
promoteToRegistersBelow: limit the number of registers to use
Introduce the per-thread limit on the total number of registers to use
during promotion. This limit does not differentiate between the data
types because we cannot control the register allocation at CUDA level
anyway. It rather serves as a controllable input to the promotion
heuristic.
0 commit comments