You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[NVFP4] Fix global scale update when dealing with offloaded layers (#1554)
SUMMARY:
- Updating the global scale using the `align_module` context does not
persist the scale parameter
- Update outside of the context so that the offloaded dict is upadated
as well
Testing
- Resolves CPU offloading issues seen with a Llama 70b FP4
0 commit comments