You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I was looking into the code inside the SAC algorithm, and I encountered an odd line here.
When comparing to make the update of the target, should not be using self._n_updates, or something similar instead gradient_step? such variable is always zero, hence, is always entering in the condition?.
There is maybe any specific reason to do it this way?
Thanks!
Checklist
I have checked that there is no similar issue in the repo