Skip to content

[Question] why SAC train method updates the target every steps? #2158

@angel-ayala

Description

@angel-ayala

❓ Question

Hi, I was looking into the code inside the SAC algorithm, and I encountered an odd line here.

When comparing to make the update of the target, should not be using self._n_updates, or something similar instead gradient_step? such variable is always zero, hence, is always entering in the condition?.

There is maybe any specific reason to do it this way?

Thanks!

Checklist

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions