Skip to content

Target policy smoothing noise out of predict function #3

@leandergrech

Description

@leandergrech

FERMI_RL_Paper/naf2_new.py

Lines 564 to 569 in cd2545c

action = self.noise_function(np.squeeze(model.get_action([state])),self.idx_episode)
if self.q_smoothing_clip is None:
return_value = np.clip(action, -1, 1)
else:
return_value = np.clip(action + np.clip(self.sigma * np.random.randn(self.action_size),
-self.q_smoothing_clip, self.q_smoothing_clip), -1, 1)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions