Continuous Action Space outputs are unbounded #329

Answered by Toni-SM
Sanjay1911 asked this question in Q&A

Hi @Sanjay1911

skrl's Gaussian model concept diagram shows the data flow when this mixin is used:

The mean (in the interval [-1, 1] due to the tanh output) feeds a Gaussian distribution. Because the standard deviation of that distribution, exp(log_std), is positive for any finite log_std, the sampled values are not bounded to [-1, 1], as discussed in #63 (comment). I recommend taking a look at that discussion.
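To make the data flow concrete, here is a minimal standard-library sketch, not skrl's actual code. It assumes a one-dimensional action, a standard deviation computed as exp(log_std), and a hypothetical helper named `sample_action`; the pre-activation value 2.0 and log_std 0.0 are arbitrary illustration values.

```python
import math
import random


def sample_action(pre_activation: float, log_std: float, rng: random.Random):
    """Return (mean, sample) for a one-dimensional Gaussian policy head (hypothetical helper)."""
    mean = math.tanh(pre_activation)   # bounded: tanh always lands in [-1, 1]
    std = math.exp(log_std)            # assumption: std = exp(log_std), positive for finite log_std
    sample = rng.gauss(mean, std)      # unbounded: a Gaussian has infinite support
    return mean, sample


rng = random.Random(0)                 # fixed seed so the run is repeatable
results = [sample_action(2.0, 0.0, rng) for _ in range(10_000)]
means = [m for m, _ in results]
samples = [s for _, s in results]

# Count how many sampled actions fall outside the interval the mean is confined to.
out_of_bounds = sum(abs(s) > 1.0 for s in samples)
print(f"mean = {means[0]:.3f}, samples outside [-1, 1]: {out_of_bounds} of {len(samples)}")
```

The mean never leaves [-1, 1], yet a noticeable share of the samples does, which is the behavior described in the question.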

If you want to limit the final output of the Gaussian model, you need to enable clip_actions and define a bounded Gymnasium space (action_space = spaces.Box(low=-1, high=1, shape=(2,))), leaving the application of the scale (0.0005) to the task implementation.
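The split of responsibilities can be sketched in plain Python. This is an illustration only, assuming that clip_actions clamps each sampled action to the bounds of the action space; `clip_action` and `task_apply_action` are hypothetical names and not part of skrl or Gymnasium.

```python
import math
import random

ACTION_LOW, ACTION_HIGH = -1.0, 1.0    # bounds of the Box space suggested in the answer
ACTION_SCALE = 0.0005                  # scale from the discussion, applied on the task side


def clip_action(action: float, low: float = ACTION_LOW, high: float = ACTION_HIGH) -> float:
    """Clamp a sampled action to the action-space bounds (assumed effect of clip_actions)."""
    return max(low, min(high, action))


def task_apply_action(action: float) -> float:
    """Hypothetical task-side step: turn a normalized action into a physical command."""
    return ACTION_SCALE * action


rng = random.Random(0)
# Raw samples from a Gaussian whose mean is a tanh output; some fall outside [-1, 1].
raw = [rng.gauss(math.tanh(2.0), 1.0) for _ in range(1000)]
clipped = [clip_action(a) for a in raw]            # model side: bounded to [-1, 1]
commands = [task_apply_action(a) for a in clipped]  # task side: bounded to [-0.0005, 0.0005]

print(f"largest |raw| = {max(abs(a) for a in raw):.3f}")
print(f"largest |command| = {max(abs(c) for c in commands):.6f}")
```

Keeping the policy in the normalized [-1, 1] range and scaling inside the task means the clipping bounds match the action space, so the command passed to the simulation never exceeds the scale.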

Answer selected by Toni-SM