-
Notifications
You must be signed in to change notification settings - Fork 183
Unable to reproduce results on Humanoid-v2 in new SAC #16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hmm..... |
Thank you very much!!! |
I use almost the same parameters except that "--automatic_entropy_tuning = True" for 10 million steps, and I got the following result: |
For fixed temperature, the results should be around 6000. |
Hi, I have the same problems. I run the code with automatic_entropy_tuning = True, but the result is still around 6000. Would you mind share the running config for you curve? Thank you very much. |
I am unable to obtain the result as reported in the paper ‘Soft Actor-Critic Algorithms and Applications ’ on the openai environment Humanoid-v2. The result is 6000 while the original paper is 8000, after 10million steps.
Do you know what might be causing this issue? Thank you!
The text was updated successfully, but these errors were encountered: