Implementing naive policy gradient in DI-engine is not difficult, but why do you need it? We have not added naive PG so far because it performs poorly in most environments.
-
I have implemented REINFORCE in #544.
-
The REINFORCE algorithm is a classical policy gradient method. It has been implemented in some other RL libraries, such as RLlib and Tianshou. I wonder if we can also have it in DI-engine.
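For reference, the core of REINFORCE is small. The following is a minimal, framework-free sketch of the idea, not the DI-engine implementation and not the code from any of the libraries mentioned above. The function names `discounted_returns` and `reinforce_loss` and the default `gamma=0.99` are assumptions made for illustration. It computes the discounted return-to-go for each step of one episode and the surrogate loss, the negative sum of log pi(a_t | s_t) times G_t, whose gradient is the REINFORCE estimate.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Return G_t = sum_k gamma^k * r_{t+k} for every step t of one episode."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the next one.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_loss(log_probs, rewards, gamma=0.99):
    """Surrogate loss -sum_t log_prob_t * G_t for one episode.

    log_probs: log pi(a_t | s_t) of the actions actually taken (hypothetical input).
    rewards:   rewards observed at the same steps.
    """
    returns = discounted_returns(rewards, gamma)
    return -sum(lp * g for lp, g in zip(log_probs, returns))


if __name__ == "__main__":
    # Toy episode: three steps, reward 1 each, every action taken with probability 0.5.
    rewards = [1.0, 1.0, 1.0]
    log_probs = [math.log(0.5)] * 3
    print(discounted_returns(rewards, gamma=0.5))
    print(reinforce_loss(log_probs, rewards, gamma=0.5))
```

In a real training loop the log-probabilities would come from a differentiable policy network, and the loss would be minimized with an autodiff framework. This sketch has no baseline or return normalization, so the estimate has high variance, which may be one reason naive policy gradient tends to perform poorly.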