IPPO actions almost always the same #240
ginesmoratalla asked this question in Q&A
Hello, I am new to this framework and am trying to train IPPO on a custom environment. When I run the trained actor network in the environment, each agent always picks the same action (e.g., agent_0 always picks action 4), and I do not know whether the problem lies in training or in execution. Since this is just the execution part, I wanted to ask whether anyone can give me tips on how to debug this. Could it be a problem with how the network was trained, or could it be in the execution script?
This is my first training run in this environment; it ran for 240k steps with 4 agents. (Execution script below.)
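One way to narrow this down is to look at the actor's output distribution instead of only the chosen action. The following is a minimal sketch, assuming the actor outputs categorical logits and that they can be collected for one agent over a rollout into an array of shape (num_steps, num_actions). The names `diagnose_policy` and `softmax` are hypothetical helpers written for illustration, not part of the framework, and the example uses plain NumPy with synthetic logits rather than a real network.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over the last axis."""
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)


def diagnose_policy(logits, rng):
    """Summarise one agent's policy over a rollout.

    logits: array of shape (num_steps, num_actions), ASSUMED to be the raw
            actor outputs collected during execution (hypothetical input).
    rng:    a numpy.random.Generator used for sampling.
    """
    probs = softmax(logits)
    num_actions = probs.shape[-1]

    # Per-step entropy of the action distribution (in nats).
    entropy = -(probs * np.log(probs + 1e-12)).sum(axis=-1)

    # Greedy (argmax) actions, as a deterministic execution script would pick.
    greedy = probs.argmax(axis=-1)

    # Sampled actions via inverse-CDF sampling, as PPO does during training.
    u = rng.random((probs.shape[0], 1))
    sampled = (probs.cumsum(axis=-1) < u).sum(axis=-1)
    sampled = np.minimum(sampled, num_actions - 1)  # guard against rounding

    return {
        "mean_entropy": float(entropy.mean()),
        "max_entropy": float(np.log(num_actions)),  # entropy of a uniform policy
        "greedy_counts": np.bincount(greedy, minlength=num_actions),
        "sampled_counts": np.bincount(sampled, minlength=num_actions),
    }


# Synthetic example: a policy that has collapsed onto action 4.
rng = np.random.default_rng(0)
collapsed_logits = np.tile(np.array([0.0, 0.0, 0.0, 0.0, 20.0]), (1000, 1))
report = diagnose_policy(collapsed_logits, rng)
print(report)
```

If the mean entropy is close to zero, the policy itself has collapsed and the cause is more likely in training (for example the entropy coefficient, the reward signal, or too few steps). If the entropy is clearly above zero but the greedy counts still land on a single action, the execution script may be taking the argmax where training sampled, or the observations fed to the actor may barely change between steps. These are possible explanations to check, not a definitive diagnosis.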