Can't adapt torch_ant_ppo.py to use LSTM #324
ivanpanshin
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi!
I'm trying to figure out how to train an agent with LSTM in order to account for observation history. Based on previous issues, I found out that standalone scripts is a good starting point.
I trained an Ant using torch_ant_ppo.py and am now trying to adapt it to use ppo_rnn.py
In short, I adapted LSTM model, but keep getting mismatch error
Which heavily hints that model awaits a sequence of states, but instead is presented with a single state, which is exactly the difference between training a regular model and a sequential one.
My question is simple - what's wrong with my script adaptation? Attaching it here along with full logs.
Script code (click to expand)
Full traceback (click to expand)
Beta Was this translation helpful? Give feedback.
All reactions