We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 795e362 commit 46ccbaeCopy full SHA for 46ccbae
tutorials/sphinx-tutorials/multiagent_ppo.py
@@ -404,7 +404,8 @@
404
n_agent_inputs=env.observation_spec["agents", "observation"].shape[
405
-1
406
], # n_obs_per_agent
407
- n_agent_outputs=2 * env.action_spec.shape[-1], # 2 * n_actions_per_agents
+ n_agent_outputs=2
408
+ * env.full_action_spec[env.action_key].shape[-1], # 2 * n_actions_per_agents
409
n_agents=env.n_agents,
410
centralised=False, # the policies are decentralised (ie each agent will act from its observation)
411
share_params=share_parameters_policy,
0 commit comments