Calculating rewards for multi-agent task #282

Hello! I want to learn how to control a swarm of UAVs in an environment with obstacles. Could you please tell me in what order I should change the code, which action type to use, and how to work with it correctly? Also, how does the RGB calculation work?

I am working through the pipeline. I found how to add obstacles, but I am having difficulty with the reward calculation. Can you explain how you arrived at these threshold values, and why this formula is used?


```python
#### Target cumulative rewards (problem-dependent) ##########
if DEFAULT_ACT == ActionType.ONE_D_RPM:
    target_reward = 474.15 if not multiagent else 949.5
else:
    target_reward = 467. if not multiagent else 920.
callback_on_best = StopTrainingOnRewardThreshold(reward_threshold=target_reward,
                                                 verbose=1)
```
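To make the question concrete, here is a rough sketch of how I would try to sanity-check a threshold like this. It is only a guess at the reasoning, not the maintainers' method: it assumes the per-step reward has a known maximum, that an episode has a fixed number of steps, and that the multi-agent reward is the sum over drones. The helper names and the values `2.0` and `240` are hypothetical placeholders that I chose for illustration; they are not taken from the code above.

```python
def max_cumulative_reward(max_step_reward: float,
                          steps_per_episode: int,
                          num_agents: int = 1) -> float:
    """Upper bound on the episode return if every agent earned
    the maximum per-step reward at every step (assumes rewards are summed)."""
    return max_step_reward * steps_per_episode * num_agents


def threshold_fraction(target_reward: float,
                       max_step_reward: float,
                       steps_per_episode: int,
                       num_agents: int = 1) -> float:
    """Fraction of the theoretical maximum return that the threshold demands."""
    bound = max_cumulative_reward(max_step_reward, steps_per_episode, num_agents)
    return target_reward / bound


# ASSUMED values, for illustration only: max reward of 2.0 per step,
# 240 control steps per episode. Replace with the real environment settings.
MAX_STEP_REWARD = 2.0
STEPS_PER_EPISODE = 240

# Thresholds quoted in the snippet above for ActionType.ONE_D_RPM.
single = threshold_fraction(474.15, MAX_STEP_REWARD, STEPS_PER_EPISODE)
multi = threshold_fraction(949.5, MAX_STEP_REWARD, STEPS_PER_EPISODE, num_agents=2)

print(f"single-agent threshold is {single:.1%} of the assumed maximum")
print(f"two-agent threshold is {multi:.1%} of the assumed maximum")
```

If the assumptions hold, a threshold slightly below the theoretical maximum would mean "stop training once the policy is nearly perfect". I would like to know whether that is the intended reading.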
