Skip to content

Calculating rewards for multyagent task #282

@Ange1ika

Description

@Ange1ika

Hello! I want to learn how to control a swarm of UAVs with obstacles. Could you tell me, please, in what order to change the code? Which action to use and how to work with it correctly. How to figure out the rgb calculation?

I'm figuring out the pipeline, I found adding obstacles, but there are difficulties with calculating rewards. Can you explain how you got these threshold values? Why such a formula

#### Target cumulative rewards (problem-dependent) ########## if DEFAULT_ACT == ActionType.ONE_D_RPM: target_reward = 474.15 if not multiagent else 949.5 else: target_reward = 467. if not multiagent else 920. callback_on_best = StopTrainingOnRewardThreshold(reward_threshold=target_reward, verbose=1)

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions