Skip to content

munchausen_addon and action #3

@quyouyuan

Description

@quyouyuan

Hello!Excuse me!I used your code in another environment, but I encountered difficulties. Action is a decimal array,, so how to rewrite the " munchausen_addon = log_pi.gather(1, actions)" line of code! And the action space is very large! By the way, these lines of code are:adding the scald log-policy to the immediate reward
looking forward to your reply!

2021-10-25 16-48-17 的屏幕截图
)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions