During training on hardware using qube_baselines, episodes suddenly terminate after only one step. #50

@jonkor29

Description

When training on hardware, at some random point in training (not the same point every time), episodes suddenly start terminating after one step. The batch of 2048 steps then becomes 2048 individual episodes of 1 step each, naturally always yielding 0 reward (for the QubeSwingupEnv).
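To pin down the update at which this starts, one option is to check the episode lengths inside each rollout batch. The sketch below is only a diagnostic idea, not part of qube_baselines: the helper names `episode_lengths` and `looks_degenerate` are hypothetical, and it assumes the per-step done flags of a batch can be collected into a flat sequence of booleans.

```python
from typing import List, Sequence


def episode_lengths(dones: Sequence[bool]) -> List[int]:
    """Split a flat rollout batch into episode lengths using the done flags."""
    lengths = []
    steps = 0
    for done in dones:
        steps += 1
        if done:
            # The episode ended on this step; record its length and restart.
            lengths.append(steps)
            steps = 0
    if steps:
        # Trailing episode that was still running when the batch ended.
        lengths.append(steps)
    return lengths


def looks_degenerate(dones: Sequence[bool], max_len: int = 1) -> bool:
    """Return True when every episode in the batch lasted at most max_len steps."""
    lengths = episode_lengths(dones)
    return bool(lengths) and all(n <= max_len for n in lengths)


# Hypothetical batches of 2048 steps (the batch size mentioned above).
healthy = [False] * 1023 + [True] + [False] * 1023 + [True]
broken = [True] * 2048  # every step ends an episode

print(looks_degenerate(healthy), looks_degenerate(broken))
```

Logging the first batch for which `looks_degenerate` returns True, together with the observation returned at that step, might help show whether the environment is reporting termination immediately after each reset.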
