training stage 1 #62

@Wh1t3zZwhite

Description

During stage 1 training, the run always hangs at a certain step. After debugging, I found that it hangs at the accelerator.backward(loss) call. While it is hung, GPU utilization remains very high. How can I resolve this?
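To help narrow down where the hang occurs, here is a minimal sketch that uses Python's standard `faulthandler` module to dump the stack of every Python thread if a wrapped block runs longer than a timeout. The names `hang_watchdog` and `timeout_s` are hypothetical helpers introduced for illustration and are not part of this repository or of Accelerate. The dump only shows Python-level frames, so it can confirm which call is blocked but not what is happening inside a CUDA kernel.

```python
import faulthandler
import sys
from contextlib import contextmanager


@contextmanager
def hang_watchdog(timeout_s=300.0, file=None):
    """Dump all Python thread stacks if the wrapped block exceeds timeout_s.

    `hang_watchdog` is a hypothetical helper, not an Accelerate API.
    `file` must be a real file object with a file descriptor.
    """
    target = file if file is not None else sys.stderr
    # repeat=True keeps dumping every timeout_s seconds while the block hangs.
    faulthandler.dump_traceback_later(timeout_s, repeat=True, file=target)
    try:
        yield
    finally:
        # Disarm the watchdog as soon as the block finishes normally.
        faulthandler.cancel_dump_traceback_later()


# Assumed usage inside the training loop (accelerator and loss come from
# the existing training script):
#
#     with hang_watchdog(timeout_s=300.0):
#         accelerator.backward(loss)
```

If the run uses several processes, comparing the dumps from each process shows whether all of them are blocked at the same call or only some of them.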
