Skip to content

About the model_without_ddp #23

@chazo1994

Description

@chazo1994

@KevinMIN95 Why you use model_without_ddp and discriminator_without_ddp to calculate some tensors participating the losses calculation? I think the gradients of model_without_ddp will not be synchronized and reduced accross the device, and could this lead to mistakes in distributed training?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions