Open
Description
Hi, I am running the image harmonization part of the model with --train_stages 6, --max_size 350, and --lr_scale 0.5 to increase the quality of the images.
However, once I reach stage 2 of training, it crashes with a CUDA out-of-memory error. I changed the torch device so the model could use more than one GPU (say GPUs 0 and 1) and wrapped the model in DataParallel so it could run on multiple GPUs in parallel, but it still only runs on a single GPU.
Do you have any suggestions to fix this issue?
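For reference, this is roughly what I tried (a minimal sketch; the actual generator class and variable names in the repo differ, `Generator` here is just a placeholder):

```python
import torch
import torch.nn as nn

# Placeholder standing in for the repo's per-stage generator
# (the real class and its constructor arguments are different).
class Generator(nn.Module):
    def __init__(self):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(3, 64, 3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(64, 3, 3, padding=1),
        )

    def forward(self, x):
        return self.body(x)

device = torch.device("cuda:0")
netG = Generator().to(device)

# What I tried: wrap the model so work is split across GPUs 0 and 1.
if torch.cuda.device_count() > 1:
    netG = nn.DataParallel(netG, device_ids=[0, 1])

# Note: nn.DataParallel splits the input along the batch dimension,
# so with a batch size of 1 (as in this training setup) there is
# nothing to split, which may be why only one GPU ends up being used.
x = torch.randn(1, 3, 350, 350, device=device)
out = netG(x)
print(out.shape)
```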