How to continue the training process from a checkpoint

Hi @mkshing,
Your work is amazing, and I have tried to experiment with SVDiff on Google Colab with your repo.
But I actually got a little problem. Because the time for a Google Colab session is very limited, so I can not complete the training process in one Google Colab session. When I tried to resume the training process with "--resume_from_checkpoint", I coped with the following error:
FileNotFoundError: [Errno 2] No such file or directory: /path/to/checkpoints/pytorch_model.bin
If I understand correctly, your repo only saves spectral_shifts_te information, but accelerator.load_state() needs more (u-net/text encoder weight). Any solution for that?
Btw, thank you again for your repo. It is great when you publicize the repo. 
 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to continue the training process from a checkpoint #8

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

How to continue the training process from a checkpoint #8

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions