-
Notifications
You must be signed in to change notification settings - Fork 18
Open
Description
Hi,
Thanks for your beautiful code for distributed computation! Could you please share how to save a checkpoint/model? I tried to use the following but it does not work. Thank you in advance!
saver = tf.train.Saver(tf.all_variables(), max_to_keep=1000)
if (epoch+1) % 10 == 0:
with tf.device("/job:worker/task:5"):
checkpoint_path = os.path.join(outdir + '/model.ckpt')
saver.save(sess, checkpoint_path)
Metadata
Metadata
Assignees
Labels
No labels