I got to this point following the RL-teacher Usage docs:
"Once you have finished labeling the 175 pre-training comparisons, we train the predictor to convergence on the initial comparisons. After that, it will request additional comparisons every few seconds."
I was able to use the human-feedback-api webapp to provide feedback for the 175 pre-training labels. After that, the agent began to learn based on the pre-training feedback:
8900/10000 predictor pretraining iters...
9000/10000 predictor pretraining iters...
9100/10000 predictor pretraining iters...
9200/10000 predictor pretraining iters...
9300/10000 predictor pretraining iters...
9400/10000 predictor pretraining iters...
9500/10000 predictor pretraining iters...
9600/10000 predictor pretraining iters...
9700/10000 predictor pretraining iters...
9800/10000 predictor pretraining iters...
9900/10000 predictor pretraining iters...
Starting joint training of predictor and agent
But joint training failed. The human-feedback-api webapp displayed only blank screens. When I checked the URL for the videos in a separate tab, I got an XML error message that said "The specified key does not exist".
At the same time, the teacher.py script continued to generate video samples and upload them to Google Cloud:
Operation completed over 1 objects/14.4 KiB.
Copying media to gs://rl-teacher-snappi/abb3e1ed-f78e-459d-bed8-a1865ed541b1-right.mp4 in a background process
Copying media to gs://rl-teacher-snappi/c21384b2-7395-49b5-b263-5200221a3a36-right.mp4 in a background process
Copying media to gs://rl-teacher-snappi/c21384b2-7395-49b5-b263-5200221a3a36-left.mp4 in a background process
Copying file:///tmp/rl_teacher_media/c21384b2-7395-49b5-b263-5200221a3a36-left.mp4 [Content-Type=video/mp4]...
Copying file:///tmp/rl_teacher_media/c21384b2-7395-49b5-b263-5200221a3a36-right.mp4 [Content-Type=video/mp4]...
Copying file:///tmp/rl_teacher_media/abb3e1ed-f78e-459d-bed8-a1865ed541b1-right.mp4 [Content-Type=video/mp4]...
\ [1 files][ 14.8 KiB/ 14.8 KiB]
Operation completed over 1 objects/14.8 KiB.
\ [1 files][ 15.8 KiB/ 15.8 KiB]
Operation completed over 1 objects/16.1 KiB.
Operation completed over 1 objects/15.8 KiB.
I can manually confirm that the media files exist in Google Cloud.
I waited many minutes, refreshed the webapp, and even clicked "can't tell" a few times, but the videos never reappeared after the (successful) pre-training.
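For anyone trying to reproduce this, here is a minimal sketch of how the "key does not exist" symptom could be checked outside the browser. It assumes the uploaded objects are meant to be served from the standard public Cloud Storage endpoint (https://storage.googleapis.com/BUCKET/OBJECT); I have not confirmed how the webapp actually builds its video URLs, and the helper names gs_to_public_url and http_status are made up for this example.

```python
import urllib.error
import urllib.parse
import urllib.request


def gs_to_public_url(gs_uri):
    """Map gs://bucket/object to its public storage.googleapis.com URL.

    Assumption: objects are served from the standard public endpoint.
    """
    prefix = "gs://"
    if not gs_uri.startswith(prefix):
        raise ValueError("expected a gs:// URI, got %r" % gs_uri)
    bucket, _, obj = gs_uri[len(prefix):].partition("/")
    if not bucket or not obj:
        raise ValueError("URI must include both a bucket and an object name")
    # Percent-encode the object name; "/" is kept as a path separator.
    return "https://storage.googleapis.com/%s/%s" % (bucket, urllib.parse.quote(obj))


def http_status(url, timeout=10):
    """Send a HEAD request and return the HTTP status code (needs network access)."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses raise HTTPError; the code is still the answer we want.
        return err.code


if __name__ == "__main__":
    uri = "gs://rl-teacher-snappi/abb3e1ed-f78e-459d-bed8-a1865ed541b1-right.mp4"
    url = gs_to_public_url(uri)
    print(url, http_status(url))
```

Comparing the URL printed here with the one the webapp requests should show whether the two point at the same object name. A 404 for an object that gsutil reports as uploaded would suggest the paths differ, while a 403 would suggest the object exists but is not publicly readable.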