Skip to content

only pretraining comparisons appear in the labeling interface #36

@mixuala

Description

@mixuala

I got to this point following the RL-teacher Usage docs

Once you have finished labeling the 175 pre-training comparisons, we train the predictor to ? convergence on the initial comparisons. After that, it will request additional comparisons every few seconds."

I was able to use the human-feedback-api webapp to provide feedback for the 175 pre-training labels. After that, the agent began to learn based on the pre-training feedback

8900/10000 predictor pretraining iters... 
9000/10000 predictor pretraining iters... 
9100/10000 predictor pretraining iters... 
9200/10000 predictor pretraining iters... 
9300/10000 predictor pretraining iters... 
9400/10000 predictor pretraining iters... 
9500/10000 predictor pretraining iters... 
9600/10000 predictor pretraining iters... 
9700/10000 predictor pretraining iters... 
9800/10000 predictor pretraining iters... 
9900/10000 predictor pretraining iters... 
Starting joint training of predictor and agent

But joint training failed. The human-feedback-api webapp displayed only blank screens. When I checked the URL for the videos in a separate tab, I got an XML error message that said The specified key does not exist

At the same time, the teacher.py script continued to generate video samples and upload to GoogleCloud

Operation completed over 1 objects/14.4 KiB.                                     
Copying media to gs://rl-teacher-snappi/abb3e1ed-f78e-459d-bed8-a1865ed541b1-right.mp4 in a background process
Copying media to gs://rl-teacher-snappi/c21384b2-7395-49b5-b263-5200221a3a36-right.mp4 in a background process
Copying media to gs://rl-teacher-snappi/c21384b2-7395-49b5-b263-5200221a3a36-left.mp4 in a background process
Copying file:///tmp/rl_teacher_media/c21384b2-7395-49b5-b263-5200221a3a36-left.mp4 [Content-Type=video/mp4]...
Copying file:///tmp/rl_teacher_media/c21384b2-7395-49b5-b263-5200221a3a36-right.mp4 [Content-Type=video/mp4]...
Copying file:///tmp/rl_teacher_media/abb3e1ed-f78e-459d-bed8-a1865ed541b1-right.mp4 [Content-Type=video/mp4]...
\ [1 files][ 14.8 KiB/ 14.8 KiB]                                                
Operation completed over 1 objects/14.8 KiB.                                     
\ [1 files][ 15.8 KiB/ 15.8 KiB]                                                
Operation completed over 1 objects/16.1 KiB.                                     

Operation completed over 1 objects/15.8 KiB.                             

I can manually confirm that the media files exist in Google Cloud

I waited many minutes, refreshed the webapp, even clicked can't tell a few times, but the video never reappeared after the (successful) pre-training.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions