
Behavior cloning for Error Discovery

This repository contains the code for the paper "Self Supervised Detection of Incorrect Human Demonstrations: A Path Toward Safe Imitation Learning by Robots in the Wild" by Noushad Sojib & Momotaz Begum.

BED is a BC model with an additional parameter vector $w$ of length $|D|$. It uses a loss function that penalizes different kinds of inconsistency and encourages learning $w_i\approx1$ for good demos and $w_i\approx0$ for bad demos. Because bad demos contribute more to the total loss, discarding them (by assigning $w_i=0$) reduces the total loss, which is how the bad demos are detected.
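
To make the idea concrete, here is a minimal PyTorch sketch of a demo-weighted BC objective. It is an illustration only, not the repository's loss: the per-demo loss tensor, the sigmoid parameterization of $w$, and the keep-fraction regularizer are assumptions made for the example.

    import torch

    # Minimal sketch of a demo-weighted BC objective (illustration only, not the repo's loss).
    # per_demo_loss: tensor of shape (|D|,) holding each demonstration's BC loss.

    num_demos = 150                                         # |D|, e.g. the can dataset
    w_logits = torch.nn.Parameter(torch.zeros(num_demos))   # one learnable weight per demo

    def weighted_bc_loss(per_demo_loss, m=0.8, lam=1.0):
        w = torch.sigmoid(w_logits)              # keep each w_i in (0, 1)
        bc_term = (w * per_demo_loss).mean()     # inconsistent demos are down-weighted
        keep_term = (w.mean() - m) ** 2          # assumed regularizer: keep ~m of the demos
        return bc_term + lam * keep_term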

Installation

Add the two marked lines to the forward method of the policy network so that the encoder outputs (latents) are cached; BED uses these cached latents during training (for the path loss).

    def forward(self, **inputs):
        enc_outputs = self.nets["encoder"](**inputs)
        self.last_latent = enc_outputs[-1, :]    # add this line: cache the latent of the last element
        self.latent = enc_outputs                # add this line: cache all latents
        mlp_out = self.nets["mlp"](enc_outputs)
        return self.nets["decoder"](mlp_out)
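
For reference, the toy stand-in below (not the Robomimic network) shows what the two added lines expose after a forward pass:

    import torch
    import torch.nn as nn

    # Toy stand-in (not the Robomimic network) illustrating the cached attributes.
    class TinyPolicy(nn.Module):
        def __init__(self):
            super().__init__()
            self.nets = nn.ModuleDict({
                "encoder": nn.Linear(10, 8),
                "mlp": nn.Linear(8, 8),
                "decoder": nn.Linear(8, 4),
            })

        def forward(self, x):
            enc_outputs = self.nets["encoder"](x)
            self.last_latent = enc_outputs[-1, :]    # cached, as in the snippet above
            self.latent = enc_outputs
            mlp_out = self.nets["mlp"](enc_outputs)
            return self.nets["decoder"](mlp_out)

    policy = TinyPolicy()
    actions = policy(torch.randn(5, 10))                   # batch of 5 dummy observations
    print(policy.latent.shape, policy.last_latent.shape)   # torch.Size([5, 8]) torch.Size([8])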

Download the data

Use this link to download the dataset and put the files in the bed/dataset folder, or use the following commands:

cd bed
mkdir dataset
cd dataset

# download can_task data
wget https://universitysystemnh-my.sharepoint.com/:u:/g/personal/mb1215_usnh_edu/EdaW2mZ4mRpGg0CKbTEwG5UBbKCxqXqlGnyIHdhL-o8Ahw?download=1 -O layman_v1_can_510.hdf5

# download square_task data
wget https://universitysystemnh-my.sharepoint.com/:u:/g/personal/mb1215_usnh_edu/ERbUWCBrp1xAj49yUOmoHJ8B4x6G_1EgNaUNHiZsSd_V7g?download=1 -O layman_v1_square_180.hdf5

# download lift_task data
wget https://universitysystemnh-my.sharepoint.com/:u:/g/personal/mb1215_usnh_edu/EQyR2TBr5aZKusxWCnn0Y6ABJJXDNeHZL2vhUCq-4__9Sw?download=1 -O layman_v1_lift_260.hdf5
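
Optionally, sanity-check a downloaded file with h5py. The group names below ("data" for demos, "mask" for filter keys) follow the standard Robomimic HDF5 layout and are an assumption here, not output from this repository:

    import h5py

    # Quick sanity check of a downloaded file (assumes the standard Robomimic HDF5 layout).
    with h5py.File("dataset/layman_v1_can_510.hdf5", "r") as f:
        print("number of demos:", len(f["data"]))
        if "mask" in f:
            print("filter keys:", list(f["mask"].keys()))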

Create a configuration file

We use the same configuration file format as Robomimic. See the "bed/configs/can/bed_layman_can_p20b.json" file for an example, and create a similar configuration file for other tasks. Depending on the dataset, you may want to change the following three lines:

    "data": "dataset/layman_v1_can_510.hdf5",
    "output_dir": "/home/ns1254/bed/training_data",
    "hdf5_filter_key": "p20b",

Training BED

Run the following command to train the BED model:

python bed_training_path.py --config config_full_path.json --m 0.8 --accelerate 40 --gscale 5

Example: Train BED on can data to detect 80% as good and 20% as bad. You can press Ctrl+C for early stopping.

python bed_training_path.py --config /home/ns1254/bed/configs/can/bed_layman_can_p20b.json --m 0.8 --accelerate 40 --gscale 5

Explanation of the arguments:

  • --config: path to the configuration file
  • --m: fraction of demos to keep (e.g. 0.8 keeps 80% of the demos)
  • --accelerate: switch to a higher learning rate after this epoch for faster convergence
  • --gscale: weight of the path loss

Expected w: There are 150 demos in the can dataset, so for m=0.8 we expect 30 of them (150 × 0.2 = 30) to get $w\approx0$ and 120 of them to get $w\approx1$. Rounding makes the values binary. Here is the expected w vector before rounding:

w:  [ 1.    1.    1.    1.    1.    1.    1.    1.    1.    1.    1.    1.
  1.    1.    1.    1.    1.    1.    1.    1.    0.49  1.    1.    1.
  1.    1.    1.    1.    1.    1.    1.    1.    1.    1.    1.    1.
  1.    1.    1.    1.    1.    1.    1.    1.    1.    1.    1.    1.
  1.    1.    1.    1.    1.    1.    1.    1.    1.    0.96  1.    1.
  1.    1.    1.    1.    1.    1.    1.    1.    1.    1.    1.    1.
  0.73  1.    1.    1.    1.    0.99  1.    1.    1.    1.    1.    1.
  1.    1.    0.99  1.    1.    0.99  0.91  1.    1.    1.    1.    1.
  1.    0.62  0.99  0.99  1.    1.    1.    0.48  0.08  0.72  1.    1.
  0.84  1.    1.    1.    1.    1.    0.99  0.99  1.    1.    1.    1.
 -0.   -0.   -0.    0.51 -0.   -0.   -0.   -0.   -0.   -0.   -0.   -0.
 -0.   -0.   -0.   -0.   -0.   -0.   -0.    0.   -0.    0.59 -0.    0.
  0.37 -0.   -0.   -0.    0.    1.  ]

View the training logs in the logs folder to see how training progresses. "logs/demos.txt" lists the names of all demos used in training, and "logs/masked_0.txt" lists the names of the demos detected as bad.
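
The bad-demo list follows directly from rounding w. The snippet below is an illustration using a few of the values printed above, not code from the repository:

    import numpy as np

    # Illustration only: round a learned w vector and list the indices detected as bad.
    # Here w holds a few of the values printed above; in practice it has one entry per demo.
    w = np.array([1.0, 1.0, 0.49, 1.0, 0.96, 0.73, 0.08, -0.0, 0.51, -0.0])
    bad_idx = np.where(np.round(np.clip(w, 0.0, 1.0)) == 0)[0]
    print(f"{len(bad_idx)} demos rounded to 0 (detected as bad):", bad_idx)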

It took 47 minutes to train on a single NVIDIA A40 GPU.

Generate good and bad videos from the log data

python generate_videos.py --dataset_path dataset/layman_v1_can_510.hdf5 --bed_trained_dir training_data/bed_can/20250213192737

Replace dataset_path and bed_trained_dir with the correct paths. The generated videos will be saved in the bed_trained_dir/videos_good and bed_trained_dir/videos_bad folders.
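
If you just want to view a single demo without the provided script, a sketch like the one below can render its camera frames. The Robomimic HDF5 layout, the "agentview_image" observation key, and the demo name are assumptions made for the example:

    import h5py
    import imageio

    # Sketch only (not generate_videos.py): write one demo's camera frames to an mp4.
    # Assumes the Robomimic HDF5 layout and an "agentview_image" observation key.
    demo_name = "demo_0"    # placeholder demo name
    with h5py.File("dataset/layman_v1_can_510.hdf5", "r") as f:
        frames = f[f"data/{demo_name}/obs/agentview_image"][:]

    with imageio.get_writer(f"{demo_name}.mp4", fps=20) as writer:
        for frame in frames:
            writer.append_data(frame)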

Example videos: one demo detected as good and one detected as bad (see the video clips in the repository).

Acknowledgement
