# Simulation and Reinforcement Learning for DJI Pupper v2 Robot
Supported operating systems:

- Mac
- Linux
- Windows (untested, not recommended)
Install the Xcode command line tools (macOS only):

```
xcode-select --install
```

If you already have the tools installed, you'll get an error saying so, which you can ignore.
Install miniconda, then:

```
conda create --name rl_pupper python=3.7
conda activate rl_pupper
mkdir -p ~/pupper_ws/src
cd ~/pupper_ws/src
git clone --recurse-submodules git@github.com:montrealrobotics/puppersim.git
cd puppersim
pip install -e .
```
You will also need to use this version of PyBullet:

```
cd ~/pupper_ws/src
git clone https://github.com/montrealrobotics/bullet3.git
cd bullet3
pip install -e .
```
Then, to verify the installation, run:

```
python3 puppersim/pupper_example.py
```

You should see the PyBullet GUI pop up with Pupper doing an exercise.
If `pupper_example.py` is running slowly, stop it, then run:

```
python3 puppersim/pupper_minimal_server.py
```

then, in a new terminal tab/window:

```
python3 puppersim/pupper_example.py --render=False
```

This runs the visualizer GUI and the simulator as two separate processes.
From the outer `puppersim` folder, run:

```
python puppersim/pupper_train_ppo_cont_action.py --seed 1 --env-id PupperGymEnv-v0 \
    --total-timesteps 5000 --save-model --capture_video
```

Depending on your computer specs, each training iteration will take around 1–5 seconds.
Troubleshooting:

- PyBullet hangs when starting training. Possible cause: you have multiple suspended PyBullet clients. Solution: restart your computer.
If you want to save a policy, create a folder within `puppersim/data` named with the type of gait and the date, e.g. `pretrained_trot_1_22_22`. From the `data` folder, copy the following files into the folder you just made:

- the `.npz` policy file you want, e.g. `lin_policy_plus_latest.npz`
- `log.txt`
- `params.json`

From `puppersim/config`, also copy the `.gin` file you used to train the robot, e.g. `pupper_pmtg.gin`, into the folder you just made. When you run a policy on the robot, make sure your `pupper_robot_*_.gin` file matches the `pupper_pmtg.gin` file you saved.

Then add a `README.md` in the folder with a brief description of what you did, including your motivation for saving this policy.
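The archiving steps above can be sketched as a short shell session. This runs in a scratch directory with `touch` standing in for the real training outputs, so it is self-contained; the folder name, `.gin` file, and README text are examples to substitute with your own:

```shell
# Sketch of the policy-archiving steps (example names throughout).
cd "$(mktemp -d)"                               # scratch dir so this sketch is self-contained
mkdir -p puppersim/data puppersim/config
# Stand-ins for the real training outputs (replace with your actual files):
touch puppersim/data/lin_policy_plus_latest.npz \
      puppersim/data/log.txt \
      puppersim/data/params.json \
      puppersim/config/pupper_pmtg.gin

DEST=puppersim/data/pretrained_trot_1_22_22     # gait type + date
mkdir -p "$DEST"
# Policy file, training log, and parameters from puppersim/data:
cp puppersim/data/lin_policy_plus_latest.npz \
   puppersim/data/log.txt \
   puppersim/data/params.json "$DEST"
# The .gin config used for training, from puppersim/config:
cp puppersim/config/pupper_pmtg.gin "$DEST"
# A short note on what this policy is and why it was saved:
echo "Trot policy trained with default settings; saved as a baseline." > "$DEST/README.md"
```

When you later run this policy, point `--expert_policy_file` and `--json_file` at the copies inside the archive folder rather than the latest files in `data`, which later training runs will overwrite.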
TODO
Linux
Set up Avahi (once per computer):

```
sudo apt install avahi-*
```

Run the following; you should see Pupper's IP address:

```
avahi-resolve-host-name raspberrypi.local -4
```
Set up passwordless login to your Pupper (once per computer; the original Raspberry Pi password is `raspberry`):

```
ssh-keygen
cat ~/.ssh/id_rsa.pub | ssh pi@`avahi-resolve-host-name raspberrypi.local -4 | awk '{print $2}'` 'mkdir -p .ssh/ && cat >> .ssh/authorized_keys'
```
Mac
Set up passwordless login to your Pupper (once per computer; the original Raspberry Pi password is `raspberry`):

```
ssh-keygen
cat ~/.ssh/id_rsa.pub | ssh pi@raspberrypi.local 'mkdir -p .ssh/ && cat >> .ssh/authorized_keys'
```
- Turn on the Pupper robot, wait for it to complete the calibration motion.
- Connect your laptop to the Pupper using a USB-C cable
- Run the following command on your laptop:

  ```
  ./deploy_to_robot.sh python3 puppersim/puppersim/pupper_ars_run_policy.py --expert_policy_file=puppersim/data/lin_policy_plus_latest.npz --json_file=puppersim/data/params.json --run_on_robot
  ```
Navigate to the outer `puppersim` folder and run:

```
python3 puppersim/pupper_server.py
```
Clone the heuristic controller:

```
git clone https://github.com/stanfordroboticsclub/StanfordQuadruped.git
cd StanfordQuadruped
git checkout dji
```
In a separate terminal, navigate to `StanfordQuadruped` and run:

```
python3 run_djipupper_sim.py
```
Keyboard controls:
- wasd --> moves robot forward/back and left/right
- arrow keys --> turns robot left/right
- q --> activates/deactivates robot
- e --> starts/stops trotting gait
- ijkl --> tilts and raises robot