Based on: https://github.com/Zhehui-Huang/quad-swarm-rl
Paper: https://arxiv.org/pdf/2109.07735
In the original repository, I encountered persistent issues running the built-in visualization on my machine (Ubuntu 24.04). Although the author confirmed it works on their own 24.04 setup, the problem appeared to be specific to my environment. To help others who might face similar compatibility issues, I implemented a custom 3D visualization using Three.js. While extending the simulation, I also added a new evader-pursuit scenario to enrich the swarm behavior dynamics.
Enhanced Visualization
- Integrated Three.js as a browser-based 3D visualization interface

Added New Scenario
- The classic figure-eight ∞ curve (Bernoulli's lemniscate), extended to 3D
- Scenario name: ep_figure8_lemniscate (for evader-pursuit)
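For reference, a figure-eight trajectory like the one above can be sampled from the standard parametrization of Bernoulli's lemniscate. This is only an illustrative sketch: the x/y equations are the textbook curve, but the vertical (z) component, the parameter names, and the sampling density here are my own assumptions, not the repository's implementation.

```python
import math

def lemniscate_3d(t, a=1.0, z_amp=0.25):
    """Point on a Bernoulli lemniscate (figure-eight), lifted to 3D.

    x/y follow the standard parametrization of the lemniscate of
    Bernoulli; the z oscillation is an illustrative choice.
    """
    denom = 1.0 + math.sin(t) ** 2
    x = a * math.cos(t) / denom
    y = a * math.sin(t) * math.cos(t) / denom
    z = z_amp * math.sin(2.0 * t)  # gentle vertical wave (hypothetical)
    return x, y, z

# Sample one full period of the curve as a goal trajectory
trajectory = [lemniscate_3d(2 * math.pi * i / 200) for i in range(200)]
```

An evader following this trajectory traces the crossing point of the ∞ repeatedly, which forces pursuers to anticipate rather than simply chase.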
conda create -n quadrotor_swarm python=3.11
conda activate quadrotor_swarm
git clone https://github.com/ali-kosemen/quadrotor-swarm.git
cd quadrotor-swarm
pip install -e .
To train a single drone:
./train_single.sh
In quadrotor_swarm/swarm_rl/runs/single_quad/baseline.py, you can adjust parameters such as quads_mode, which selects the scenario.
To train multiple drones (default: 8 drones):
./train_multi.sh
In quadrotor_swarm/swarm_rl/runs/multi_quad/baseline.py, you can adjust parameters such as quads_mode, which selects the scenario.
To monitor the experiments with TensorBoard:
tensorboard --logdir=./
First, start the HTTP server for the web UI:
./web_server.sh
Next, launch the WebSocket server to stream real-time data to the web UI:
./web_inference.sh APPO quadrotor_multi_websocket <train_dir> <experiment_name>
<experiment_name> and <train_dir> are set in the config.json file of your trained model.
Finally, open the web UI at:
http://localhost:8083
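To give a feel for the data flowing over the WebSocket, the sketch below serializes one simulation step into a JSON frame that a Three.js client could consume. The field names (`t`, `drones`, `pos`, `quat`) are an assumed schema for illustration only; the actual message format is defined by the fork's web_inference script and its Three.js client.

```python
import json
import time

def encode_frame(positions, quats):
    """Serialize one simulation step for the web UI.

    positions: iterable of (x, y, z) world coordinates per drone.
    quats: iterable of (w, x, y, z) orientation quaternions per drone.
    The schema here is hypothetical, chosen for readability.
    """
    return json.dumps({
        "t": time.time(),  # timestamp lets the client interpolate frames
        "drones": [
            {"pos": list(p), "quat": list(q)}
            for p, q in zip(positions, quats)
        ],
    })

# One hovering drone at 1 m altitude, identity orientation
frame = encode_frame(
    positions=[(0.0, 0.0, 1.0)],
    quats=[(1.0, 0.0, 0.0, 0.0)],
)
```

On the browser side, each received frame would update the position and quaternion of the corresponding Three.js mesh once per message.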
I have trained models for several scenarios:
- Swarm vs Swarm (8 quadrotors) (not fully stable, but still demonstrates reasonable behavior)
- Swap Goals (8 quadrotors) (not fully stable, but still demonstrates reasonable behavior)

Drag and drop the downloaded model folder (its name starts with test_) directly into train_dir. Then pass the experiment_name and train_dir values from its config.json to the inference command.
If you use this repository in your work or otherwise wish to cite it, please cite the following papers.
QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control
ICRA Workshop: The Role of Robotics Simulators for Unmanned Aerial Vehicles, 2023
A drone simulator for reinforcement learning.
@article{huang2023quadswarm,
title={Quadswarm: A modular multi-quadrotor simulator for deep reinforcement learning with direct thrust control},
author={Huang, Zhehui and Batra, Sumeet and Chen, Tao and Krupani, Rahul and Kumar, Tushar and Molchanov, Artem and Petrenko, Aleksei and Preiss, James A and Yang, Zhaojing and Sukhatme, Gaurav S},
journal={arXiv preprint arXiv:2306.09537},
year={2023}
}
Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to Multiple Quadrotors
IROS 2019
Single drone: a unified control policy adaptable to various types of physical quadrotors.
@inproceedings{molchanov2019sim,
title={Sim-to-(multi)-real: Transfer of low-level robust control policies to multiple quadrotors},
author={Molchanov, Artem and Chen, Tao and H{\"o}nig, Wolfgang and Preiss, James A and Ayanian, Nora and Sukhatme, Gaurav S},
booktitle={2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
pages={59--66},
year={2019},
organization={IEEE}
}
Decentralized Control of Quadrotor Swarms with End-to-end Deep Reinforcement Learning
CoRL 2021
Multiple drones: a decentralized control policy for multiple drones in obstacle-free environments.
@inproceedings{batra21corl,
author = {Sumeet Batra and
Zhehui Huang and
Aleksei Petrenko and
Tushar Kumar and
Artem Molchanov and
Gaurav S. Sukhatme},
title = {Decentralized Control of Quadrotor Swarms with End-to-end Deep Reinforcement Learning},
booktitle = {5th Conference on Robot Learning, CoRL 2021, 8-11 November 2021, London, England, {UK}},
series = {Proceedings of Machine Learning Research},
publisher = {{PMLR}},
year = {2021},
url = {https://arxiv.org/abs/2109.07735}
}
Collision Avoidance and Navigation for a Quadrotor Swarm Using End-to-end Deep Reinforcement Learning
ICRA 2024
Multiple drones: a decentralized control policy for multiple drones in obstacle-dense environments.
@inproceedings{huang2024collision,
title={Collision avoidance and navigation for a quadrotor swarm using end-to-end deep reinforcement learning},
author={Huang, Zhehui and Yang, Zhaojing and Krupani, Rahul and {\c{S}}enba{\c{s}}lar, Bask{\i}n and Batra, Sumeet and Sukhatme, Gaurav S},
booktitle={2024 IEEE International Conference on Robotics and Automation (ICRA)},
pages={300--306},
year={2024},
organization={IEEE}
}
HyperPPO: A Scalable Method for Finding Small Policies for Robotic Control
ICRA 2024
A method for finding the smallest control policy for deployment: train once, then obtain many models of different sizes via HyperNetworks. Remarkably, only four neurons are needed to control a quadrotor.
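The core idea, one generator producing policy weights for target networks of many sizes, can be sketched as follows. This is a conceptual toy, not HyperPPO: a real hypernetwork is itself trained end to end, whereas here the generator is a fixed pseudo-random projection, and all sizes and names are illustrative.

```python
import random

random.seed(0)

def linear(weights, bias, x):
    """Plain dense layer: y = Wx + b."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]

def hypernetwork(arch_embedding, n_in, n_hidden, n_out):
    """Map an architecture embedding to a full set of policy weights.

    One generator, many target-network sizes: calling this with a
    different n_hidden yields a differently sized policy without
    retraining the generator (which here is just a random projection).
    """
    def gen(rows, cols):
        # Pseudo-weights modulated by the embedding (toy stand-in for
        # the learned weight generator of a real hypernetwork)
        scale = sum(arch_embedding)
        return [[random.uniform(-1, 1) * scale for _ in range(cols)]
                for _ in range(rows)]

    w1, b1 = gen(n_hidden, n_in), [0.0] * n_hidden
    w2, b2 = gen(n_out, n_hidden), [0.0] * n_out

    def policy(obs):
        h = [max(0.0, v) for v in linear(w1, b1, obs)]  # ReLU hidden layer
        return linear(w2, b2, h)

    return policy

# Request a policy with a 4-neuron hidden layer (sizes are illustrative)
tiny_policy = hypernetwork([0.1, 0.2], n_in=18, n_hidden=4, n_out=4)
action = tiny_policy([0.0] * 18)  # one thrust command per motor
```

The same generator call with `n_hidden=64` would produce a larger policy from the identical embedding machinery, which is the "train once, get many sizes" property the paper exploits.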
@inproceedings{hegde2024hyperppo,
title={Hyperppo: A scalable method for finding small policies for robotic control},
author={Hegde, Shashank and Huang, Zhehui and Sukhatme, Gaurav S},
booktitle={2024 IEEE International Conference on Robotics and Automation (ICRA)},
pages={10821--10828},
year={2024},
organization={IEEE}
}