📃 Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving

🔥 Source Code Released! 🔥

[arXiv]

This work introduces Distributional Proxy Value Propagation (D-PVP), which integrates human intention into distributional reinforcement learning, enabling efficient policy learning with minimal human intervention.
A shared control mechanism and policy confidence evaluation algorithm dynamically balance human-guided and self-learning policies, ensuring both safety and performance in autonomous driving.
The proposed method is validated in both MetaDrive and real-world urban driving using a sensor-equipped UGV. Extensive experiments demonstrate superior performance in terms of sample efficiency, safety, and generalization across diverse traffic scenarios.

Email: lizeqiao@tju.edu.cn

Framework

Demonstration

Training example using C-HAC

Testing example

C-HAC Real-World Driving Demonstration – Route 1

C-HAC Real-World Driving Demonstration – Route 2

User Guide

Clone the repository.

cd to your workspace and clone the repo.

git clone https://github.com/lzqw/C-HAC.git

Create a new Conda environment.

cd to your workspace:

conda create -n CHAC python=3.9

Activate virtual environment.

conda activate CHAC

Install Pytorch

Select the correct version based on your cuda version and device (cpu/gpu):

pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

Install other requirements.

# Install the requirements.
pip install -r requirements.txt

Training

Modify the sys path in example_train file, and run:

python train_dsact_pvp_rl.py

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.idea		.idea
__pycache__		__pycache__
env_gym		env_gym
example_train		example_train
networks		networks
pic		pic
training		training
utils		utils
README.md		README.md
dsac_v2_pvp.py		dsac_v2_pvp.py
dsac_v2_pvp_rl.py		dsac_v2_pvp_rl.py
requirement.txt		requirement.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

📃 Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving

🔥 Source Code Released! 🔥

[arXiv]

Framework

Demonstration

Training example using C-HAC

Testing example

C-HAC Real-World Driving Demonstration – Route 1

C-HAC Real-World Driving Demonstration – Route 2

User Guide

Clone the repository.

Create a new Conda environment.

Activate virtual environment.

Install Pytorch

Install other requirements.

Training

About

Uh oh!

Releases

Packages

Languages

lzqw/C-HAC

Folders and files

Latest commit

History

Repository files navigation

📃 Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving

🔥 Source Code Released! 🔥

[arXiv]

Framework

Demonstration

Training example using C-HAC

Testing example

C-HAC Real-World Driving Demonstration – Route 1

C-HAC Real-World Driving Demonstration – Route 2

User Guide

Clone the repository.

Create a new Conda environment.

Activate virtual environment.

Install Pytorch

Install other requirements.

Training

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages