Beluga-AI-Challenge-Toolkit-BDQ

This repository explores solutions using Branching Dueling Q-Networks (BDQ) for sequential decision making in the Airbus BelugaXL cargo logistics planning problem. The toolkit is developed to tackle the Airbus Beluga AI Challenge and provides an implementation using a custom reinforcement learning agent.

Overview

The logistics problem involves selecting an optimal sequence of actions which could be broken down to choosing the appropriate jig, action, and destination to efficiently load and transport cargo. This repository models the problem as a sequential decision process, where each step (e.g., jig selection, action selection, destination selection) is conditioned on prior decisions (state).

The Branching Q-Network (BDQ) is used to decompose the action space and learn Q-values for each action component separately, allowing for better exploration and evaluation of multi-branch decision spaces.

Repository Structure

🔁 `Main Code`

The core implementation of the Branching Q-Network (BDQ) algorithm used for solving the logistics problem.

RL_agent.py: BDQ agent implementation.
prioritised_experience_replay.py: Prioritised experience replay buffer used for training.
Beluga_custom_GYM.py: Custom GYM environment used to train the BDQ model for the Airbus Beluga cargo logistics problem.

⚙️ `Hyperparameters`

hyperparameters.yaml contains configuration used for training and evaluation. You can modify these to experiment with different learning rates, batch sizes, discount factors, and more.

📁 `Example Instances`

Example problem cases for evaluation. Each folder represents a distinct instance of the logistics scheduling problem.

_three_jigs/
_four_jigs/
_six_jigs/

Each directory contains JSON files such as:

problem_s3_j3_r2_oc00_f3.json
problem_s3_j4_r2_oc00_f3.json

which represent logistics problems with different configurations (e.g., number of jigs, racks, etc.).

Getting Started

Clone the repository:

  git clone https://github.com/leonardfelix/Beluga-AI-Challenge-Toolkit-BDQ.git
  cd Beluga-AI-Challenge-Toolkit-BDQ

Install required packages:

  pip install -r requirements.txt

Train and evaluate the BDQ agent:

Modify the hyperparameters.yaml file if needed and run:

python evaluate_instance.py --input "[folder_name]/[problem_name.json]"

e.g.

python evaluate_instance.py --input "_three_jigs/problem_s3_j3_r2_oc00_f3.json"

This will:

Load the specified logistics problem.
Train the BDQ agent on the environment to make sequential decisions.
Output the resulting plan and statistics.

Notes

The BDQ architecture is particularly well-suited for hierarchical or compositional action spaces.
You can add your own problem instances using the JSON schema of the existing files or generate using 'generate_instance.py'.

Acknowledgements

This repository was developed as part of the Airbus Beluga AI Challenge and is based on the logistics planning task involving BelugaXL cargo operations. This code was built upon the Beluga-AI-Challenge-Toolkit, developed by the team at Airbus and Tuples Trustworthy AI.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
_four_jigs		_four_jigs
_six_jigs		_six_jigs
_three_jigs		_three_jigs
_twelve_jigs		_twelve_jigs
beluga_lib		beluga_lib
encoder		encoder
evaluation		evaluation
generator		generator
runs		runs
skd_domains		skd_domains
utils		utils
.gitignore		.gitignore
Beluga_custom_GYM.py		Beluga_custom_GYM.py
DQN.py		DQN.py
LICENSE.md		LICENSE.md
README.md		README.md
RL_agent.py		RL_agent.py
RL_utils.py		RL_utils.py
__init__.py		__init__.py
branch_DQN_planner.py		branch_DQN_planner.py
commands.txt		commands.txt
encode_instances.py		encode_instances.py
evaluate_instance.py		evaluate_instance.py
generate_instance.py		generate_instance.py
generate_instances.py		generate_instances.py
generate_simulate_test.py		generate_simulate_test.py
generate_solve_rllib_test.py		generate_solve_rllib_test.py
hyperparameters.yaml		hyperparameters.yaml
json2PDDL.py		json2PDDL.py
prioritised_experience_replay.py		prioritised_experience_replay.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Beluga-AI-Challenge-Toolkit-BDQ

Overview

Repository Structure

🔁 `Main Code`

⚙️ `Hyperparameters`

📁 `Example Instances`

Getting Started

Notes

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 5

Uh oh!

Languages

License

leonardfelix/Beluga-AI-Challenge-Toolkit-BDQ

Folders and files

Latest commit

History

Repository files navigation

Beluga-AI-Challenge-Toolkit-BDQ

Overview

Repository Structure

🔁 Main Code

⚙️ Hyperparameters

📁 Example Instances

Getting Started

Notes

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 5

Uh oh!

Languages

🔁 `Main Code`

⚙️ `Hyperparameters`

📁 `Example Instances`

Packages