MFVRP: Mean-Field RL for Large-Scale Unit-Capacity Pickup-and-Delivery

This repository contains the reference implementation for the paper "Mean-Field RL for Large-Scale Unit-Capacity Pickup-and-Delivery Problems". See the paper: Mean-Field RL for Large-Scale Unit-Capacity Pickup-and-Delivery Problems.

Installation

Create a fresh Python environment (3.9 recommended).
Install exact dependencies:
```
pip install -r requirements.txt
```
Install JAX with CUDA (adjust CUDA version as needed):
```
pip install -U "jax[cuda12]"
```

Datasets and Preprocessing

Generate datasets (uniform, clustered, cities) as needed using scripts under datasets/.
Run k-means preprocessing for clustering datasets:
```
python preprocessing.py
```

Running Experiments

Main entrypoint is exp_run.py which wraps PPO training for the mean-field VRP environment. Example usage:

python exp_run.py \
  --seed=0 \
  --config=0 \
  --load_datasets=0 \
  --dataset=clustered \
  --k=5 \
  --N=500 \
  --timesteps=300000000 \
  --save_full=1

Saved artifacts are written under results/flax_ckpt/<exp_dir>/ including learned parameters and brief metrics.

Provided Experiment Drivers

train_mfvrp_g.py: Training for limiting MFVRP problem with staged step sizes. (MFVRP-G)
train_mfvrp_g_extra.py: Same as above with --extra_run=1; used for a rank-1 parametrization of actions.
eval_finetune_sweeps.py: Finetuning and cross-dataset evaluation sweeps (uniform/cities) using a pretrained checkpoint. (MFVRP-F)

Plotting

After experiments, generate figures and tables via the plotting and eval scripts.

Ensure logs (exp0.log, etc.) and checkpoints exist as expected before plotting.

Baselines

We include simple baselines for comparison:

greedy_approach/greedy_approach.py for greedy baseline
pyvrp_approach/run_pyvrp.py for PyVRP baseline
ortools/vrp.py for OR-Tools baseline

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
datasets		datasets
greedy_approach		greedy_approach
ortools		ortools
purejaxrl		purejaxrl
pyvrp_approach		pyvrp_approach
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
distance_matrix.py		distance_matrix.py
eval_F_table.py		eval_F_table.py
eval_G_table.py		eval_G_table.py
exp_run.py		exp_run.py
plot_MFC_training_curve.py		plot_MFC_training_curve.py
plot_higher_k.py		plot_higher_k.py
plot_leftover_fraction_vs_N.py		plot_leftover_fraction_vs_N.py
plot_mf_vs_finite_error_over_time.py		plot_mf_vs_finite_error_over_time.py
plot_rank_one_approx.py		plot_rank_one_approx.py
preprocessing.py		preprocessing.py
requirements.txt		requirements.txt
train_finetune_sweeps.py		train_finetune_sweeps.py
train_mfvrp_g.py		train_mfvrp_g.py
train_mfvrp_g_rank1.py		train_mfvrp_g_rank1.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MFVRP: Mean-Field RL for Large-Scale Unit-Capacity Pickup-and-Delivery

Installation

Datasets and Preprocessing

Running Experiments

Provided Experiment Drivers

Plotting

Baselines

About

Uh oh!

Releases

Packages

Languages

License

tudkcui/MFVRP

Folders and files

Latest commit

History

Repository files navigation

MFVRP: Mean-Field RL for Large-Scale Unit-Capacity Pickup-and-Delivery

Installation

Datasets and Preprocessing

Running Experiments

Provided Experiment Drivers

Plotting

Baselines

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages