This repository provides code and tutorials to help participants get started with developing dose prediction models for the GDP-HMM Challenge at AAPM 2025.
- Announcements and News
- What this repo does
- What this repo does NOT
- Important Timeline
- Simplified Background
- Data Understanding and Visualization
- Get Started and Training
- Evaluation Methods
- Prizes and Publication Plan
- Challenge Organizers
- Citation
🎉 [06/10/2025] The top six solutions have been released here, including train/inference code, pretrained model, and simple tech report!
🎉 [05/27/2025] The challenge rankings and results have been released here. Congrats to the winners again!
🎉 [05/20/2025] The challenge rankings and results have been provisionally released to participants of final phase. We will make them public one week later. Congrats to the winners!
🚀 [04/20/2025] The final test phase is started. Please check the docker_tutorial.ipynb to prepare docker, which need to be submitted through challenge platform. Participants also are required to provide documentation of their methods (up to 2 pages), source code for training and testing, and pre-trained model; which could be zip file or accessible web link.
🚀 [03/11/2025] We updated the Arxiv to include some analyses about the simple baseline; timeline for testing is adjusted a bit.
🚀 [02/15/2025] The Phase II is open for submission. See tutorial at evaluation.ipynb and Forum.
🚀 [02/11/2025] Docker tutorial/example is included, please check it at docker_tutorial.ipynb. The docker will be needed in the final phase (Phase III).
🚀 [01/21/2025] Input data of validation set have been released; rings structures of previous train_XX.zip have been updated to match the validation set.
🚀 [01/21/2025] DICOMs of the data are released (not necessary for this challenge, but helpful for other research topics in radiotherapy).
🚀 [01/2025] Sanity check submission is open in challenge platform. See tutorial at evaluation.ipynb and Forum.
🚀 [01/2025] The challenge is officially started; training data and pretrained simple baseline are released; see links at get_started_and_train.ipynb!
🚀 [12/2024] The challenge is announced to public; the GitHub repo and challenge platform are made public!
-
Simplified RT Background Tutorial. We provide links of materials (most are written by organizers) introducing the (simplied) background of radiotherapy, dose prediction, and GDP-HMM challenge.
-
Data Understanding and Visualization. Jupyter notebook is available to guide participants in loading and visualizing the data step by step, with detailed explanations.
-
Data Preprocess. We provide code of data preprocessing step by step with Jupyter notebook.
-
Simple baseline. We provide a simple baseline with the backbone of MedNeXt. The ways of integrated condition to the network are motivated from [2]. We include data loader, network, loss function, running command line, etc., to help participants get started.
-
Evaluation Methods. We provide code or/and details of evaluation metrics with Jupyter notebook.
For any questions related to above, welcome to open issues.
To ensure this repository remains focused on the challenge and to maintain fairness for all participants, we discourage opening issues related to the following topics:
-
Urgent matters - For time-sensitive issues like data access problems or data issues, email the lead organizer directly.
-
AI Engineering tricks - We plan not to provide AI implementation suggestions for the challenge.
-
AI Technical paper questions - We plan not to address AI methodology questions in the issues. but welcome any questions related to clinical background and data/task understanding.
For critical concerns, please email the lead organizer. Updates will be shared with all participants as needed.
- January, 2025: Phase I starts. Registration opens. Training dataset and GitHub are made available.
- February, 2025: Phase II starts. Validation datasets are made available. Participants can submit preliminary results and receive feedback on relative scoring for limited number of times.
- April 20, 2025: Phase III starts. Final test datasets are made available.
- May 02, 2025: Deadline for testing phase.
Radiation therapy (RT) is a key cancer treatment used in about 50% of cancer cases. 3D dose prediction is vital in RT planning. References [1,2] offer a good introduction and related works for the challenge background. An simplified introduction to RT and terminologies can be found in the Appendix A of this paper. Additionally, the summary paper from a previous related challenge OpenKBP can be helpful, though the context of this challenge is much more heterogenous.
The input for this task includes CT images, PTVs/OARs masks, beam geometries, and more. The output is a 3D dose distribution generated from Eclipse (Varian’s treatment planning system), following the method described in [1].
An example of Eclipse visualization is shown below. For Jupyter visualization with NPZ files, please refer to data_visual_understand.ipynb. This notebook also introduces important numbers (e.g., prescribed doses for PTVs) and explains how to use them.
We provide a training script with fewer than 40 effective lines of Python/PyTorch code and minimal package dependencies. Participants can quickly start with this and adjust the code to build more advanced models. Our goal is to help user run the training code in 5 minutes, and understand the code logic and parameters in 20 minutes.
Detailed instructions (e.g., challenge registration, data download, baseline) are available in get_started_and_train.ipynb.
Two metrics are used for evaluation. The first is the mean absolute error, masked by the 5 Gy isodose lines and body mask. This metric aims to measure how close the prediction is to the reference under specific settings (e.g., beam geometries). The second metric is the quality index of the deliverable plan generated from the dose prediction, based on the scorecard described in Ref [1].
Example code is provided in evaluation.ipynb.
- Monetary Awards: Total $4,000 for top N teams (the number N will range from 5 to 10, depending on the number of teams participating in Phase III).
- Certificate: Top N teams will receive certificate endorsed organization team and AAPM.
- Authorship: Top N teams (up to two members per team) will be invited as co-authors on a journal manuscript summarizing the challenge. Additional team members will be acknowledged.
- Presentations: Top two teams will be awarded complimentary meeting registration to present their work.
- Internship Opportunities: Lead students of Top five teams will receive priority consideration for internships at Siemens Healthineers AI Center.
- Riqiang Gao, Ph.D., lead organizer, (Siemens Healthineers)
- Florin Ghesu, Ph.D., (Siemens Healthineers)
- Wilko Verbakel, Ph.D., (Varian, a Simens Healthineers company)
- Rafe Mcbeth, Ph.D., (University of Pennsylvania)
- Sandra Meyers, Ph.D., (UC San Diego Health)
- Masoud Zarepisheh, Ph.D., (Memorial Sloan Kettering Cancer Center)
- Ali Kamen, Ph.D., (Siemens Healthineers)
Please contact Riqiang Gao with riqiang.gao@siemens-healthineers.com for further questions or collaborations.
To acknowledge the work of challenge organization team and insights from previous publication, please kindly follow the instructions below.
-
Data citation. If you find the data and challenge helpful for your research, please cite the following technique paper [1] that built the dataset (or/and the challenge summary paper when available).
-
Baseline citation. If you find the method and code for data preprocessing and data loading in the repo (e.g., creating the angle and beam plates) inspiring for your work, please cite [2]. If you use or adjust MedNeXt as your network structure, please cite [3].
In addtion to above, if any data and code resources in this repo is helpful for your research, please kindly cite [1] or/and [2]. Please kindly cite external linked resources accordingly if they helped you.
[1] Riqiang Gao, Mamadou Diallo, Han Liu, Anthony Magliari, Wilko Verbakel, Sandra Meyers, Masoud Zarepisheh, Rafe Mcbeth, Simon Arberet, Martin Kraus, Florin Ghesu, Ali Kamen. Automating High Quality RT Planning at Scale. arXiv preprint arXiv:2501.11803. 2025.
[2] Riqiang Gao, Bin Lou, Zhoubing Xu, Dorin Comaniciu, and Ali Kamen. "Flexible-cm gan: Towards precise 3d dose prediction in radiotherapy." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023.
[3] Saikat Ray, Gregor Koehler, Constantin Ulrich, Michael Baumgartner, Jens Petersen, Fabian Isensee, Paul F. Jaeger, and Klaus H. Maier-Hein. "Mednext: transformer-driven scaling of convnets for medical image segmentation." In International Conference on Medical Image Computing and Computer-Assisted Intervention, 2023.
-
The resources and information provided in this challenge are derived from research findings and are intended solely for research purposes. Commercial availability in the future is not guaranteed.
-
The radiotherapy plans used in this study are not clinically approved. The concept of "Plan Quality" may vary between institutions and individual planners, and should be considered subjective. While the dataset is valuable for training and validating AI models from a computational standpoint, it is the responsibility of the user to thoroughly evaluate the data before applying it to any clinical or research studies.