Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning

This repository contains the code and resources for "Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning," which introduces RetroDFM-R.

Setup

Training Environment

We recommend using the provided Docker image for training. Follow the installation instructions for OpenRLHF to set up your environment.

# Follow OpenRLHF installation guide
# https://github.com/OpenRLHF/OpenRLHF/tree/main?tab=readme-ov-file#installation
# We recommend using the Docker image for training.

Once inside the Docker container, install rdkit:

pip install rdkit

Inference Environment

To set up the environment for inference, follow these steps:

conda create -n retrodfmR python=3.10
conda activate retrodfmR
pip install -r requirements.txt

You can specify the CUDA version if needed (e.g., for CUDA 12.8):

pip install vllm --extra-index-url https://download.pytorch.org/whl/cu128 # Replace cu128 with your CUDA version

Data Preparation

All data used in this work are sourced from publicly accessible datasets:

SMILES/IUPAC Name Conversion Data: Paired SMILES and IUPAC names are obtained from PubChem, used to construct the name conversion data.
Retrosynthesis Data:
- USPTO-50K: Accessed via the GLN repository: https://github.com/Hanjun-Dai/GLN (specifically, the schneider50k dataset).
- USPTO-FULL: Also obtained from the GLN repository: https://github.com/Hanjun-Dai/GLN (specifically, uspto_multi dataset).

We provide the processed test data on Hugging Face: https://huggingface.co/datasets/OpenDFM/retrodfm-R-inference

Training

Ensure your Docker container is successfully launched before initiating training.

Navigate to the train directory:

cd train

Then, execute the following training script:

For Continual Pretraining:

bash examples/scripts/train_continual_pretrain.sh

For Cold-Start Distillation:

bash examples/scripts/train_cold_start_distill.sh

For Reinforcement Learning:

On USPTO-50K:

bash examples/scripts/train_dapo_retrodfm_R_50k.sh

On USPTO-FULL:

bash examples/scripts/train_dapo_retrodfm_R_full.sh

Inference

After downloading the processed test data (as mentioned in Data Preparation), you can run the inference script. The following command will perform inference using beam search and test augmentation:

conda activate retrodfmR
cd inference && bash eval.sh

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
inference		inference
train		train
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning

Setup

Training Environment

Inference Environment

Data Preparation

Training

Inference

About

Uh oh!

Releases

Packages

Languages

License

OpenDFM/RetroDFM-R

Folders and files

Latest commit

History

Repository files navigation

Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning

Setup

Training Environment

Inference Environment

Data Preparation

Training

Inference

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages