We present Ctrl-DNA, a constrained reinforcement learning framework for the controllable design of cell-type-specific regulatory DNA sequences. Ctrl-DNA fine-tunes autoregressive genomic language models by framing sequence generation as a biologically informed constrained optimization problem. Using a value-model-free, Lagrangian-guided policy optimization strategy, Ctrl-DNA iteratively refines sequences to maximize gene expression in a target cell type while suppressing activity in off-target cell types. Applied to human enhancer and promoter datasets, Ctrl-DNA generates biologically plausible, high-fitness sequences enriched for key transcription factor motifs, achieving state-of-the-art specificity and performance in regulatory sequence design.
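At a high level, the Lagrangian relaxation folds the off-target constraints into the reward via multipliers that are themselves updated by dual ascent. The sketch below is illustrative only, not the repository's implementation; the function names, the per-cell-type tolerance `epsilon`, and the dual learning rate are hypothetical.

```python
import numpy as np

def lagrangian_reward(r_target, r_offtarget, lam, epsilon):
    # Scalarized objective: reward expression in the target cell type and
    # penalize each off-target cell type's activity above tolerance epsilon.
    # r_offtarget and lam are arrays with one entry per off-target cell type.
    return r_target - np.sum(lam * (r_offtarget - epsilon))

def dual_ascent_step(lam, r_offtarget, epsilon, lr=0.01):
    # Multipliers grow while a constraint is violated (activity > epsilon)
    # and decay once it is satisfied; the projection keeps them non-negative.
    return np.maximum(0.0, lam + lr * (r_offtarget - epsilon))
```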
Clone the repository and install the required dependencies:
```bash
git clone https://github.com/bowang-lab/Ctrl-DNA.git
cd Ctrl-DNA
pip install -r requirements.txt
```
We follow the preprocessing pipeline from Genentech/regLM. Please refer to their repository for detailed instructions.
To train the model on the promoter and enhancer datasets using our method, run:
```bash
bash reinforce_lagrange_promoters.sh
bash reinforce_lagrange_enhancers.sh
```
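Both scripts drive the same value-model-free policy-gradient update. As a rough mental model (a minimal sketch assuming a PyTorch autoregressive policy; `log_probs` and `rewards` are hypothetical stand-ins for quantities the actual training loop computes), each step scores sampled sequences with the Lagrangian-shaped reward and applies a REINFORCE update with a batch-mean baseline instead of a learned critic:

```python
import torch

def reinforce_step(optimizer, log_probs, rewards):
    # log_probs: summed token log-probabilities of each sampled sequence
    #            under the current policy (shape: [batch]).
    # rewards:   Lagrangian-shaped rewards for the same sequences.
    advantage = rewards - rewards.mean()              # batch-mean baseline
    loss = -(advantage.detach() * log_probs).mean()   # REINFORCE objective
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

Skipping a learned value model keeps fine-tuning lightweight: the policy itself is the only trained component.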
Our implementation builds on the following open-source projects:
- regLM: Provided the reward model architecture and data preprocessing pipeline.
- TACO: Supplied the reinforcement learning framework that our method extends.
We sincerely thank the authors of these projects for making their code and datasets publicly available.
If you find this work useful, please cite our paper:
```bibtex
@misc{chen2025ctrldnacontrollablecelltypespecificregulatory,
      title={Ctrl-DNA: Controllable Cell-Type-Specific Regulatory DNA Design via Constrained RL},
      author={Xingyu Chen and Shihao Ma and Runsheng Lin and Jiecong Lin and Bo Wang},
      year={2025},
      eprint={2505.20578},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2505.20578},
}
```