Required Software

To run the pipeline you will need the following software and associated packages:

R (GGally, bio3d, data.table, ggplot2, ggpubr, gplots, msir, scales, viridis)

Required Data

Download the read counts (DiMSum output), fitness scores, MoCHI weights, and required miscellaneous files from here and copy them to an 'analysis_files' folder in your project directory (named 'base_dir'). Also create an 'output_files' directory in 'base_dir'; the results files will be written there.
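The folder layout described above can be prepared with a couple of shell commands. This is a minimal sketch: the default location `$PWD/base_dir` is an assumption chosen for illustration, and the downloaded files still have to be copied into 'analysis_files' by hand.

```shell
# Hypothetical project location; export base_dir beforehand to override it.
base_dir="${base_dir:-$PWD/base_dir}"

# analysis_files holds the downloaded data; output_files receives the results.
mkdir -p "$base_dir/analysis_files" "$base_dir/output_files"

# Confirm that both folders now exist.
ls -d "$base_dir/analysis_files" "$base_dir/output_files"
```

`mkdir -p` is safe to re-run: it leaves existing folders and their contents untouched.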

Installation Instructions

Make sure you have git and conda installed, then run the following (expected install time <10 min):

# Install dependencies (preferably in a fresh conda environment)
conda install -c conda-forge r-ggally r-bio3d r-data.table r-ggplot2 r-ggpubr r-gplots r-msir r-scales r-viridis

Usage

The R Markdown files contain the code to reproduce the figures and results of the computational analyses described in the following publication: The allosteric landscape of the Src kinase (Beltran A et al., 2024). See Required Data for instructions on how to obtain all required data and miscellaneous files before running the pipeline.

R Markdown files are meant to be run in the following order:

1. 00_fitness_reproducibility_and_mochi_evaluation.Rmd
2. 00_mochi_ddGs_onto_structure.Rmd
3. 01_Figure1.Rmd
4. 02_Figure2.Rmd
5. 03_Figure3.Rmd
6. 04_Figure4.Rmd
7. 05_Figure5.Rmd
8. 06_Figure6.Rmd
9. 07_FigureS9_allopredictors.Rmd
10. 08_Figure2_mRNAD_validation.Rmd
11. 09_Figure5_pocket_overlap_stringency.Rmd
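
Because every file name starts with a two-digit prefix, a sorted shell glob visits the files in the order listed above. The helper below is a sketch, not part of the repository: `render_all` is a hypothetical name, and it assumes the `rmarkdown` R package is installed in addition to the packages listed under Required Software. Each file may also need its 'base_dir' path set before it will run.

```shell
# Hypothetical helper: render every numbered .Rmd file in a folder, in order.
#   $1  folder containing the .Rmd files
#   $2  pass "dry-run" to print the commands instead of executing them
render_all() {
  for rmd in "$1"/[0-9][0-9]_*.Rmd; do
    # Skip the literal pattern if the folder contains no matching files.
    [ -e "$rmd" ] || continue
    if [ "${2:-}" = "dry-run" ]; then
      echo "Rscript -e rmarkdown::render('$rmd')"
    else
      # Assumes the rmarkdown package is available; stop at the first failure.
      Rscript -e "rmarkdown::render('$rmd')" || return 1
    fi
  done
}
```

Running `render_all . dry-run` from the repository folder prints the planned commands, which is a quick way to check the order before starting the full run with `render_all .`.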

Additional scripts and software

If you wish to regenerate all the fitness scores and inferred energies from the raw FASTQ files, the following software packages are required:

DiMSum v1.2.9 (pipeline for pre-processing deep mutational scanning data, i.e. FASTQ to fitness). Download the FASTQ files from Gene Expression Omnibus (GEO), accession number GSE247740, to your base directory (base_dir). Shell scripts to run DiMSum and the configuration files can be found in the 'DiMSum' folder in Required Data.

The following software package is required to fit thermodynamic models to the fitness data (DiMSum output):

MoCHI (pipeline to fit thermodynamic models to fitness data, i.e. fitness to energies). To fit all 5 blocks of Src together, the DiMSum fitness tables need two modifications: the sequence of each block must be extended to the full-length Src sequence, and the sign of the kinase activity fitness must be flipped, because fitness and activity are inversely related in the activity assay. The DiMSum output tables, the code to modify them, the modified tables ready for MoCHI fitting, and the shell scripts to execute MoCHI can be found in the 'MoCHI' folder in Required Data.
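
To illustrate the sign change only, the sketch below negates one column of a tab-separated table. It is not the code shipped in the 'MoCHI' folder, which remains the reference: `flip_fitness_sign` is a hypothetical helper, and both the plain-text format and the column name `fitness` are assumptions about how the table is laid out.

```shell
# Hypothetical helper: flip the sign of the 'fitness' column in a
# tab-separated table with a header row. All other columns, including
# any error estimate, pass through unchanged.
flip_fitness_sign() {
  awk 'BEGIN { FS = OFS = "\t" }
       # Header row: remember which column is named "fitness".
       NR == 1 { for (i = 1; i <= NF; i++) if ($i == "fitness") col = i; print; next }
       # Data rows: edit the text of numeric values so no precision is lost;
       # non-numeric entries such as NA are left alone.
       col && $col ~ /^-?[0-9.]/ { $col = ($col ~ /^-/) ? substr($col, 2) : "-" $col }
       { print }' "$1"
}
```

The sign is changed by editing the text of each value rather than by arithmetic, so awk's default number formatting cannot truncate the fitness scores. If no `fitness` column is found, the table is printed unchanged.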
