slide_snake
is a snakemake workflow for preprocessing, alignment, and quantification of spatial RNA-sequencing data, designed to work for any commercial or custom platform.
- Please see
documentation/*.md
for detailed info on custom recipes, pipeline details, etc.
- Clone the repository:
git clone https://github.com/mckellardw/slide_snake.git
- Build base environment w/
mamba
:
mamba env create --name slsn --file=envs/slsn.yml
mamba activate slsn
-
Run on the provided test data for Visium, SlideSeq/Seeker, and Patho-DBIT with the default sample sheet (see below for runtime options).
-
Build your own sample sheet (see `docs/1_sam)
- Install and ensure that the executable paths are functioning
- Build a sample sheet, containing details on each sample you would like to analyze. Be sure to add the path to the
SAMPLE_SHEET_PATH
variable inconfigs/config.yaml
.- Fill out the file paths for the input .fastqs (can pass multiple sequencing runs, illumina and/or ONT data), reference genomes, etc.
- Add the recipe(s) for each sample, separated by spaces.
- Comment out any unwanted output files in the target rule in
Snakefile
- Run the snakemake pipeline!
snakemake -k -p --use-conda --conda-frontend mamba -j 56
Make sure the slurm plugin is installed first! (it is included in envs/slsn.yml
)
snakemake -k -p --nt --use-conda --conda-frontend mamba --executor slurm --workflow-profile profiles/slurm -j 24
Or, my preferred way to run it in the background:
nohup snakemake -k -p --use-conda --conda-frontend mamba -j 36 > snake.log 2>&1 &
- Barcode download from Curio
- Extract DNB barcode whitelist for StereoSeq with ST_BarcodeMap
- Use the "mask format change" code mentioned in the
README
; example:
ST_BarcodeMap-0.0.1 --in B01807A3.barcodeToPos.h5 --out B01807A3.barcodeToPos.txt --action 3 --thread 24
- Use the "mask format change" code mentioned in the
- "Documentation" on
kallisto-lr
link