Skip to content

SPoTLIghT, a computational framework to extract interpretable features of the spatial distribution of multiple cell types by combining unannotated pathology images with bulk transcriptomics.

License

Notifications You must be signed in to change notification settings

SysBioOncology/SPoTLIghT

Repository files navigation

SysBioOncology/SPoTLIghT

Nextflowrun with singularity

Introduction

Our pipeline, SPoTLIghT, as presented in our paper, can be used to derive spatial graph-based interpretable features from H&E slides and is available as a Nextflow pipeline.

The pipeline comprises the following modules:

  1. Extracting histopathological features
  2. Deconvolution of bulkRNAseq data
  3. Building a multi-task cell type model to predict cell type abundances on a tile-level
  4. Predicting tile-level cell type abundances using the multi-task models
  5. Compute spatial features using the tile-level cell type abundances

The training of the cell type models have been perfomed using fresh frozen (FF) slides for the TCGA-SKCM dataset (melanoma) as described in the paper. The trained models are provided here.

See also the figures below.

Workflow part 1 Workflow part 2

Usage

Software

  • Docker version 28.0.4, build b8034c0
  • Apptainer version 1.0.
  • Nextflow version 24.10.5 build 5935

These were the versions used for testing the pipeline.

Note

If you are new to Nextflow and nf-core, please refer to this page on how to set-up Nextflow.

  1. Create apptainer/singularity containers from Docker images
# Easiest route (internet access needed)
apptainer build spotlight.sif docker://joank23/spotlight
apptainer build immunedeconvr.sif docker://joank23/immunedeconvr

# Alternative route
# Usecase: if working on an HPC that does not have docker & internet access for building the image

# A) on you local desktop
# 1. save docker as tar or tar.gz (compressed)
docker pull joank23/spotlight
docker pull joank23/immunedeconvr
docker save joank23/spotlight > spotlight.tar
docker save joank23/immunedeconvr > immunedeconvr.tar

# 2. Move to HPC (optionally)
# 3. Build apptainer images (.sif) from docker (.tar) 
apptainer build spotlight.sif docker-archive:spotlight.tar
apptainer build immunedeconvr.sif docker-archive:immunedeconvr.tar
  1. Download retrained models to extract the histopathological features, available from Fu et al., Nat Cancer, 2020
    1. Download from (Retrained_Inception_v4)
    2. Unzip the folder
    3. Extract the files to a folder called Retrained_Inception_v4.

IMPORTANT: Please rename your images file names, so they only include "-", to follow the same sample coding used by the TCGA.

Now, you can run the pipeline using:

Since the SKCM multi-task models are provided, the 'example_workflow_params.yml' can be used to predict the cell type abundances for other H&E images and optionally to compute the spatial features.

For more information see examples.

nextflow run SysBioOncology/SPoTLIghT \
   -profile <docker/singularity/.../institute> \
   -params-file assets/example_workflow_params.yml \
   --outdir <OUTDIR>

Warning

Please provide pipeline parameters via the CLI or Nextflow -params-file option. Custom config files including those provided by the -c Nextflow option can be used to provide any configuration except for parameters; see docs.

Credits

SysBioOncology/SPoTLIghT was originally written by Joan Kant, Óscar Lapuente-Santana & Federica Eduati.

Contributions and Support

If you would like to contribute to this pipeline, please see the contributing guidelines.

Citations

Lapuente-Santana, Ó., Kant, J. & Eduati, F. Integrating histopathology and transcriptomics for spatial tumor microenvironment profiling in a melanoma case study. npj Precis. Onc. 8, 254 (2024). https://doi.org/10.1038/s41698-024-00749-w

An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.

This pipeline uses code and infrastructure developed and maintained by the nf-core community, reused here under the MIT license.

The nf-core framework for community-curated bioinformatics pipelines.

Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.

Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x.

About

SPoTLIghT, a computational framework to extract interpretable features of the spatial distribution of multiple cell types by combining unannotated pathology images with bulk transcriptomics.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 3

  •  
  •  
  •