This repository contains the computational analysis code for the manuscript "Integrated Cross-Disease Atlas of Human And Mouse Astrocytes Reveals Heterogeneity and Conservation of Astrocyte Subtypes in Neurodegenerationβ¬" currently available as a preprint on bioRxiv.
This study presents a comprehensive meta-analysis of astrocyte populations across multiple neurodegenerative diseases using single-cell RNA sequencing data. We identified disease-associated astrocyte (DAA) subtypes and characterized their molecular signatures, spatial distributions, and functional roles in Alzheimer's Disease (AD), Multiple Sclerosis (MS), and Parkinson's Disease (PD).
- Identification of distinct disease-associated astrocyte subtypes (DAA1 and DAA2)
- Cross-species validation using mouse models and human post-mortem tissue
- Spatial analysis showing DAA proximity to amyloid plaques in mice
- Analysis showing human DAA enrichment in MS lesions
- Functional pathway analysis revealing disease-specific astrocyte responses
- Preprint: bioRxiv
- Static Data Exploration: http://research-pub.gene.com/AstroAtlas/
- Interactive Shiny Applications:
βββ AnalysisScripts
βββ FigureScripts
| βββ Figure1.R # Mouse disease volcano plots and pathway analysis
| βββ Figure2.R # Mouse astrocyte clustering and marker analysis
| βββ Figure3.R # Mouse DAA1 vs DAA2 comparison and LPS validation
| βββ Figure4.R # Mouse spatial analysis of astrocyte subtypes
| βββ Figure5.R # Human astrocyte clustering analysis
| βββ Figure6.R # Human disease comparisons (AD/MS/PD)
| βββ Figure7.R # Human cluster 2 subanalysis
| βββ Figure8.R
β βββSupplemental
βββ README.md # This file
βββ shiny # web applications to explore data
βββ utils/
βββ scHelpers.R # Custom helper functions
This analysis requires R version 4.0 or higher. Install the required packages using:
# Bioconductor packages
if (!require("BiocManager", quietly = TRUE))
install.packages("BiocManager")
BiocManager::install(c(
"Seurat",
"SingleCellExperiment",
"scran.chan",
"scuttle",
"edgeR",
"clusterProfiler",
"org.Mm.eg.db",
"org.Hs.eg.db",
"scDblFinder",
"batchelor",
"scater"
))
# CRAN packages
install.packages(c(
"ggplot2",
"wesanderson",
"tidyverse",
"ggdendro",
"cowplot",
"ggtree",
"patchwork",
"scales",
"ggrepel",
"viridis",
"RColorBrewer",
"gridExtra",
"ggpubr",
"rstatix",
"networkD3",
"ggrastr",
"extrafont"
))
# Additional packages
remotes::install_github("GuangchuangYu/GOSemSim")
devtools::install_github("YuLab-SMU/clusterProfiler")
The analysis depends on custom helper functions in scHelpers.R
. Ensure this file is sourced at the beginning of each script:
source("./scHelpers.R")
The processed count datasets can be found on Zenodo
Use the associated counts and metadata to create SCE/Seurat objects. Name them accordingly.
Due to file size limitations, the raw data files are not included in this repository. To reproduce the analysis:
- Download counts and metadata from Zenodo
- Create Seurat objects with counts and metadata. and save.
- Update file paths/names in scripts if using a different directory structure
Each figure script is self-contained and generates specific analyses:
- Figure 1: Mouse disease-specific differential expression analysis
- Figure 2: Mouse astrocyte subtype identification and characterization
- Figure 3: Mouse DAA1 vs DAA2 comparison and experimental validation
- Figure 4: Spatial relationship analysis
- Figure 5: Human astrocyte clustering
- Figure 6: Cross-disease human analysis
- Figure 7: Detailed human subcluster analysis
- Figure 8: Cross species comparisons
# Set working directory
setwd("path/to/repository")
# Source helper functions
source("scHelpers.R")
# Run individual figure scripts
source("Figure1.R")
source("Figure2.R")
# ... continue for other figures
- Seurat v4 for single-cell analysis
- Harmony for batch correction in human data
- SCVI coordinates for mouse data visualization
- edgeR for pseudo-bulk differential expression
- Meta-analysis approaches for cross-study comparisons
- Bonferroni correction for multiple testing
- clusterProfiler for GO and KEGG enrichment
- Gene Set Enrichment Analysis (GSEA)
- Comparative pathway analysis across diseases
- Wilcoxon rank-sum tests for group comparisons
- Kruskal-Wallis tests for multi-group analysis
- Centered Log-Ratio (CLR) transformation for compositional data
The analysis generates publication-ready figures using: - ggplot2 for most visualizations - Cairo PDF for high-quality vector graphics - Custom color palettes for consistent theming - ggrepel for intelligent label placement
- Operating System: Linux/macOS/Windows
- RAM: Minimum 16GB recommended (32GB+ for large datasets)
- Storage: ~50GB for all datasets
- R Version: 4.0+
- Additional: Cairo graphics library for PDF generation
This repository contains the analysis code for a specific publication. For questions about the analysis or requests for collaboration:
- Open an issue for technical questions
- Contact the corresponding author for data access requests
- See the manuscript for detailed methodology
If you use this code or data, please cite:
@article {Lucas2025.02.12.637903,
author = {Lucas, Tawaun A. and Novikova, Gloriia and Rao, Sadhna and Wang, Yuanyuan and Laufer, Benjamin I. and Pandey, Shristi. and Webb, Michelle. G. and Jorstad, Nikolas. and Friedman, Brad A. and Hanson, Jesse E. and Kaminker, Joshua S.},
title = {Integrated Cross-Disease Atlas of Human And Mouse Astrocytes Reveals Heterogeneity and Conservation of Astrocyte Subtypes in Neurodegeneration},
elocation-id = {2025.02.12.637903},
year = {2025},
doi = {10.1101/2025.02.12.637903}
For questions regarding this analysis: - Corresponding Authors: kaminker.josh@gene.com Code Issues: Open a GitHub issue - Data Access: Contact corresponding author