Xenium Cell Segmentation Pipeline

End‑to‑end pipeline for segmenting spinal‑cord microscopy images using Cellpose.

Drag‑and‑drop GUI (Streamlit) • Headless CLI • Custom training workflow

Why this repo?

This repository provides a turn‑key workflow for turning raw histological slides of the Xenium Data (TIFF) into high‑quality, full‑resolution segmentation masks.
Huge TIFF slides (>40k × 40k px) do not fit into GPU memory. This repo scaled down, breaks them into tiles, runs a pre-trained Cellpose-SAM model in parallel, then stitches the probability masks back together.
The same code powers a drag‑and‑drop Streamlit interface for bench scientists and a fully scriptable CLI for batch jobs.

Quick Start (TL;DR)

# 1. Clone & install
git clone https://github.com/unikill066/xenium_cell_segmentation.git
cd xenium_cell_segmentation
pip install -r requirements.txt

# 2. Put your slide(s) in ./data/input
# 3. Download the model weights from huggingface/box and copy it to ./model/ model directory (see below)
# 4. Edit utils/constants.py, with model path and config path 
# 5. Run the pipeline
python main.py

Quick Demo

Installation

Python environment

Python ≥ 3.8
(Optional) CUDA‑enabled PyTorch for GPU speed‑ups Tested on CUDA 12.1 + RTX A5000.

python -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt

System packages

Dependency	Debian / Ubuntu	macOS (brew)
OpenCV	`sudo apt-get install python3-opencv`	`brew install opencv`
Tiff	`sudo apt-get install libtiff-dev`	`brew install libtiff`

Override any parameter at the CLI with --param=value. See python main.py --help.

Running the Pipeline

Command‑line

python main.py

Streamlit App

streamlit run streamlit_app.py

Drag a .tif (or folder) onto the Upload panel.
Tune tile size & model on the sidebar.
Click Run segmentation – outputs stream to data/output/.

The app supports one upload at a time; progress bars updates live. More on the steps that go into the pipeline can be found here:

Training on Your Own Data

Annotate nuclei or whole cells in QuPath
- Export objects as GeoJSON (File › Object data › Export as GeoJSON).
- Export the corresponding raw slide as an OME‑TIFF.
Generate training patches
```
python generate_training_data.py
```
Fine‑tune Cellpose (cellpose/train.py). Example:
```
python -m bin/train_cellpose_sam.py
```
Note: Make sure to update the constants in train_dir and model_name inbin/train_cellpose_sam.py.
Update MODEL_PATH and re‑run inference. For hyper‑parameter suggestions see the Cellpose docs.

Note: Download and copy this model to models directory.

Model weights are uploaded to Hugging Face: DRG Cellpose-SAM Model

Directory Structure

xenium_cell_segmentation/
├── bin/           # one‑shot utility scripts for training a new cellpose / cellpose-sam models
├── model/         # pre‑trained & fine‑tuned weights
├── utils/         # reusable helpers (tiling, stitching, viz, etc.)
├── notebooks/     # jupyter demos and exploratory analysis
├── docs/          # extra figures & extended docs
├── data/          # auto‑generated at runtime
│   ├── input/     # raw slides (.tif)
│   ├── tiles/     # png tiles fed to cellpose
│   ├── masks/     # per‑tile masks
│   └── output/    # stitched full‑res masks
└── streamlit_app.py

Additional Documentation:

Troubleshooting & FAQ

Symptom	Fix
OOM on GPU	Lower `--tile_px`, increase overlap.
No masks produced	Check contrast / staining; try `--net_avg=False` with Cellpose.
Streamlit app upload fails >200 MB	Increase `server.maxUploadSize` in `~/.streamlit/config.toml`.
GeoJSON mis‑aligned with slide	Verify that QuPath export uses Entire Image coordinate system.

Citing

If you use this code, please cite:

@article{Stringer_2025_Cellpose-SAM,
  title={Cellpose-SAM: how to train your own model},
  author={Stringer, Carsen and Pachitariu, Marius},
  journal={Nat. Methods},
  year={2025}
}

Cellpose-SAM paper: Pachitariu, M., Rariden, M., & Stringer, C. (2025). Cellpose-SAM: superhuman generalization for cellular segmentation. bioRxiv.

License

This project is licensed under the MIT License – see LICENSE for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Xenium Cell Segmentation Pipeline

Table of Contents

Why this repo?

Quick Start (TL;DR)

Quick Demo

Installation

Python environment

System packages

Running the Pipeline

Command‑line

Streamlit App

Training on Your Own Data

Directory Structure

Troubleshooting & FAQ

Citing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
bin		bin
demo		demo
docs		docs
model		model
notebooks		notebooks
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
detect.py		detect.py
detect_sam.py		detect_sam.py
generate_training_data.py		generate_training_data.py
main.py		main.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
streamlit_app.py		streamlit_app.py

License

unikill066/xenium_cell_segmentation

Folders and files

Latest commit

History

Repository files navigation

Xenium Cell Segmentation Pipeline

Table of Contents

Why this repo?

Quick Start (TL;DR)

Quick Demo

Installation

Python environment

System packages

Running the Pipeline

Command‑line

Streamlit App

Training on Your Own Data

Directory Structure

Troubleshooting & FAQ

Citing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages