Skip to content

czbiohub-sf/protdata

Repository files navigation

protdata

Test PyPI - Version docs online

image

Proteomics data loaders for the AnnData format.

This package provides loader functions to import proteomics data (e.g., MaxQuant) into the AnnData structure for downstream analysis and easy integration with single-cell and multi-omics workflows.

Features

  • Multiple formats: Support for MaxQuant, FragPipe, DIA-NN, and mzTab files
  • Reads metadata: Automatically extracts and organizes sample and protein metadata

Installation

Protdata has minimal dependencies listed in pyproject.toml

To install the latest release from PyPI, run:

pip install protdata

Or install from source:

git clone https://github.com/czbiohub-sf/protdata.git
cd protdata
pip install -e . # or make setup-develop for developers

Usage Example

MaxQuant Import

You can download an example proteinGroups file here

import protdata

adata = load_maxquant_to_anndata("/path/to/proteinGroups.txt")
print(adata)

DIA-NN Import

You can download an example DIA-NN report file here

from protdata.io import read_diann

adata = read_diann("/path/to/report.pg_matrix.tsv")
print(adata)

FragPipe Import

You can download an example FragPipe output file here

from protdata.io import read_fragpipe

adata = read_fragpipe("/path/to/combined_protein.tsv")
print(adata)

mzTab Import

You can download an example mzTab file here

from protdata.io import read_mztab

adata = read_mztab("/path/to/SILAC_SQ.mzTab")
print(adata)

Authors

protdata is created and maintained by the Computational Biology Platform at the Chan Zuckerberg Biohub San Francisco.

To get in touch please use the GihHub issues page.

Contributing

If you want to contribute to protdata, please read the Contribution Guide

Changelog

See Release Notes

About

Dataloader for common proteomics search engine outputs into AnnData format

Topics

Resources

License

Stars

Watchers

Forks

Contributors 2

  •  
  •