Skip to content

Added GEO Matrix file support for downloading and parsing expression data #232

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 4 commits into
base: develop
Choose a base branch
from

Conversation

priyankeshh
Copy link

Closes #229

This PR adds support for GEO Matrix files, which contain processed expression data from NCBI's Gene Expression Omnibus.

Features Added

  • New GEOMatrix class for downloading and parsing matrix files.
  • Extended GEOweb class with matrix-specific methods.
  • New CLI command: geo-matrix for matrix operations.
  • Added --matrix-only option to the existing download command.
  • Comprehensive documentation and tests to support the new functionality.

Example Usage

CLI

pysradb geo-matrix --accession GSE234190 --to-tsv

Python API

from pysradb import GEOMatrix

matrix = GEOMatrix("GSE234190")
df = matrix.to_dataframe()

- Added GEOMatrix class for handling GEO Matrix files, including methods for downloading, parsing, and converting to DataFrame and TSV.
- Extended GEOweb class to identify and download matrix files from GEO FTP.
- Introduced new CLI command `geo-matrix` for downloading and processing GEO Matrix files.
- Updated existing `download` command to support downloading only matrix files with `--matrix-only` flag.
- Enhanced README and documentation to include GEO Matrix file support and usage examples.
- Created tests for GEOMatrix functionality, including link retrieval, file downloading, parsing, DataFrame conversion, and TSV export.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[C4GT Community]: Support full GEO based downloads
1 participant