This repository contains metadata tables and resources for the Human Pangenome Reference Consortium (HPRC) sequencing datasets. These files provide detailed information on sequencing platforms, coverage, and file organization for ongoing analyses and project tracking.
- Clone the repository:
git clone git@github.com:human-pangenomics/HPRC_metadata.git
- Access metadata files in the data/hprc-data-explorer-tables/ directory.
- Use these tables to filter, track, or analyze sequencing data.
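As a rough illustration of the filtering step above, here is a minimal sketch using only the Python standard library. The column names (`sample_id`, `platform`, `coverage`), the values, and the helper `filter_rows` are hypothetical placeholders, not names taken from this repository; check the header of the table you actually open in data/hprc-data-explorer-tables/ and adjust accordingly.

```python
import csv
import io


def filter_rows(table_file, column, value, delimiter=","):
    """Return the rows of a delimited table whose `column` equals `value`.

    `table_file` is any open text file object. Every field is read as a
    string, so `value` must be a string too.
    """
    reader = csv.DictReader(table_file, delimiter=delimiter)
    # Fail early with a clear message if the column is missing from the header.
    if reader.fieldnames is None or column not in reader.fieldnames:
        raise KeyError(f"column {column!r} not found in table header")
    return [row for row in reader if row[column] == value]


# Hypothetical in-memory table standing in for a real metadata file.
# The column names and values below are made up for illustration.
example = io.StringIO(
    "sample_id,platform,coverage\n"
    "sampleA,PLATFORM_X,30\n"
    "sampleB,PLATFORM_Y,45\n"
    "sampleC,PLATFORM_X,28\n"
)

hits = filter_rows(example, "platform", "PLATFORM_X")
print([row["sample_id"] for row in hits])
```

To run this against a real table, pass an opened file instead of the in-memory example, for instance `open(path, newline="")`, and set `delimiter="\t"` if the table is tab-separated.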
A folder-based, incomplete collection of R2 submissions. Most subfolders contain a readme.md that explains how many files were uploaded to SRA, lists any known metadata inconsistencies, and gives a validation line for running validate_and_combine_per_submission.py, which can be found in /utils (see below).
Contains:
- general data wrangling scripts and files. Install instructions are included in /utils/readme.md
- files relating to the AnVIL transfer; see /utils/AnVIL_transfer/readme.md for context
- files relating to the SRA transfer
We welcome contributions to improve metadata organization or add new resources. Submit a pull request or open an issue for discussion.