How to know the number of RAW files for all the public datasets in PRIDE. #57
ypriverol
started this conversation in
Statistics questions
Replies: 0 comments
Recently, someone asked PRIDE support how to get the number of RAW files from all PRIDE projects.
Here is the best solution:
Before anything, you have to install pridepy:
> pip install --upgrade pridepy
Then download all the file metadata from public datasets in PRIDE:
> pridepy stream-files-metadata -o all_pride_files_metadata.json
This will download a JSON file named all_pride_files_metadata.json to your local computer with all the file metadata in PRIDE. Then, you can run the following script: https://gist.github.com/ypriverol/f29f59f3b4fd6d7e6fb30ebe2f202c82 and pass the downloaded file as input. It will produce a final output file named project_raw_counts.txt.
Please let us know if this is useful for you.
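For readers who want to see what the counting step might look like, here is a minimal Python sketch. It is not the linked gist. It rests on several assumptions: the downloaded file is taken to be a single JSON array of file records, each record is taken to have a "projectAccessions" list and a "fileCategory" object whose "value" is "RAW" for raw files, and the tab-separated output is only an illustration. The function names are hypothetical. Check a few records in your own download before relying on any of this.

```python
import json
from collections import Counter


def load_records(path):
    """Load file metadata records from a JSON file.

    Assumption: the file holds one JSON array of objects.
    """
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def count_raw_files(records):
    """Count RAW files per project accession.

    Assumption: each record has "fileCategory": {"value": ...} and
    "projectAccessions": [...]. Records missing either field are skipped.
    """
    counts = Counter()
    for record in records:
        category = record.get("fileCategory") or {}
        if category.get("value") != "RAW":
            continue
        for accession in record.get("projectAccessions") or []:
            counts[accession] += 1
    return counts


def format_counts(counts):
    """Return one 'accession<TAB>count' line per project, sorted by accession.

    This layout is illustrative and may differ from the gist's output.
    """
    return [f"{accession}\t{counts[accession]}" for accession in sorted(counts)]
```

Calling count_raw_files(load_records("all_pride_files_metadata.json")) would then give a per-project count, and summing the values gives the total across PRIDE. The metadata file can be large, so loading it in one go may need a lot of memory. If the file turns out to be line-delimited rather than a single array, load_records would need to parse it line by line instead.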