Skip to content

NIU-Data-Science/arXiv-parser

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

arXiv Parser

(C) 2021 Mark M. Bailey, PhD

About

This set of scripts is useful for parsing arXiv using its API. The 'arxiv_scraper.py' script will save atom XML output from the API as a set of JSON files. The 'arxiv_parse.py' script will convert all the json files into one json file with the arxiv query metadata removed. This script is useful for collecting data for meta analysis of large bodies of scientific work.

Future Work

At some point, maybe I will build this into a library.

About

A set of scripts for parsing scientific articles from arXiv.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%