DemetersSon83/arXiv-parser

arXiv Parser

(C) 2021 Mark M. Bailey, PhD

About

This set of scripts parses arXiv through its API. The 'arxiv_scraper.py' script saves the Atom XML output from the API as a set of JSON files. The 'arxiv_parse.py' script merges all of those JSON files into a single JSON file with the arXiv query metadata removed. The scripts are useful for collecting data for meta-analysis of large bodies of scientific work.
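The two-step workflow described above can be sketched roughly as follows. This is an illustration only, not the code in 'arxiv_scraper.py' or 'arxiv_parse.py': the function names, the choice of fields kept per entry, and the per-page JSON layout are all assumptions, and only the Python standard library is used. The query endpoint and its 'search_query', 'start', and 'max_results' parameters are those of the public arXiv API.

```python
import json
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET
from pathlib import Path

ARXIV_API = "http://export.arxiv.org/api/query"
ATOM = "{http://www.w3.org/2005/Atom}"  # Atom XML namespace prefix


def build_query_url(search_query, start=0, max_results=100):
    """Build the URL for one page of arXiv API results."""
    params = {"search_query": search_query, "start": start, "max_results": max_results}
    return ARXIV_API + "?" + urllib.parse.urlencode(params)


def fetch_atom(url):
    """Download one page of Atom XML as bytes (needs network access)."""
    with urllib.request.urlopen(url) as response:
        return response.read()


def _clean(text):
    """Collapse line breaks and repeated spaces inside a text field."""
    return " ".join(text.split())


def atom_to_dict(xml_text):
    """Convert an Atom feed into feed-level metadata plus a list of entries."""
    root = ET.fromstring(xml_text)
    entries = []
    for entry in root.findall(ATOM + "entry"):
        entries.append({
            "id": entry.findtext(ATOM + "id", default=""),
            "title": _clean(entry.findtext(ATOM + "title", default="")),
            "summary": _clean(entry.findtext(ATOM + "summary", default="")),
            "published": entry.findtext(ATOM + "published", default=""),
            "authors": [a.findtext(ATOM + "name", default="")
                        for a in entry.findall(ATOM + "author")],
        })
    # "feed_title" stands in for the query metadata that the merge step drops.
    return {"feed_title": root.findtext(ATOM + "title", default=""), "entries": entries}


def save_page(xml_text, path):
    """Step 1 (hypothetical): write one page of results to a JSON file."""
    Path(path).write_text(json.dumps(atom_to_dict(xml_text), indent=2), encoding="utf-8")


def merge_pages(paths, out_path):
    """Step 2 (hypothetical): merge per-page files, keeping only the entries."""
    merged = []
    for path in sorted(paths):
        page = json.loads(Path(path).read_text(encoding="utf-8"))
        merged.extend(page["entries"])
    Path(out_path).write_text(json.dumps(merged, indent=2), encoding="utf-8")
    return merged
```

Under these assumptions, a run would call fetch_atom(build_query_url("all:electron", start=0, max_results=100)) for each page, pass each response to save_page, and then call merge_pages on the saved files. When fetching many pages, pausing between requests is advisable.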

Future Work

At some point, maybe I will build this into a library.
