A Python API for loading the Newsela corpus, which consists of news articles written for children at various grade levels. It is loaded in "age-order" (AO). That is, articles are loaded in order of increasing grade-level. Its primary purpose in NLP, is to study how increasing langauge complexity influences language learning in artificial learning systems.
This repository does not contain any language data, which is proprietary. The data is loaded from a password-protected drive, and only users with access can load the data.
Developed using Python 3.7 on Ubuntu 18.04