Description
-
There are some entries missing from the CSV and table.
-
Each library / entry in the CSV should have its own text file, even if some are duplicates; duplicates will be marked as such.
-
New stopword lists from more software packages need to be added (see "More Versions that need entries", #3).
-
Some entries need to be updated (Lucene, spaCy, and others).
-
A new `date` column will include the date of the last known commit or edit of the stoplist at the source.
-
Highly similar lists will have a list of the different words or an explanation of how they differ.
-
The README needs links to papers that reference this repo.
-
Add extra notes / cautions about using stopwords in general.
-
Add links to software-specific docs, not just source, e.g. https://www.elastic.co/guide/en/elasticsearch/guide/current/stopwords.html
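
For the item on highly similar lists, here is a minimal sketch of how the differing words could be generated. It assumes each stoplist is a plain text file with one word per line, which may not hold for every list in the repo. The function names (`load_stopwords`, `diff_stoplists`, `jaccard`) and the sample words are hypothetical, and Jaccard overlap is just one possible way to decide that two lists are "highly similar".

```python
def load_stopwords(text):
    """Parse one-word-per-line stoplist text into a set, skipping blank lines."""
    return {line.strip().lower() for line in text.splitlines() if line.strip()}


def diff_stoplists(a, b):
    """Return (words only in a, words only in b), each as a sorted list."""
    return sorted(a - b), sorted(b - a)


def jaccard(a, b):
    """Share of words common to both lists; 1.0 means identical sets."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


# Hypothetical sample data, not taken from any real stoplist.
list_a = load_stopwords("a\nan\nthe\nof\n")
list_b = load_stopwords("a\nthe\nof\nand\n")

only_a, only_b = diff_stoplists(list_a, list_b)
print("only in A:", only_a)  # ['an']
print("only in B:", only_b)  # ['and']
print("similarity:", jaccard(list_a, list_b))  # 0.6
```

In practice the text passed to `load_stopwords` would come from reading the per-library text files, and the two "only in" lists could be written into the notes for each pair of near-duplicate entries.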