- [Stanford, Deeply Moving: Deep Learning for Sentiment Analysis] (https://nlp.stanford.edu/sentiment/)
- [John Hopkins University, Multi-Domain Sentiment Dataset] (http://www.cs.jhu.edu/~mdredze/datasets/sentiment/)
- [Stanford, Large Movie Review Dataset] (https://ai.stanford.edu/~amaas/data/sentiment/)
- [Sentiment140] (http://help.sentiment140.com/for-students)
- [News Category Dataset] (https://www.kaggle.com/rmisra/news-category-dataset)
- [AG's corpus of news articles] (http://groups.di.unipi.it/~gulli/AG_corpus_of_news_articles.html)
- [Other] (https://paperswithcode.com/datasets?mod=texts&task=text-classification&page=1)
-
data : folder storages all of training and test data
-
docs : folder storages all publications and other documents
-
drivers : folder storages all source code to controling
-
drivers/loaders : folder storages all source code to data loading and preprocessing
-
drivers/models : folder storages all source code wihich define the machine learning models for evaluation
-
drivers/tokenizers : folder storages all source code wihich define the tokenizers for evaluation
-
encodes : folder storages all pre-encoded training and test data
-
vocabs : folder storages all vocabs what was created by pre trained tokenizers
-
app.py : is the main point
-
evalt.py : contains the EvalT class what describes the main evaluation progress