GitHub - Hacettepe-University-CMP681-2020-Spring/ir-project-ir-term-project-talha-yilmaz: ir-project-ir-term-project-talha-yilmaz created by GitHub Classroom

CMP681 - Neural Query Expansion Project

İsmail Talha Yılmaz

N18140956

ExtractTweets.py -> extraction of tweets from JSON file

CleanTweets.py -> pre-processing of extracted tweets

PrepareQueryTweetsBM25.py -> matching tweets with most relevant queries

EncoderDecoderModel.py -> seq2seq architecture and predictions

ExpandQueriesByEmbeddingSimilarity.py -> calculate similar predicted query words to tweet and use them for expansion

Evaluate.py -> Evaluation of overall system

External Sources:

Google News pre-trained embeddings were used

TREC 2011 Microblog dataset is used

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
NeuralQueryExpansionProject		NeuralQueryExpansionProject
README.md		README.md
TalhaYilmaz_CMP681_FinalReport_N18140956.pdf		TalhaYilmaz_CMP681_FinalReport_N18140956.pdf

Provide feedback