Hacettepe-University-CMP681-2020-Spring/ir-project-ir-term-project-talha-yilmaz

CMP681 - Neural Query Expansion Project

İsmail Talha Yılmaz

N18140956

ExtractTweets.py -> extraction of tweets from a JSON file

CleanTweets.py -> pre-processing of extracted tweets
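
The README does not list the exact pre-processing steps, so the following is only a minimal sketch of what tweet cleaning commonly involves: lowercasing, removing URLs and user mentions, keeping hashtag words, and dropping punctuation. The function name `clean_tweet` and the regular expressions are assumptions for illustration, not the contents of CleanTweets.py.

```python
import re
from typing import List

# Illustrative patterns (assumptions, not taken from CleanTweets.py).
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
NON_ALNUM_RE = re.compile(r"[^a-z0-9\s]")


def clean_tweet(text: str) -> List[str]:
    """Return a list of lowercase tokens for one raw tweet."""
    text = text.lower()
    text = URL_RE.sub(" ", text)        # drop links
    text = MENTION_RE.sub(" ", text)    # drop @user mentions
    text = text.replace("#", " ")       # keep the hashtag word, drop the '#'
    text = NON_ALNUM_RE.sub(" ", text)  # drop punctuation and other symbols
    tokens = text.split()
    # A leading "rt" marks a retweet and carries no topical content.
    if tokens and tokens[0] == "rt":
        tokens = tokens[1:]
    return tokens


if __name__ == "__main__":
    print(clean_tweet("RT @user: Check this out! http://t.co/abc #BBC World Service"))
```

Returning tokens rather than a string keeps the output directly usable by a term-based ranker such as BM25 in the next step.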

PrepareQueryTweetsBM25.py -> matching of tweets with their most relevant queries using BM25
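
As a rough illustration of BM25-based matching, the sketch below scores every tweet against every query with Okapi BM25 and assigns each tweet the query that scores highest. The function names, the parameter values `k1=1.2` and `b=0.75`, the always-positive IDF variant, and the topic ids in the usage example are all assumptions; the actual script may pair tweets and queries differently.

```python
import math
from collections import Counter
from typing import Dict, List, Optional


def bm25_scores(query: List[str], docs: List[List[str]],
                k1: float = 1.2, b: float = 0.75) -> List[float]:
    """Okapi BM25 score of `query` against each tokenized document in `docs`."""
    n_docs = len(docs)
    avgdl = sum(len(d) for d in docs) / n_docs
    df = Counter()                      # document frequency per term
    for d in docs:
        df.update(set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in set(query):
            if term not in tf:
                continue
            # IDF variant with +1 inside the log so it never goes negative.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] * (k1 + 1) / (tf[term] + k1 * (1 - b + b * len(d) / avgdl))
            score += idf * norm
        scores.append(score)
    return scores


def best_query_per_tweet(queries: Dict[str, List[str]],
                         tweets: List[List[str]]) -> List[Optional[str]]:
    """For each tweet, the id of the highest-scoring query (None if no term overlaps)."""
    per_query = {qid: bm25_scores(q, tweets) for qid, q in queries.items()}
    result = []
    for i in range(len(tweets)):
        qid, score = max(((q, s[i]) for q, s in per_query.items()), key=lambda p: p[1])
        result.append(qid if score > 0 else None)
    return result


if __name__ == "__main__":
    tweets = [["bbc", "world", "service", "cuts"],
              ["egypt", "protest", "cairo"],
              ["good", "morning"]]
    queries = {"Q1": ["bbc", "world", "service", "staff", "cuts"],  # hypothetical ids
               "Q2": ["egypt", "protest"]}
    print(best_query_per_tweet(queries, tweets))
```

Tweets that share no term with any query get `None`, so they can be filtered out before building tweet/query pairs for the seq2seq model.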

EncoderDecoderModel.py -> seq2seq architecture and predictions

ExpandQueriesByEmbeddingSimilarity.py -> finding the predicted query words most similar to the tweet by embedding similarity and using them for query expansion
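
One plausible reading of this step is sketched below: the words predicted by the seq2seq model are ranked by cosine similarity to the mean embedding of the tweet, and the top few are appended to the query. The helper names, the `top_k` parameter, and the tiny two-dimensional vectors are assumptions for illustration; in the project the vectors would come from the pre-trained Google News embeddings.

```python
import math
from typing import Dict, List, Optional, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity; 0.0 if either vector has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def mean_vector(tokens: List[str],
                emb: Dict[str, Sequence[float]]) -> Optional[List[float]]:
    """Average embedding of the tokens that have a vector, or None if none do."""
    vecs = [emb[t] for t in tokens if t in emb]
    if not vecs:
        return None
    dim = len(vecs[0])
    return [sum(v[i] for v in vecs) / len(vecs) for i in range(dim)]


def expand_query(query: List[str], predicted: List[str], tweet: List[str],
                 emb: Dict[str, Sequence[float]], top_k: int = 2) -> List[str]:
    """Append the `top_k` predicted words closest to the tweet centroid."""
    centroid = mean_vector(tweet, emb)
    if centroid is None:
        return list(query)
    # De-duplicate while keeping order; skip out-of-vocabulary and existing terms.
    candidates = [w for w in dict.fromkeys(predicted) if w in emb and w not in query]
    ranked = sorted(candidates, key=lambda w: cosine(emb[w], centroid), reverse=True)
    return list(query) + ranked[:top_k]


if __name__ == "__main__":
    # Toy 2-d vectors standing in for real pre-trained embeddings (assumption).
    emb = {"bbc": [1.0, 0.0], "radio": [0.9, 0.1], "news": [0.8, 0.2], "pizza": [0.0, 1.0]}
    print(expand_query(["bbc"], ["pizza", "news", "radio", "unknownword"],
                       ["bbc", "radio"], emb))
```

Ranking against the tweet centroid rather than against individual tweet words keeps the comparison to one similarity per candidate, at the cost of blurring tweets that cover several topics.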

Evaluate.py -> evaluation of the overall system
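
The README does not say which metrics Evaluate.py reports. As a hedged illustration only, the sketch below computes two measures that are common in retrieval evaluation, precision at k and average precision, from a ranked list of tweet ids and a set of relevant ids. The function names and the example ids are hypothetical.

```python
from typing import List, Set


def precision_at_k(ranked: List[str], relevant: Set[str], k: int) -> float:
    """Fraction of the top-k ranked ids that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    return sum(1 for doc_id in ranked[:k] if doc_id in relevant) / k


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Mean of the precision values at each rank where a relevant id appears."""
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant)


if __name__ == "__main__":
    ranked = ["t3", "t1", "t7", "t2"]   # hypothetical system output for one query
    relevant = {"t1", "t2"}             # hypothetical relevance judgments
    print(precision_at_k(ranked, relevant, 2))
    print(average_precision(ranked, relevant))
```

Running both metrics once with the original queries and once with the expanded queries gives a direct before/after comparison of the expansion step.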

External Sources:

Pre-trained Google News embeddings are used

The TREC 2011 Microblog dataset is used

About

ir-project-ir-term-project-talha-yilmaz created by GitHub Classroom
