Skip to content

An E2E application that serves analytics about Shakespeare's works using inverted index DS

Notifications You must be signed in to change notification settings

rmnskb/inverted-idx-shakespeare

Repository files navigation

Shakespeare Works

The idea of this app is to create an E2E application that does following things:

  • Containerise the whole application, including Spark and Mongo
  • An ETL process to download Shakespeare's works and create a collection of the words that appeared there;
  • Upload the processed collection to MongoDB;

How do I make use of it

Ensure that you have Docker installed and spun up.
Then run following command in the main directory:

docker-compose up -d --build  # Build and run the containers in detached mode

docker exec -it spark-master bash  # Enter the Spark container

docker exec -it mongodb mongosh  # Enter the MongoDB container via Mongosh

About

An E2E application that serves analytics about Shakespeare's works using inverted index DS

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published