BigData-GL4/Vaccination-Rate-Spark

Vaccination-Rate-Spark

This project implements a Spark Streaming application that processes real-time measles vaccination rate data. The processing logic is defined in MeaslesDataConsumer.java. The application consumes records from Kafka, calculates total enrollment by state, and persists the results to the Cassandra table "city_enroll" in the "spark" keyspace. The application and its dependencies are packaged into a JAR file, which is then deployed to the Hadoop master node.
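
The per-state aggregation described above can be illustrated in plain Java, without Spark, Kafka, or Cassandra. This is only a sketch of the idea, not the code in MeaslesDataConsumer.java: the class name, the method name, the "state,school,enrollment" record layout, and the sample rows are all assumptions made for illustration, since the actual message format is not documented here.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class EnrollmentByState {

    // Sums enrollment per state for one batch of records.
    // Assumed (hypothetical) record layout: "state,school,enrollment".
    static Map<String, Long> totalEnrollmentByState(List<String> csvLines) {
        // TreeMap keeps states in alphabetical order for stable output.
        Map<String, Long> totals = new TreeMap<>();
        for (String line : csvLines) {
            String[] fields = line.split(",");
            if (fields.length < 3) {
                continue; // skip records with missing fields
            }
            String state = fields[0].trim();
            long enrollment;
            try {
                enrollment = Long.parseLong(fields[2].trim());
            } catch (NumberFormatException e) {
                continue; // skip records with a non-numeric enrollment
            }
            // Add this record's enrollment to the running total for its state.
            totals.merge(state, enrollment, Long::sum);
        }
        return totals;
    }

    public static void main(String[] args) {
        // Made-up sample batch; the real data arrives from Kafka.
        List<String> batch = List.of(
                "California,School A,120",
                "California,School B,80",
                "Ohio,School C,200",
                "bad line");
        System.out.println(totalEnrollmentByState(batch));
        // prints {California=200, Ohio=200}
    }
}
```

In the streaming application, the same grouping and summing would be applied to each micro-batch read from Kafka, and the resulting totals written to Cassandra instead of printed.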

Testing

The provided build.sh script builds the JAR and copies it to the Hadoop master node. To run it:

$ bash build.sh

Technologies Used

Apache Spark: Real-time data processing framework.
Apache Kafka: Distributed streaming platform.
Apache Cassandra: NoSQL database for high-availability storage.
