Vaccination-Rate-Spark

This project implements a Spark Streaming application to process real-time Measles vaccination rate data. It configures the Spark processing logic within the MeaslesDataConsumer.java file. The application consumes data from Kafka, calculates total enrollments by state, and persists the results to a Cassandra database named "city_enroll" in the "spark" keyspace. Finally, a JAR file containing the application code and its dependencies is built and deployed to the Hadoop master node.

Testing

The provided build.sh script automates building the JAR and copying it to the Hadoop master node. Simply run:

$ bash build.sh

Technologies Used

Apache Spark: Real-time data processing framework.
Apache Kafka: Distributed streaming platform.
Cassandra: NoSQL database for high-availability storage.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
src/main/java/spark/kafka		src/main/java/spark/kafka
.gitignore		.gitignore
README.md		README.md
build.sh		build.sh
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Vaccination-Rate-Spark

Testing

Technologies Used

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

BigData-GL4/Vaccination-Rate-Spark

Folders and files

Latest commit

History

Repository files navigation

Vaccination-Rate-Spark

Testing

Technologies Used

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages