The Weather Stations Monitoring project aims to efficiently process and analyze high-frequency data streams from distributed weather stations within the Internet of Things (IoT) context. This project was developed as part of the course "Designing Data Intensive Applications."
The system involves designing and implementing a robust architecture comprising data acquisition, processing, archiving, and indexing stages. It also integrates with the Open-Meteo API to enrich its data sources. The primary goal is a scalable, reliable weather monitoring system capable of handling diverse data types, ensuring data integrity, and enabling advanced analytics for weather forecasting and analysis.
- Integrated with the Open-Meteo API to simulate real-time weather data.
- The Weather Station mock fetches weather information such as temperature, humidity, and wind speed for specified locations.
- Data is periodically sent to the Central Station system for storage and retrieval, mimicking the behavior of actual weather stations (sketched below).
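A minimal sketch of the station mock under these assumptions: a local broker at `localhost:9092`, one message per second, and the raw JSON response forwarded as the message value (the real mock presumably parses it into a proper status message first); the coordinates and query parameters are placeholders:

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.apache.kafka.common.serialization.StringSerializer;

public class WeatherStationMock {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");
        props.put("key.serializer", StringSerializer.class.getName());
        props.put("value.serializer", StringSerializer.class.getName());

        HttpClient http = HttpClient.newHttpClient();
        // Open-Meteo forecast endpoint; coordinates and fields are placeholders.
        HttpRequest request = HttpRequest.newBuilder(URI.create(
                "https://api.open-meteo.com/v1/forecast"
                + "?latitude=30.04&longitude=31.24"
                + "&current_weather=true&hourly=relativehumidity_2m")).build();

        try (KafkaProducer<String, String> producer = new KafkaProducer<>(props)) {
            while (true) {
                String body = http.send(request, HttpResponse.BodyHandlers.ofString()).body();
                // Forward the reading to the "weather" topic, keyed by station ID.
                producer.send(new ProducerRecord<>("weather", "station-1", body));
                Thread.sleep(1_000); // one reading per second
            }
        }
    }
}
```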
- Used Bitnami images to create the Kafka and ZooKeeper containers.
- Weather Stations use the Kafka Producer API to publish readings and station status to the "weather" topic.
- The Central Station uses the Consumer API to consume messages from that topic (a consumer sketch follows this list).
- Used the Kafka Streams Processor API to process records arriving on the "weather" topic.
- Sent a special message to the "rain" topic, indicating that it is raining, whenever a reading's humidity level exceeds 70% (see the Processor API sketch below).
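On the consuming side, the Central Station's poll loop could look like the following sketch; the group ID `central-station` and the String deserializers are assumptions:

```java
import java.time.Duration;
import java.util.List;
import java.util.Properties;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.common.serialization.StringDeserializer;

public class CentralStationConsumer {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");
        props.put("group.id", "central-station");
        props.put("key.deserializer", StringDeserializer.class.getName());
        props.put("value.deserializer", StringDeserializer.class.getName());

        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props)) {
            consumer.subscribe(List.of("weather"));
            while (true) {
                ConsumerRecords<String, String> records = consumer.poll(Duration.ofMillis(500));
                for (ConsumerRecord<String, String> record : records) {
                    // Each message would be handed to the Bitcask store and the
                    // Parquet batching described in the Central Station bullets.
                    System.out.printf("station=%s value=%s%n", record.key(), record.value());
                }
            }
        }
    }
}
```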
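The rain-detection step might be wired up with the Processor API roughly as follows; the JSON shape of the message value (a numeric `humidity` field) and the regex-based extraction are illustrative stand-ins for the project's actual parsing:

```java
import java.util.Properties;
import org.apache.kafka.common.serialization.Serdes;
import org.apache.kafka.streams.KafkaStreams;
import org.apache.kafka.streams.StreamsConfig;
import org.apache.kafka.streams.Topology;
import org.apache.kafka.streams.processor.api.Processor;
import org.apache.kafka.streams.processor.api.ProcessorContext;
import org.apache.kafka.streams.processor.api.Record;

public class RainDetector {

    // Forwards a rain alert whenever a reading's humidity exceeds 70%.
    static class RainProcessor implements Processor<String, String, String, String> {
        private ProcessorContext<String, String> context;

        @Override
        public void init(ProcessorContext<String, String> context) {
            this.context = context;
        }

        @Override
        public void process(Record<String, String> record) {
            if (extractHumidity(record.value()) > 70) {
                context.forward(record.withValue(
                        "{\"station\":\"" + record.key() + "\",\"alert\":\"raining\"}"));
            }
        }

        // Placeholder extraction: finds the number after "humidity" in the JSON.
        private static int extractHumidity(String json) {
            java.util.regex.Matcher m = java.util.regex.Pattern
                    .compile("\"humidity\"\\s*:\\s*(\\d+)").matcher(json);
            return m.find() ? Integer.parseInt(m.group(1)) : 0;
        }
    }

    public static void main(String[] args) {
        // Source reads "weather"; the processor's output is sunk into "rain".
        Topology topology = new Topology();
        topology.addSource("Source", Serdes.String().deserializer(),
                        Serdes.String().deserializer(), "weather")
                .addProcessor("Rain", RainProcessor::new, "Source")
                .addSink("Sink", "rain", Serdes.String().serializer(),
                        Serdes.String().serializer(), "Rain");

        Properties props = new Properties();
        props.put(StreamsConfig.APPLICATION_ID_CONFIG, "rain-detector");
        props.put(StreamsConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        new KafkaStreams(topology, props).start();
    }
}
```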
- The Central Station is the core component responsible for processing and archiving weather data received from multiple weather stations.
- Utilizes Bitcask for key-value storage, archives data into Parquet files, and indexes it in Elasticsearch for further analysis.
- Manages the creation of Kafka topics for weather data and rain alerts.
- Manages data segment files, periodically compacts segments to save storage, and takes snapshots for quick recovery (a minimal Bitcask-style sketch follows this list).
- Converts stored weather data into Parquet format for efficient storage and querying (see the Parquet writer sketch below).
- Uses a watch script to monitor new Parquet files and triggers a JAR that ingests the data into Elasticsearch for near-real-time indexing and analysis (see the bulk-indexing sketch below).
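The store behind these bullets follows the Bitcask model: an append-only segment file plus an in-memory hash index (the "keydir") that maps each station ID to the offset of its latest record. A minimal sketch, assuming long station IDs and opaque byte-array values; compaction and snapshotting are left out:

```java
import java.io.File;
import java.io.IOException;
import java.io.RandomAccessFile;
import java.util.HashMap;
import java.util.Map;

public class MiniBitcask {
    private final RandomAccessFile file;                     // active append-only segment
    private final Map<Long, Long> keydir = new HashMap<>();  // stationId -> offset of latest record

    public MiniBitcask(File path) throws IOException {
        this.file = new RandomAccessFile(path, "rw");
        this.file.seek(this.file.length());
    }

    // Appends a record and points the keydir at it; superseded entries become
    // garbage that a background compaction pass later reclaims.
    public synchronized void put(long stationId, byte[] value) throws IOException {
        long offset = file.length();
        file.seek(offset);
        file.writeLong(stationId);
        file.writeInt(value.length);
        file.write(value);
        keydir.put(stationId, offset);
    }

    // Reads the latest value for a station with a single disk seek.
    public synchronized byte[] get(long stationId) throws IOException {
        Long offset = keydir.get(stationId);
        if (offset == null) return null;
        file.seek(offset + Long.BYTES); // skip the stored key
        byte[] value = new byte[file.readInt()];
        file.readFully(value);
        return value;
    }
}
```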
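Archiving a buffered batch to Parquet could be done with the `parquet-avro` library as in this sketch; the Avro schema and field names (`station_id`, `s_no`, `humidity`, `status_timestamp`) are assumptions based on the fields mentioned elsewhere in this README:

```java
import java.io.IOException;
import java.util.List;
import org.apache.avro.Schema;
import org.apache.avro.SchemaBuilder;
import org.apache.avro.generic.GenericRecord;
import org.apache.hadoop.fs.Path;
import org.apache.parquet.avro.AvroParquetWriter;
import org.apache.parquet.hadoop.ParquetWriter;

public class ParquetArchiver {
    // Illustrative schema: station ID, sequence number, humidity, timestamp.
    static final Schema SCHEMA = SchemaBuilder.record("WeatherStatus").fields()
            .requiredLong("station_id")
            .requiredLong("s_no")
            .requiredInt("humidity")
            .requiredLong("status_timestamp")
            .endRecord();

    // Writes one buffered batch of messages to its own Parquet file.
    public static void writeBatch(List<GenericRecord> batch, String fileName)
            throws IOException {
        try (ParquetWriter<GenericRecord> writer =
                     AvroParquetWriter.<GenericRecord>builder(new Path(fileName))
                             .withSchema(SCHEMA)
                             .build()) {
            for (GenericRecord record : batch) {
                writer.write(record);
            }
        }
    }
}
```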
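The uploader JAR could index a batch through Elasticsearch's `_bulk` endpoint; below is a sketch using plain HTTP against a local cluster, where the index name `weather` and the NDJSON document shape are assumptions:

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.util.List;

public class ElasticsearchUploader {
    // Sends documents to the _bulk API as newline-delimited JSON:
    // an action line followed by the document source, one pair per document.
    public static void bulkIndex(List<String> jsonDocs) throws Exception {
        StringBuilder body = new StringBuilder();
        for (String doc : jsonDocs) {
            body.append("{\"index\":{\"_index\":\"weather\"}}\n");
            body.append(doc).append('\n');
        }
        HttpRequest request = HttpRequest.newBuilder(
                        URI.create("http://localhost:9200/_bulk"))
                .header("Content-Type", "application/x-ndjson")
                .POST(HttpRequest.BodyPublishers.ofString(body.toString()))
                .build();
        HttpResponse<String> response = HttpClient.newHttpClient()
                .send(request, HttpResponse.BodyHandlers.ofString());
        if (response.statusCode() != 200) {
            throw new IllegalStateException("Bulk indexing failed: " + response.body());
        }
    }
}
```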
- Created Docker images for the weather stations, the Central Station, and the Elasticsearch uploader.
- Used Kubernetes for deploying components, including weather stations, Zookeeper, Kafka broker, Central Station, Elasticsearch uploader, Elasticsearch, and Kibana.
- Implemented shared storage for persisting Parquet files and Bitcask entries.
For a faster experience, the Central Station buffers only 100 records per station before archiving them into a Parquet file.
- Kibana formulas used to compute the dropped-messages percentage and the low-battery percentage:
  - Dropped messages percentage: `1 - count() / max(s_no)`
  - Low battery percentage: `count(kql='battery_status.keyword : "low"') / count()`