Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
README.md		README.md

Repository files navigation

Hello I'm Arjun

Aspiring Data Engineer

🙋‍♂️ About Me

👨‍💻 I’m currently working as a Database Administrator, building strong foundations in data management and reliability
🌱 Transitioning into Data Engineering by designing end‑to‑end batch and streaming pipelines
🛠️ Passionate about building scalable, reliable data pipelines that turn raw data into actionable insights
👯 Open to collaborating on Data Engineering & Open Source projects
👨‍💻 Explore my work here: My Portfolio
📫 Reach me at arjunmpec101@gmail.com
⚡ Fun fact: I debug pipelines the way I play games — with persistence and strategy

🚀 Tech Stack

📂 Featured Projects

🗄️ YouTube Data Engineering Pipeline (Batch Processing)
End‑to‑end batch ETL pipeline implementing the Medallion Architecture (Bronze → Silver → Gold).
- Orchestrated with Apache Airflow (3.x)
- Transformations with Apache Spark
- Data lake layers on local filesystem (Bronze/Silver/Gold)
- Serving layer in Postgres (analytics‑ready tables)
- Interactive Streamlit + Altair dashboard via SQLAlchemy
- Ingests raw YouTube trending data (CSV/JSON), cleans, enriches, and computes derived metrics for BI
📊 StockPulse (Streaming Pipeline)
Real‑time streaming pipeline simulating stock ticks and processing them end‑to‑end.
- Ingestion via Kafka producer publishing to stock_ticks topic
- Processing with Spark Structured Streaming (schema enforcement + derived metrics)
- Dual sinks: Postgres (serving layer) + Parquet (partitioned by index/date)
- Interactive Streamlit + Altair dashboard for real‑time visualization
- Fully orchestrated with Apache Airflow

📊 My GitHub Stats

Arjun M's Github Stats

Arjun M's Top Languages

Note: Top languages is only a metric of the languages my public code consists of and doesn't reflect experience or skill level.

Arjun's Graph

🌐 Connect with Me

❤ Views and Followers

About

No description, website, or topics provided.

Report repository

Releases

No releases published

Packages

No packages published