Arjun-M-101/Arjun-M-101

Hello, I'm Arjun

Aspiring Data Engineer


🙋‍♂️ About Me

  • 👨‍💻 I’m currently working as a Database Administrator, building strong foundations in data management and reliability
  • 🌱 Transitioning into Data Engineering by designing end‑to‑end batch and streaming pipelines
  • 🛠️ Passionate about building scalable, reliable data pipelines that turn raw data into actionable insights
  • 👯 Open to collaborating on Data Engineering & Open Source projects
  • 👨‍💻 Explore my work here: My Portfolio
  • 📫 Reach me at arjunmpec101@gmail.com
  • ⚡ Fun fact: I debug pipelines the way I play games, with persistence and strategy

🚀 Tech Stack

pandas Spark Kafka Airflow MySQL Streamlit HTML CSS JavaScript Bootstrap Postman


📂 Featured Projects

  • 🗄️ YouTube Data Engineering Pipeline (Batch Processing)
    End‑to‑end batch ETL pipeline implementing the Medallion Architecture (Bronze → Silver → Gold).

    • Orchestrated with Apache Airflow (3.x)
    • Transformations with Apache Spark
    • Data lake layers on local filesystem (Bronze/Silver/Gold)
    • Serving layer in Postgres (analytics‑ready tables)
    • Interactive Streamlit + Altair dashboard via SQLAlchemy
    • Ingests raw YouTube trending data (CSV/JSON), cleans, enriches, and computes derived metrics for BI
  • 📊 StockPulse (Streaming Pipeline)
    Real‑time streaming pipeline simulating stock ticks and processing them end‑to‑end.

    • Ingestion via a Kafka producer publishing to the stock_ticks topic
    • Processing with Spark Structured Streaming (schema enforcement + derived metrics)
    • Dual sinks: Postgres (serving layer) + Parquet (partitioned by index/date)
    • Interactive Streamlit + Altair dashboard for real‑time visualization
    • Fully orchestrated with Apache Airflow
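
The "schema enforcement + derived metrics" step in StockPulse can be illustrated with a minimal sketch. This is not the project's code: it uses plain Python instead of Spark Structured Streaming, and the `Tick` fields, the `parse_tick` and `enrich` helpers, the `NIFTY` symbol, and the percent-change metric are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, Optional


@dataclass(frozen=True)
class Tick:
    """Hypothetical tick schema; the real stock_ticks fields may differ."""
    symbol: str
    price: float
    volume: int


def parse_tick(raw: dict) -> Optional[Tick]:
    """Schema enforcement: return a typed Tick, or None if the record is malformed."""
    try:
        tick = Tick(str(raw["symbol"]), float(raw["price"]), int(raw["volume"]))
    except (KeyError, TypeError, ValueError):
        return None
    # Reject values that parse but make no sense for a tick.
    return tick if tick.price > 0 and tick.volume >= 0 else None


def enrich(raw_ticks: Iterable[dict]) -> Iterator[dict]:
    """Drop malformed records and add a per-symbol percent change (assumed metric)."""
    last_price = {}  # symbol -> most recent valid price
    for raw in raw_ticks:
        tick = parse_tick(raw)
        if tick is None:
            continue
        prev = last_price.get(tick.symbol)
        pct = None if prev is None else round((tick.price - prev) / prev * 100, 2)
        last_price[tick.symbol] = tick.price
        yield {"symbol": tick.symbol, "price": tick.price,
               "volume": tick.volume, "pct_change": pct}


sample = [
    {"symbol": "NIFTY", "price": "100.0", "volume": 10},
    {"symbol": "NIFTY", "price": "bad", "volume": 5},  # malformed price, dropped
    {"symbol": "NIFTY", "price": 102.0, "volume": 7},
]
rows = list(enrich(sample))
```

In a streaming job the same two stages would typically run per micro-batch, with the enriched rows then written to both sinks.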

📊 My GitHub Stats

Arjun's streak

Arjun M's GitHub Stats Arjun M's Top Languages

Note: Top languages is only a metric of the languages my public code consists of and doesn't reflect experience or skill level.


Arjun's Graph

🌐 Connect with Me


❤️ Views and Followers

GitHub Badge
