Merge Development Branch into Main: Complete Data Engineering Pipeline Implementation #29

jdomdev · 2025-06-22T21:38:02Z

This pull request merges the latest changes from the dev branch into main, delivering the full implementation of the data engineering pipeline for HR Pro. It includes robust ETL processes, integration with Kafka, MongoDB, and Supabase (PostgreSQL), comprehensive monitoring with Prometheus and Grafana, improved logging, unit tests, and Dockerized deployment.

All project objectives and delivery requirements have been met, ensuring a scalable, maintainable, and production-ready data platform.

Feature/api: This API provides endpoints for retrieving and filtering professional and demographic data from a Supabase-managed database. Built with FastAPI, it supports searching for people by job title and city, and returns results as JSON. It is designed to serve data to web interfaces, so that interactive data-driven applications can be built on top of it.
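The filtering behaviour described for Feature/api can be illustrated with a minimal, framework-free sketch. It is not the code from this PR: the function name `search_people`, the field names `job_title` and `city`, the case-insensitive substring matching, and the sample rows are all assumptions made for illustration. In the real service the rows would come from Supabase and the function would sit behind a FastAPI route.

```python
import json
from typing import Optional

# Hypothetical sample rows; the real data lives in Supabase tables.
PEOPLE = [
    {"name": "Ana", "job_title": "Data Engineer", "city": "Madrid"},
    {"name": "Luis", "job_title": "Data Analyst", "city": "Madrid"},
    {"name": "Marta", "job_title": "Data Engineer", "city": "Sevilla"},
]


def search_people(rows, job_title: Optional[str] = None, city: Optional[str] = None):
    """Return the rows that match every filter that was supplied.

    Matching is a case-insensitive substring test (an assumption);
    a filter left as None is ignored.
    """
    def matches(value, wanted):
        return wanted is None or wanted.lower() in str(value).lower()

    return [
        row for row in rows
        if matches(row.get("job_title", ""), job_title)
        and matches(row.get("city", ""), city)
    ]


# The result is a list of plain dicts, so it serializes directly to JSON.
print(json.dumps(search_people(PEOPLE, job_title="engineer", city="madrid"), indent=2))
```

Keeping the filter as a pure function over plain dictionaries makes it easy to unit-test without a database or a running server. The route handler would then only need to pass its query parameters through.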

… 472 permissions
- Fixed consumer.py and storage_mongo.py failures due to merge issues.
- Added Dockerfile to ensure Grafana uses correct folder ownership (UID 472).
- docker-compose updated with build context for Grafana.

fintihlupik and others added 30 commits June 4, 2025 13:27

feat: add simple dockerized consumer and mongo

c942d8f

chore(structure): initialize project folder structure in dev branch.

a6e7317

feat: save raw data per collection in mongo

32821d5

fix: add logic to avoid duplicates in mongo collections insertion

97ef793

feat: creating a new branch

fd2cafd

Merge branch 'feature/mongo' of https://github.com/Bootcamp-IA-P4/project-ix-data-engineering-team-2 into new/branch

7ff6128

Merge branch 'dev' of https://github.com/Bootcamp-IA-P4/project-ix-data-engineering-team-2 into feature/mongo

99cb232

fix: add dev folder structure, changed compose.yml

db31f2e

feat: Write on .log for dates

096787a

feat:filling the task

8cb1222

Merge pull request #19 from Bootcamp-IA-P4/feature/log

aa40711

feat: Write on .log for dates

feat: adding processing of data

894b1be

Merge branch 'new/branch' of https://github.com/Bootcamp-IA-P4/project-ix-data-engineering-team-2 into new/branch

470c760

resolving conflicts

8cd97ae

feat:processing and cleaning data

3d3e492

feat:processing data

4ce53b6

Merge pull request #20 from Bootcamp-IA-P4/new/branch

9931795

New/branch with cleaning and transformation data which came from kafka server.

feat: WIP (Work In Progress) ETL from Kafka to Supabase (PostgreSQL).

017da5c

feat: WIP (Work In Progress) ETL insertions in remote PostgreSQL (Supabase).

0182e1c

feat: start Mongo→Supabase ETL after dropping Kafka→Supabase streaming approach.

14fd114

build: dockerize kafka consumer, mongoDB, and mongo-to-postgres ETL service.

f793fcb

chore: organize test data files and configure mongo user for mongo-express.

5c00dc9

feat: WIP (Work In Progress) partial data load to Supabase (3 of 5 tables) and mongo user setup.

chore: remove unnecessary .gitkeep files from non-empty directories.

014da06

feat: complete data insertion into all 5 Supabase tables from MongoDB.

6740d88

Merge pull request #22 from Bootcamp-IA-P4/feature/kafka-to-supabase

f4966c8

Feature/kafka to supabase approved

feat:adding testing to etl_utils.py

edc8b2e

feat: WIP redis start config

21ddcd1

feat: add redis to avoid duplicated messages

6b0bded

Merge pull request #24 from Bootcamp-IA-P4/testing

8e81629

feat:adding testing to etl_utils.py

feat: add log

593a915

jdomdev and others added 19 commits June 19, 2025 16:43

feat: integrate Prometheus and Grafana for monitoring ETL pipeline services.

b10a601

Merge pull request #25 from Bootcamp-IA-P4/feature/redis

1b13c7b

Feature/redis

chore: update .gitignore to exclude Grafana runtime and build files.

2e0f52b

docs: add Grafana README and pipeline overview dashboard.

feat:creating front and api

3636cd9

refactor(SQL): remove unused functions in mongo_to_supabase file and imports from etl_utils.

dc8c525

feat: enhancing the frontend and the api

872f68a

feat: enhance Grafana monitoring with structured dashboards and datasource provisioning.

6925040

chore: relocate unit test to /tests/unit/.

chore: remove database/.gitkeep placeholder file.

a2adbaf

refactor: remove .gitkeep from docs/ folder.

ef5fce5

refactor: remove config/ and data/ folders from root project.

bb3f2e0

refactor: remove scripts/ folder from root project.

9616bef

ci: add GitHub Actions workflow for test automation on main and dev branches.

6d60e42

Merge branch 'dev' into feature/prometheus-metrics

223eb01

Merge pull request #27 from Bootcamp-IA-P4/feature/prometheus-metrics

393e418

Feature/prometheus metrics

docs: update readme.md

9ad33be

docs: update readme.md

0d8f0dd

Merge pull request #28 from Bootcamp-IA-P4/feature/readme

aeb1caf

Feature/readme

jdomdev merged commit 36ae72f into main Jun 22, 2025
2 checks passed
