Skip to content

Merge Development Branch into Main: Complete Data Engineering Pipeline Implementation #29

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 49 commits into from
Jun 22, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
49 commits
Select commit Hold shift + click to select a range
c942d8f
feat: add simple dockerized consumer and mongo
fintihlupik Jun 4, 2025
a6e7317
chore(structure): initialize project folder structure in dev branch.
jdomdev Jun 5, 2025
32821d5
feat: save raw data per collection in mongo
fintihlupik Jun 9, 2025
97ef793
fix: add logic to avoid duplicates in mongo collections insertion
fintihlupik Jun 9, 2025
fd2cafd
feat:creating a new brnach
odar1997 Jun 9, 2025
7ff6128
Merge branch 'feature/mongo' of https://github.com/Bootcamp-IA-P4/pro…
odar1997 Jun 9, 2025
99cb232
Merge branch 'dev' of https://github.com/Bootcamp-IA-P4/project-ix-da…
fintihlupik Jun 10, 2025
db31f2e
fix : add dev folder structure, changed compose.yml
fintihlupik Jun 10, 2025
096787a
feat: Write on .log for dates
juancmacias Jun 10, 2025
8cb1222
feat:filling the task
odar1997 Jun 12, 2025
aa40711
Merge pull request #19 from Bootcamp-IA-P4/feature/log
jdomdev Jun 12, 2025
894b1be
feat:adding procesing of data
odar1997 Jun 12, 2025
470c760
Merge branch 'new/branch' of https://github.com/Bootcamp-IA-P4/projec…
odar1997 Jun 12, 2025
8cd97ae
resolving conflicts
odar1997 Jun 12, 2025
3d3e492
feat:processing and cleaning data
odar1997 Jun 13, 2025
4ce53b6
feat:processing data
odar1997 Jun 13, 2025
9931795
Merge pull request #20 from Bootcamp-IA-P4/new/branch
jdomdev Jun 13, 2025
017da5c
feat: WIP(Work In Progress) ETL de Kafka a Supabase (PostgreSQL).
jdomdev Jun 16, 2025
0182e1c
feat: WIP(Work In Progress) ETL insertions in remote PostgreSQL(Supab…
jdomdev Jun 16, 2025
14fd114
feat: start Mongo→Supabase ETL after dropping Kafka→Supabase streamin…
jdomdev Jun 17, 2025
f793fcb
build: dockerize kafka consumer, mongoDB, and mongo-to-postgres ETL s…
jdomdev Jun 17, 2025
5c00dc9
chore: organize test data files and configure mongo user for mongo-ex…
jdomdev Jun 17, 2025
014da06
chore: remove unnecessary .gitkeep files from non-empty directories.
jdomdev Jun 17, 2025
6740d88
feat: complete data insertion into all 5 Supabase tables from MongoDB.
jdomdev Jun 18, 2025
f4966c8
Merge pull request #22 from Bootcamp-IA-P4/feature/kafka-to-supabase
odar1997 Jun 18, 2025
edc8b2e
feat:adding testing to etl_utils.py
odar1997 Jun 18, 2025
21ddcd1
feat: wop redis start config
fintihlupik Jun 18, 2025
6b0bded
feat: add redis to avoid duplicated messages
fintihlupik Jun 19, 2025
8e81629
Merge pull request #24 from Bootcamp-IA-P4/testing
jdomdev Jun 19, 2025
593a915
feat: add log
fintihlupik Jun 19, 2025
b10a601
feat: integrate Prometheus and Grafana for monitoring ETL pipeline se…
jdomdev Jun 19, 2025
1b13c7b
Merge pull request #25 from Bootcamp-IA-P4/feature/redis
jdomdev Jun 19, 2025
2e0f52b
chore: update .gitignore to exclude Grafana runtime and build files.
jdomdev Jun 19, 2025
3636cd9
feat:creating front and api
odar1997 Jun 19, 2025
dc8c525
refactor(SQL): remove unused functions in mongo_to_supabase file and …
jdomdev Jun 20, 2025
872f68a
feat:inhancing the frontend and the api
odar1997 Jun 20, 2025
1f70e04
Merge pull request #26 from Bootcamp-IA-P4/feature/api
jdomdev Jun 20, 2025
6925040
feat: enhance Grafana monitoring with structured dashboards and datas…
jdomdev Jun 20, 2025
a2adbaf
chore: remove database/.gitkeep placeholder file.
jdomdev Jun 20, 2025
ef5fce5
refactor: remove .gitkeep from docs/ folder.
jdomdev Jun 20, 2025
bb3f2e0
refactor: remove config/ and data/ folders from root project.
jdomdev Jun 20, 2025
9616bef
refactor: remove scripts/ folder from root project.
jdomdev Jun 20, 2025
6d60e42
ci: add GitHub Actions workflow for test automation on main and dev b…
jdomdev Jun 20, 2025
223eb01
Merge branch 'dev' into feature/prometheus-metrics
jdomdev Jun 20, 2025
393e418
Merge pull request #27 from Bootcamp-IA-P4/feature/prometheus-metrics
odar1997 Jun 20, 2025
531bebc
fix: restore MongoDB writes after merge + Dockerfile sets Grafana UID…
jdomdev Jun 22, 2025
9ad33be
docs: update readme.md
jdomdev Jun 22, 2025
0d8f0dd
docs: update readme.md
jdomdev Jun 22, 2025
aeb1caf
Merge pull request #28 from Bootcamp-IA-P4/feature/readme
jdomdev Jun 22, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
29 changes: 29 additions & 0 deletions .github/workflows/python-tests.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
name: Python CI

on:
push:
branches: [ main, dev ]
pull_request:
branches: [ main, dev ]

jobs:
test:
runs-on: ubuntu-latest

steps:
- name: Checkout code
uses: actions/checkout@v4

- name: Set up Python
uses: actions/setup-python@v5
with:
python-version: '3.10'

- name: Install dependencies
run: |
python -m pip install --upgrade pip
if [ -f requirements.txt ]; then pip install -r requirements.txt; fi

- name: Run unit tests
run: |
python -m unittest discover -s tests/unit -p "*.py" -v
55 changes: 19 additions & 36 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -1,3 +1,4 @@
mensajes_kafka.txt
# Byte-compiled / optimized / DLL files
__pycache__/
*.py[cod]
Expand Down Expand Up @@ -94,12 +95,6 @@ ipython_config.py
# install all needed dependencies.
#Pipfile.lock

# UV
# Similar to Pipfile.lock, it is generally recommended to include uv.lock in version control.
# This is especially recommended for binary packages to ensure reproducibility, and is more
# commonly ignored for libraries.
#uv.lock

# poetry
# Similar to Pipfile.lock, it is generally recommended to include poetry.lock in version control.
# This is especially recommended for binary packages to ensure reproducibility, and is more
Expand All @@ -112,10 +107,8 @@ ipython_config.py
#pdm.lock
# pdm stores project-wide configurations in .pdm.toml, but it is recommended to not include it
# in version control.
# https://pdm.fming.dev/latest/usage/project/#working-with-version-control
# https://pdm.fming.dev/#use-with-ide
.pdm.toml
.pdm-python
.pdm-build/

# PEP 582; used by e.g. github.com/David-OConnor/pyflow and github.com/pdm-project/pdm
__pypackages__/
Expand Down Expand Up @@ -165,30 +158,20 @@ cython_debug/
# be found at https://github.com/github/gitignore/blob/main/Global/JetBrains.gitignore
# and can be added to the global gitignore or merged into this file. For a more nuclear
# option (not recommended) you can uncomment the following to ignore the entire idea folder.
#.idea/

# Abstra
# Abstra is an AI-powered process automation framework.
# Ignore directories containing user credentials, local state, and settings.
# Learn more at https://abstra.io/docs
.abstra/

# Visual Studio Code
# Visual Studio Code specific template is maintained in a separate VisualStudioCode.gitignore
# that can be found at https://github.com/github/gitignore/blob/main/Global/VisualStudioCode.gitignore
# and can be added to the global gitignore or merged into this file. However, if you prefer,
# you could uncomment the following to ignore the enitre vscode folder
# .vscode/

# Ruff stuff:
.ruff_cache/

# PyPI configuration file
.pypirc

# Cursor
# Cursor is an AI-powered code editor. `.cursorignore` specifies files/directories to
# exclude from AI features like autocomplete and code analysis. Recommended for sensitive data
# refer to https://docs.cursor.com/context/ignore-files
.cursorignore
.cursorindexingignore
.idea/

# Grafana runtime files
monitoring/grafana/data/
monitoring/grafana/plugins/
monitoring/grafana/grafana.db
monitoring/grafana/pdf/
monitoring/grafana/csv/
monitoring/grafana/png/
monitoring/grafana/__pycache__/
monitoring/grafana/logs/
monitoring/grafana/conf/
monitoring/grafana/public/
monitoring/grafana/node_modules/
monitoring/grafana/yarn.lock
monitoring/grafana/package-lock.json
monitoring/grafana/package.json
Loading