This repository contains Google Cloud Run functions for the Rubin Observatory's Prompt Products Database (PPDB) along with related scripts and configuration files.
There is currently a single function, which triggers a Dataflow job that loads table data into BigQuery from Parquet files in Google Cloud Storage (GCS). The implementation of this function is contained in the `stage_chunk` directory, which includes the following files:
- `build-container.sh` - Builds the Docker container for the Dataflow job
- `build-flex-template.sh` - Deploys the flex template for the Dataflow job
- `deploy-function.sh` - Deploys the Cloud Function to listen for events on the `stage-chunk-topic` Pub/Sub topic
- `Dockerfile` - Dockerfile for the Dataflow job, which launches the Apache Beam script
- `main.py` - Implementation of the Cloud Function which triggers the Dataflow job. The function accepts the name of a GCS bucket and prefix containing the Parquet files for a replica chunk, e.g., `gs://rubin-ppdb-test-bucket-1/data/tmp/2025/04/23/1737056400`. (See the sketch after this list.)
- `Makefile` - Makefile with helpful targets for deploying and tearing down the Cloud Function. Typing `make` will print all available targets.
- `metadata.json` - Required metadata for the Dataflow job
- `requirements.txt` - Python dependencies for the Dataflow job
- `stage_chunk_beam_job.py` - Apache Beam script for loading the data into BigQuery from Parquet files in GCS. (See the pipeline sketch after this list.)
- `teardown.sh` - Script to tear down the Cloud Function and Dataflow configuration, including deleting the Docker image, removing the flex template, etc.
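As a rough illustration of how `main.py` fits together, below is a minimal sketch of a Pub/Sub-triggered Cloud Function that launches a Dataflow flex template job. The environment variables (`PROJECT_ID`, `REGION`, `TEMPLATE_SPEC_PATH`), the JSON payload layout, and the `input_path` parameter name are assumptions made for the example, not values taken from this repository.

```python
# Minimal sketch of a Pub/Sub-triggered Cloud Function that launches a
# Dataflow flex template job. All names and parameters here are illustrative.
import base64
import json
import os
import time

import functions_framework
from googleapiclient.discovery import build

# Assumed environment variables set at deploy time.
PROJECT_ID = os.environ["PROJECT_ID"]
REGION = os.environ.get("REGION", "us-central1")
# GCS path of the flex template spec produced by build-flex-template.sh.
TEMPLATE_SPEC_PATH = os.environ["TEMPLATE_SPEC_PATH"]


@functions_framework.cloud_event
def stage_chunk(cloud_event):
    """Triggered by a message on the stage-chunk-topic Pub/Sub topic."""
    # Pub/Sub message data is base64-encoded; here it is assumed to be JSON,
    # e.g. {"bucket": "rubin-ppdb-test-bucket-1", "prefix": "data/tmp/2025/04/23/1737056400"}
    payload = json.loads(base64.b64decode(cloud_event.data["message"]["data"]))
    bucket = payload["bucket"]
    prefix = payload["prefix"]

    # Launch the flex template job via the Dataflow v1b3 API.
    dataflow = build("dataflow", "v1b3")
    request = dataflow.projects().locations().flexTemplates().launch(
        projectId=PROJECT_ID,
        location=REGION,
        body={
            "launchParameter": {
                "jobName": f"stage-chunk-{int(time.time())}",
                "containerSpecGcsPath": TEMPLATE_SPEC_PATH,
                "parameters": {
                    # Hypothetical parameter name consumed by the Beam script.
                    "input_path": f"gs://{bucket}/{prefix}",
                },
            }
        },
    )
    response = request.execute()
    print(f"Launched Dataflow job: {response['job']['id']}")
```

Once deployed, a function of this shape could be exercised by publishing a message to the `stage-chunk-topic` topic, for example with `gcloud pubsub topics publish`.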
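Similarly, the overall shape of `stage_chunk_beam_job.py` can be pictured as a short Beam pipeline that reads Parquet rows from a GCS prefix and appends them to a BigQuery table. The `--input_path` and `--output_table` options below are hypothetical names for this sketch; the real script defines its own parameters and schema handling.

```python
# Minimal sketch of a Beam pipeline that loads Parquet files from GCS into BigQuery.
import argparse

import apache_beam as beam
from apache_beam.io.parquetio import ReadFromParquet
from apache_beam.io.gcp.bigquery import BigQueryDisposition, WriteToBigQuery
from apache_beam.options.pipeline_options import PipelineOptions


def run(argv=None):
    parser = argparse.ArgumentParser()
    parser.add_argument("--input_path", required=True,
                        help="GCS prefix containing the Parquet files, e.g. gs://bucket/prefix")
    parser.add_argument("--output_table", required=True,
                        help="BigQuery table in the form PROJECT:DATASET.TABLE")
    known_args, pipeline_args = parser.parse_known_args(argv)

    with beam.Pipeline(options=PipelineOptions(pipeline_args)) as p:
        (
            p
            # ReadFromParquet yields one dict per row, keyed by column name.
            | "ReadParquet" >> ReadFromParquet(f"{known_args.input_path}/*.parquet")
            # Rows are appended to an existing table, so no schema is supplied here.
            | "WriteBigQuery" >> WriteToBigQuery(
                known_args.output_table,
                write_disposition=BigQueryDisposition.WRITE_APPEND,
                create_disposition=BigQueryDisposition.CREATE_NEVER,
            )
        )


if __name__ == "__main__":
    run()
```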