This is a quick start of using JuiceFS as storage backend for Amazon EMR cluster.
-
Updated
Aug 18, 2022 - Shell
This is a quick start of using JuiceFS as storage backend for Amazon EMR cluster.
This is an end-to-end AWS Cloud ETL project. This data pipeline uses an Amazon EMR cluster managed by Apache Airflow that is running on an AWS EC2 instance. It demonstrates how to build orchestration that would perform data transformation using Amazon EMR as well as automatic data ingestion into a Snowflake via Snowpipe. It also features Power BI.
Add a description, image, and links to the amazon-emr-cluster topic page so that developers can more easily learn about it.
To associate your repository with the amazon-emr-cluster topic, visit your repo's landing page and select "manage topics."