Inspired by: https://grantmcdermott.com/duckdb-polars/
To download the NYC data you will need the aws cli tool and then you can run the following command:
mkdir -p nyc-taxi/year=2012
aws s3 cp s3://voltrondata-labs-datasets/nyc-taxi/year=2012 nyc-taxi/year=2012 --recursive --no-sign-request