Package to generate synthetic data using declarative description of the schema.
To install run python setup.py install
or pip install -e .
.
Install pre-commit checks by running pre-commit install
.
Run pytest
Schema definition can be found in example
directory.
To generate fake data run:
gen_fake --schemas-dir example --out-dir out
# This should produce a directory named out with these files
ll out
# total 1024
# -rw-r--r-- 1 franek staff 306K Jan 5 15:09 schema.trades.csv
# -rw-r--r-- 1 franek staff 166K Jan 5 15:09 schema.trades_report.csv
# -rw-r--r-- 1 franek staff 32K Jan 5 15:09 schema.users.csv
cat out/schema.trades.csv
#id,buyer_id,seller_id,price
#0,2954,4745,678630314038.799
#1,403,2051,66406442.1806716
#2,1884,9404,7500077.66942181
#...