Local Tutorial Custom schema
Dinesh Chandnani edited this page Apr 15, 2019 · 8 revisions
Next, we can update the schema of the test data:
- Open an existing flow
- Go to the Input tab
- In the schema section, edit the text. To add a column, add an entry with the following properties:
  - name: name of the column
  - type: data type of the column, such as double, int, or string
  - nullable: true if the value can be null, false otherwise
  - metadata:
    - minValue: for decimals, the floor value; generated data stays above it
    - maxValue: for decimals, the ceiling value; generated data stays below it
NOTE: For the double data type, the value must be written as a double, such as 15.1. Because of an issue with the JSON editor, which rounds off zeros after the decimal point, make sure the digits after the decimal point are greater than 0. For example, 15.0 won't work; use 15.1 instead.
- Click Deploy
You now have the pipeline running with your custom data and can view basic data on the Metrics dashboard.
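As an illustration of the properties listed in the steps above, a single column entry might look like the following. This is a minimal sketch: the column name `temperature` is hypothetical, and where the entry goes depends on the schema already shown in the editor.

```json
{
  "name": "temperature",
  "type": "double",
  "nullable": false,
  "metadata": {
    "minValue": 5.1,
    "maxValue": 100.1
  }
}
```

Both bounds have a nonzero digit after the decimal point, in line with the NOTE on double values.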
The following data generation hints are supported (in local mode) through the metadata tag, depending on the data type:
Numbers (int, long, float, double)
"metadata": {
"minValue": 5.1,
"maxValue": 100.1
}
"metadata": {
"allowedValues": [1,2,3,5,7]
}
// Only for the long type: generates a timestamp value in milliseconds
"metadata": {
"useCurrentTimeMillis": true
}
String, Array, Map (controls the length of the generated data)
"metadata": {
"maxLength": 9
}
Apart from these data types, boolean and struct data types are also supported in local mode.
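To show how these hints attach to individual columns, here is a sketch of three column entries, each using one of the metadata hints above. The column names (`deviceId`, `eventTime`, `status`) are hypothetical, and the entries are assumed to sit alongside the other column entries in the schema shown in the editor.

```json
[
  {
    "name": "deviceId",
    "type": "long",
    "nullable": false,
    "metadata": {
      "allowedValues": [1, 2, 3, 5, 7]
    }
  },
  {
    "name": "eventTime",
    "type": "long",
    "nullable": false,
    "metadata": {
      "useCurrentTimeMillis": true
    }
  },
  {
    "name": "status",
    "type": "string",
    "nullable": true,
    "metadata": {
      "maxLength": 9
    }
  }
]
```

With entries like these, `deviceId` would be drawn from the allowed values, `eventTime` would carry a timestamp in milliseconds, and `status` would be limited in length by `maxLength`.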