Description
Hi, is it possible to automatically quantize big arrays for long computations and then convert the results back to their original value range?
Maybe we could use autoencoders, Gaussians, or some other network to quantize big arrays before the computation and map them back to values afterwards.
This may require training, but it could be a nice feature.
We would provide the list of possible values as a txt or csv file.
Java unit tests could be used for training. Maybe we could pass all the data to TornadoVM at the unit test stage,
train it, and then pass the list back to TornadoVM in quantized form,
so TornadoVM can utilize INT4 and INT8.
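To make the request more concrete, here is a minimal sketch of the simplest form this could take: symmetric INT8 quantization, where the "training" step is just a calibration pass over the sample values. The class `Int8Quantizer` and its methods are hypothetical names for illustration only and are not part of TornadoVM; a learned quantizer (autoencoder etc.) would replace `calibrateScale`.

```java
// Hypothetical helper, NOT a TornadoVM API: symmetric per-array INT8 quantization.
final class Int8Quantizer {

    // Calibration ("training") step: derive one scale from the largest
    // magnitude found in the sample values, e.g. loaded from a txt/csv file.
    static float calibrateScale(float[] samples) {
        float maxAbs = 0f;
        for (float v : samples) {
            maxAbs = Math.max(maxAbs, Math.abs(v));
        }
        // Fall back to 1 so an all-zero sample set does not give a zero scale.
        return maxAbs == 0f ? 1f : maxAbs / 127f;
    }

    // Map floats to signed 8-bit integers, clamping to [-127, 127].
    static byte[] quantize(float[] values, float scale) {
        byte[] q = new byte[values.length];
        for (int i = 0; i < values.length; i++) {
            int rounded = Math.round(values[i] / scale);
            q[i] = (byte) Math.max(-127, Math.min(127, rounded));
        }
        return q;
    }

    // Map quantized values back to approximate original values.
    static float[] dequantize(byte[] q, float scale) {
        float[] out = new float[q.length];
        for (int i = 0; i < q.length; i++) {
            out[i] = q[i] * scale;
        }
        return out;
    }
}
```

The round trip is lossy: for values inside the calibrated range, each one comes back with an error of roughly half the scale at most.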
Training could also be used to ignore values that do not matter after the calculation.
There is also a neural network optimizer:
https://github.com/microsoft/Olive
I don't know whether TornadoVM can also make use of this kind of utility.
Also, each quantization training could be a "context",
so when I multiply two INT4 arrays, would I also pass which trained "context" each one belongs to?
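A rough sketch of what I mean by passing a "context" along with each array, assuming the context only holds a name and a scale produced by the training step. `QuantContext` and `QuantizedOps` are hypothetical names, not existing TornadoVM classes, and `byte` stands in for INT4 since Java has no 4-bit type.

```java
// Hypothetical types, NOT TornadoVM APIs: a trained quantization "context"
// travels with each quantized array so results can be scaled back.
final class QuantContext {
    final String name;   // label of the training run this scale came from
    final float scale;   // real value = quantized value * scale

    QuantContext(String name, float scale) {
        this.name = name;
        this.scale = scale;
    }
}

final class QuantizedOps {

    // Dot product of two quantized arrays. The multiply-accumulate runs
    // entirely on integers; both contexts are applied once at the end.
    static float dot(byte[] a, QuantContext ca, byte[] b, QuantContext cb) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("arrays must have the same length");
        }
        long acc = 0;
        for (int i = 0; i < a.length; i++) {
            acc += a[i] * b[i];
        }
        return acc * ca.scale * cb.scale;
    }
}
```

With this shape, the heavy loop only touches small integers, and the two contexts are needed just once to bring the result back to real values.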