to amortize the overhead of JNI/cgo call. otherwise need to call JNI/cgo for each row. cc @Xuanwo @sundy-li