Hi, this is great work. However, I can't find the host code, i.e., the `main` function. After running [genkernel.sh](https://github.com/gongbell/CUDAsmith/blob/master/genkernel.sh), I only see `__global__` and `__device__` functions. Where is the host code located? Thank you very much!