CUDA is fast, but it's only supported by Nvidia's GPU. It would be better to use [HIP](https://developer.amd.com/resources/rocm-learning-center/fundamentals-of-hip-programming/) to support more GPU.