Settings: Nano Jetson Xavier No GPU: NVidia Volta Power mode: 20W 6 Core PyTorch 1.13 CUDA: 11.4 —————— We were able to run FADNet offline with sample dataset, but it was extremely slow (4~5 frames per second generated) on a input resolution of 512x256 We’d like to know what is the bottle neck that’s running it this slow as FADNet claims to be fast and accurate, so we must have done something wrong… I can provide the code if needed, but just want to get a general idea first of how fast the FADNet can actually run ideally —————