Great work! Is it normal for inference on the single demo image to take several minutes on an A40 GPU (45 GB)?