OMEinsum is slow for batched contractions because it uses the fallback loop method for them. We can intercept these cases and dispatch them to a more efficient implementation.
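
As a rough illustration of what "intercepting" could mean, the sketch below classifies the index labels of a pairwise contraction and, when the pattern is a plain batched matrix product, reports the `(batch, M, K)` and `(batch, K, N)` shapes that a batched GEMM routine would need after permuting and reshaping the operands. It is written in Python purely for illustration and does not reflect OMEinsum's internals; `plan_batched_gemm` and its return format are hypothetical names introduced here.

```python
from math import prod


def plan_batched_gemm(ix1, ix2, iy, sizes):
    """Classify the labels of a pairwise contraction `ix1, ix2 -> iy`.

    Hypothetical helper, not an OMEinsum API. Returns a dict describing the
    batched-GEMM layout, or None when the pattern is not a plain batched
    matrix product and should be left to the generic fallback.
    """
    s1, s2, sy = set(ix1), set(ix2), set(iy)

    # Labels repeated within one tensor (traces, diagonals) are out of scope.
    if len(s1) != len(ix1) or len(s2) != len(ix2) or len(sy) != len(iy):
        return None
    # Every output label must come from one of the inputs.
    if not sy <= (s1 | s2):
        return None

    batch = [l for l in ix1 if l in s2 and l in sy]      # shared and kept
    inner = [l for l in ix1 if l in s2 and l not in sy]  # shared and summed
    left = [l for l in ix1 if l not in s2]               # only in first input
    right = [l for l in ix2 if l not in s1]              # only in second input

    # A label that appears in one input only must survive to the output;
    # otherwise the contraction also needs a separate sum-reduction.
    if any(l not in sy for l in left + right):
        return None
    # Without batch labels this is an ordinary matrix product.
    if not batch:
        return None

    def size(labels):
        # Product of the dimensions of the given labels (1 if empty).
        return prod(sizes[l] for l in labels)

    return {
        "batch": batch,
        "left": left,
        "inner": inner,
        "right": right,
        "a_shape": (size(batch), size(left), size(inner)),
        "b_shape": (size(batch), size(inner), size(right)),
    }
```

For example, `plan_batched_gemm("ijl", "jkl", "ikl", {"i": 2, "j": 3, "k": 4, "l": 5})` identifies `l` as the batch label and `j` as the contracted label, so the operands can be viewed as stacks of 2×3 and 3×4 matrices. Patterns the helper rejects would continue to go through the existing fallback.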