Some considerations (to be expanded): - Intel suite + MKL + Unclear if `xtensor` has an MKL calling process.. MKL is unlikely to improve much (not too many ops) + `icc` should still be helpful, with the right flags (document!) - The [Hoard memory allocator](https://github.com/emeryberger/Hoard) + This might just be useful, but maybe less so since the HashMap addition.. - The [Mold linker](https://github.com/rui314/mold) + Really almost pointless to even add, we'll never link as fast as the Python runtime, and it scarcely matters... (maybe better developer ease) Most (all) of these are optionals to be documented, not really to be changed wholesale.