Skip to content

BENCH: Squeezing out the last dregs of performance #20

@HaoZeke

Description

@HaoZeke

Some considerations (to be expanded):

  • Intel suite + MKL
    • Unclear if xtensor has an MKL calling process.. MKL is unlikely to improve much (not too many ops)
    • icc should still be helpful, with the right flags (document!)
  • The Hoard memory allocator
    • This might just be useful, but maybe less so since the HashMap addition..
  • The Mold linker
    • Really almost pointless to even add, we'll never link as fast as the Python runtime, and it scarcely matters... (maybe better developer ease)

Most (all) of these are optionals to be documented, not really to be changed wholesale.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions