Replies: 1 comment 1 reply
-
Maybe something like this could be used: https://crates.io/crates/multiversion |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
@julianbraha I saw you made the original PR for using
mul_add
to speed up float operations in chunk gen. Those we ultimately reverted because that function was extremely slow on non-FMA cpus.I was wondering if you had access to a FMA cpu and could run benchmarks to see if it would be worth adding a compile time method to use
mul_add
orx * a + b
based on whether the CPU supports.Beta Was this translation helpful? Give feedback.
All reactions