Implement _mm256_exp_pd() for computing transition probability matrices (Issue #105)
The intrinsic does not map to a hardware instruction and is only available in the Intel Short Vector Math Library (SVML).
A single-precision implementation exists, but its results seem to differ slightly from the libc implementation, at least in some corner cases.
A good non-vectorized double-precision implementation seems to be the one in the old Cephes library.
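
As a starting point for discussion, here is a minimal scalar sketch of the Cephes-style approach (range reduction followed by a rational approximation), written without branches in the core so that each step could later be applied lane-wise to a `__m256d`. The function name `exp_sketch` is hypothetical, the coefficients are quoted from memory of Cephes `exp.c` and should be checked against the original source, and the clamping of large inputs is a simplification that does not match libc (which returns infinity or zero there).

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical scalar sketch of a Cephes-style exp(); not the library's code.
 * Coefficients are assumed to match Cephes exp.c and should be verified. */
static double exp_sketch(double x)
{
    if (x != x) return x;            /* propagate NaN */

    /* Simplification: saturate instead of returning inf / 0 like libc. */
    if (x > 708.0)  x = 708.0;
    if (x < -708.0) x = -708.0;

    /* Range reduction: x = k*ln(2) + r, with |r| <= ln(2)/2. */
    double t = 1.4426950408889634 * x;              /* x * log2(e) */
    int64_t k = (int64_t)(t + (t < 0.0 ? -0.5 : 0.5));
    double n = (double)k;
    x -= n * 6.93145751953125E-1;                   /* high part of ln(2) */
    x -= n * 1.42860682030941723212E-6;             /* low part of ln(2)  */

    /* Rational approximation: exp(r) = 1 + 2r P(r^2) / (Q(r^2) - r P(r^2)). */
    double xx = x * x;
    double p = ((1.26177193074810590878E-4 * xx
               + 3.02994407707441961300E-2) * xx
               + 9.99999999999999999910E-1) * x;
    double q = ((3.00198505138664455042E-6 * xx
               + 2.52448340349684104192E-3) * xx
               + 2.27265548208155028766E-1) * xx
               + 2.00000000000000000009E0;
    double r = 1.0 + 2.0 * p / (q - p);

    /* Build 2^k directly from the IEEE-754 exponent bits.
     * After clamping, k + 1023 stays within the normal exponent range. */
    uint64_t bits = (uint64_t)(k + 1023) << 52;
    double scale;
    memcpy(&scale, &bits, sizeof scale);

    return r * scale;
}
```

The exponent is assembled with integer operations rather than `ldexp()` because a vectorized version would need to do the same with integer shifts on the lanes. Corner cases (overflow, underflow, subnormal results) are exactly where this sketch deviates from libc, so they would need separate handling and testing.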