Replies: 1 comment 12 replies
-
The 1070 has faster FP32 (single-precision) floating-point performance than FP16 (half-precision) performance. By using the I'm in the same boat with my 1080ti. So, I understand what you are experiencing. More modern cards have been designed with greater FP16 performance. This wiki table also shows the legacy performance of the 10 Series cards: Hard to believe it's only been five years. 😃 |
Beta Was this translation helpful? Give feedback.
12 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi, so this fork is great for small VRam, with only 2.4 GB, with turbo it takes 3.4GB, but its still slow vs un-optimized of other forks like hlky 49sec here vs hlky fork 37sec that takes 6.4GB. So it would be cool if there is a config to enable un-optimized for people who owns > 8Gb VRam.
I used my gtx 1070 8GB here for testing.
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions