Does anyone have FP8 vs. FP16 benchmark results on H100? #7385
ajtejankar
announced in Q&A
Replies: 0 comments
I am benchmarking FP8 vs. FP16 on an H100 and I don't see much of an improvement. The repro script (which uses vLLM's benchmarking code) and the results are below. I am worried that I may be doing something wrong, so I just want to confirm.
The results are: