Any plans to implement mxfp4 and/or nvfp4? #1783
-
|
Hello. I'm a big fan of I'm aware of torchao but I'm wondering if there are any plans to support mxfp4 and/or nvfp4? Obviously the native support for GPUs is great but a really nice thing about Thank you! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
|
We're currently considering our options for additional quantization schemes. MXFP4 and NVFP4 are definitely options we're considering to add. I've mostly been thinking about this in terms of LLMs to be honest, but I think your use case is quite interesting too. Do you have any background you can share on how well either of those formats would potentially perform for ANN? Also as an FYI regarding hardware support:
|
Beta Was this translation helpful? Give feedback.
Hi @davidmezzetti
We're currently considering our options for additional quantization schemes. MXFP4 and NVFP4 are definitely options we're considering to add. I've mostly been thinking about this in terms of LLMs to be honest, but I think your use case is quite interesting too. Do you have any background you can share on how well either of those formats would potentially perform for ANN?
Also as an FYI regarding hardware support: