Commit a1faeb4

[xpu/triton] Add triton dequantization kernel

This PR adds an XPU backend and a Triton kernel for dequantizing the nf4 dtype.

Tests:
tests/test_functional.py::TestQuantize4BitFunctional: supported nf4 cases
tests/test_functional.py::Test8BitBlockwiseQuantizeFunctional: implemented quantize_blockwise with binary search, which works faster on XPU

Signed-off-by: Dmitrii Makarenko <dmitrii.makarenko@intel.com>
1 parent 42bc729 commit a1faeb4
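
The commit message mentions a quantize_blockwise implemented with binary search. The kernel itself is not shown on this page, so the following is only a minimal pure-Python sketch of the general idea, assuming a sorted codebook and per-block absmax scaling. The function names, the five-entry toy codebook, and the block size are illustrative assumptions, not the code added by this commit and not the real NF4 table.

```python
from bisect import bisect_left


def nearest_code_index(code, x):
    """Binary-search the sorted list `code` for the entry closest to x."""
    i = bisect_left(code, x)
    if i == 0:
        return 0
    if i == len(code):
        return len(code) - 1
    # x lies between code[i - 1] and code[i]; pick the nearer one
    # (an exact tie goes to the lower index).
    return i if code[i] - x < x - code[i - 1] else i - 1


def quantize_blockwise(values, code, blocksize):
    """Scale each block by its absolute maximum, then map to codebook indices."""
    indices, absmax = [], []
    for start in range(0, len(values), blocksize):
        block = values[start:start + blocksize]
        # Fall back to 1.0 for an all-zero block to avoid dividing by zero.
        scale = max(abs(v) for v in block) or 1.0
        absmax.append(scale)
        indices.extend(nearest_code_index(code, v / scale) for v in block)
    return indices, absmax


def dequantize_blockwise(indices, absmax, code, blocksize):
    """Look up each index in the codebook and rescale by its block's absmax."""
    return [code[q] * absmax[i // blocksize] for i, q in enumerate(indices)]


# Toy codebook (hypothetical, for illustration only).
TOY_CODE = [-1.0, -0.5, 0.0, 0.5, 1.0]
values = [0.1, -2.0, 1.0, 4.0, 2.0, -0.2]
indices, absmax = quantize_blockwise(values, TOY_CODE, blocksize=3)
restored = dequantize_blockwise(indices, absmax, TOY_CODE, blocksize=3)
```

Binary search needs about log2(len(code)) comparisons per element instead of a scan over the whole codebook, which is one plausible reason such a lookup could be faster on a given device.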

4 files changed, +452 -0 lines changed

bitsandbytes/__init__.py

Lines changed: 3 additions & 0 deletions

@@ -34,6 +34,9 @@
 if torch.cuda.is_available():
     from .backends.cuda import ops as cuda_ops
 
+if torch.xpu.is_available():
+    from .backends.xpu import ops as xpu_ops
+
 
 def _import_backends():
     """
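
The added guard calls torch.xpu.is_available() directly. Assuming a PyTorch build that does not expose a torch.xpu namespace, that attribute access would raise AttributeError at import time. A more defensive check could look like the sketch below; xpu_is_available is a hypothetical helper, not part of bitsandbytes or PyTorch, and it takes the module as an argument so it can be tried out with a stand-in object.

```python
from types import SimpleNamespace


def xpu_is_available(torch_module):
    """Return True only if `torch_module.xpu.is_available()` exists and is truthy."""
    xpu = getattr(torch_module, "xpu", None)
    is_available = getattr(xpu, "is_available", None)
    # A missing namespace or a non-callable attribute counts as "not available".
    return bool(is_available()) if callable(is_available) else False


# Stand-ins for torch builds with and without XPU support (illustrative only).
torch_without_xpu = SimpleNamespace()
torch_with_xpu = SimpleNamespace(xpu=SimpleNamespace(is_available=lambda: True))
torch_xpu_no_device = SimpleNamespace(xpu=SimpleNamespace(is_available=lambda: False))
```

In the real package the call would be xpu_is_available(torch), placed where the diff currently tests torch.xpu.is_available().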

bitsandbytes/backends/xpu/__init__.py

Whitespace-only changes.
