Skip to content

Commit 728f7a2

Browse files
mgoinpy-andy-c
authored andcommitted
[Bugfix] Restrict Machete to only run on Hopper (vllm-project#20830)
Signed-off-by: mgoin <mgoin64@gmail.com>
1 parent 5ca7b40 commit 728f7a2

File tree

1 file changed

+3
-0
lines changed
  • vllm/model_executor/layers/quantization/kernels/mixed_precision

1 file changed

+3
-0
lines changed

vllm/model_executor/layers/quantization/kernels/mixed_precision/machete.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -32,6 +32,9 @@ def can_implement(cls,
3232
if not current_platform.is_cuda():
3333
return False, "Machete only supported on CUDA"
3434

35+
if not current_platform.is_device_capability(90):
36+
return False, "Machete requires compute capability of 90 (Hopper)"
37+
3538
if c.has_g_idx and\
3639
c.partition_weight_shape[0] != c.full_weight_shape[0]:
3740
return False, "Act reordering currently not supported by Machete, "\

0 commit comments

Comments
 (0)