Skip to content

Commit e5de2b9

Browse files
authored
skip layers if already fused (#322)
1 parent a6327c7 commit e5de2b9

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

src/compressed_tensors/quantization/lifecycle/initialize.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -304,6 +304,9 @@ def _valid_fp4_quant(layer_list: List[torch.nn.Linear]):
304304
):
305305

306306
if _is_attention_module(submodule):
307+
# already fused/treated as one layer
308+
if hasattr(submodule, "qkv_proj"):
309+
continue
307310

308311
if not _valid_fp4_quant(
309312
[submodule.q_proj, submodule.v_proj, submodule.k_proj]

0 commit comments

Comments
 (0)