File tree
5 files changed
+79
-60
lines changed- tests/quantization
- vllm/model_executor/layers/quantization
- compressed_tensors
- schemes
- utils
5 files changed
+79
-60
lines changedLines changed: 7 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
667 | 667 |
| |
668 | 668 |
| |
669 | 669 |
| |
670 |
| - | |
| 670 | + | |
| 671 | + | |
| 672 | + | |
| 673 | + | |
| 674 | + | |
| 675 | + | |
| 676 | + | |
671 | 677 |
| |
672 | 678 |
| |
673 | 679 |
| |
|
Lines changed: 8 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
374 | 374 |
| |
375 | 375 |
| |
376 | 376 |
| |
377 |
| - | |
| 377 | + | |
| 378 | + | |
| 379 | + | |
| 380 | + | |
| 381 | + | |
| 382 | + | |
| 383 | + | |
| 384 | + | |
378 | 385 |
| |
379 | 386 |
| |
380 | 387 |
| |
|
Lines changed: 13 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
18 | 18 |
| |
19 | 19 |
| |
20 | 20 |
| |
21 |
| - | |
| 21 | + | |
| 22 | + | |
22 | 23 |
| |
23 | 24 |
| |
24 | 25 |
| |
| |||
64 | 65 |
| |
65 | 66 |
| |
66 | 67 |
| |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
67 | 75 |
| |
68 | 76 |
| |
69 | 77 |
| |
| |||
77 | 85 |
| |
78 | 86 |
| |
79 | 87 |
| |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
80 | 92 |
| |
81 | 93 |
| |
82 | 94 |
| |
|
vllm/model_executor/layers/quantization/compressed_tensors/schemes/compressed_tensors_w4a4_nvfp4.py
Lines changed: 22 additions & 57 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
9 | 9 |
| |
10 | 10 |
| |
11 | 11 |
| |
12 |
| - | |
13 |
| - | |
14 | 12 |
| |
15 | 13 |
| |
16 | 14 |
| |
| |||
21 | 19 |
| |
22 | 20 |
| |
23 | 21 |
| |
24 |
| - | |
25 |
| - | |
26 |
| - | |
27 |
| - | |
28 |
| - | |
29 |
| - | |
30 |
| - | |
31 |
| - | |
32 | 22 |
| |
33 | 23 |
| |
34 | 24 |
| |
35 | 25 |
| |
36 |
| - | |
37 |
| - | |
38 |
| - | |
39 |
| - | |
40 | 26 |
| |
41 | 27 |
| |
42 | 28 |
| |
43 |
| - | |
44 |
| - | |
45 |
| - | |
46 |
| - | |
47 |
| - | |
48 |
| - | |
49 |
| - | |
50 |
| - | |
51 |
| - | |
52 |
| - | |
| 29 | + | |
53 | 30 |
| |
54 |
| - | |
55 |
| - | |
56 |
| - | |
57 |
| - | |
58 |
| - | |
59 |
| - | |
60 |
| - | |
61 |
| - | |
62 |
| - | |
63 |
| - | |
64 |
| - | |
65 |
| - | |
66 |
| - | |
67 |
| - | |
68 |
| - | |
69 |
| - | |
70 |
| - | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
71 | 39 |
| |
72 | 40 |
| |
73 | 41 |
| |
| |||
152 | 120 |
| |
153 | 121 |
| |
154 | 122 |
| |
155 |
| - | |
156 |
| - | |
157 |
| - | |
158 |
| - | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
159 | 126 |
| |
160 | 127 |
| |
161 | 128 |
| |
162 | 129 |
| |
163 | 130 |
| |
164 | 131 |
| |
165 |
| - | |
166 |
| - | |
167 |
| - | |
| 132 | + | |
| 133 | + | |
168 | 134 |
| |
169 |
| - | |
170 |
| - | |
| 135 | + | |
| 136 | + | |
171 | 137 |
| |
172 |
| - | |
173 |
| - | |
174 |
| - | |
175 |
| - | |
176 |
| - | |
177 |
| - | |
178 |
| - | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + |
Lines changed: 29 additions & 0 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
102 | 102 |
| |
103 | 103 |
| |
104 | 104 |
| |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + |
0 commit comments