Commit ae8b7ac

Update llama-quant.cpp
1 parent 63fbbca commit ae8b7ac

1 file changed, +1 -0 lines changed

src/llama-quant.cpp

Lines changed: 1 addition & 0 deletions
@@ -217,6 +217,7 @@ static ggml_type llama_tensor_get_type(quantize_state_impl & qs, ggml_type new_t
     else if (i_layer < 12) new_type = GGML_TYPE_Q3_K; // 3.5 bpw
     else if (i_layer < 18) new_type = GGML_TYPE_IQ2_XXS; // 2.06 bpw
     else if (i_layer > 58) new_type = GGML_TYPE_IQ2_XXS; // 3.5 bpw
+    else new_type = GGML_TYPE_IQ3_S;
 }
 else {
     if (i_layer < 6) new_type = GGML_TYPE_Q4_K;
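
The effect of the added `else` can be sketched in isolation. This is only an illustration of the branches visible in the hunk, not the actual code in `llama_tensor_get_type`: the `QuantType` enum and both function names are hypothetical stand-ins, and any branches that precede `i_layer < 12` are cut off in the diff, so the sketch assumes there are none. Before the change, layers 18 through 58 appear to fall through the chain and keep whatever type was passed in; after it, they are assigned `GGML_TYPE_IQ3_S`.

```cpp
// Hypothetical stand-in for the ggml_type values named in the diff
// (not the real ggml enum). Incoming represents the type passed in.
enum class QuantType { Incoming, Q3_K, IQ2_XXS, IQ3_S };

// Visible branches before the commit: no trailing else, so layers
// 18..58 keep the incoming type. Assumes no earlier branches exist.
QuantType pick_type_before(int i_layer, QuantType new_type) {
    if      (i_layer < 12) new_type = QuantType::Q3_K;
    else if (i_layer < 18) new_type = QuantType::IQ2_XXS;
    else if (i_layer > 58) new_type = QuantType::IQ2_XXS;
    return new_type;
}

// Same chain after the commit: the added else assigns IQ3_S to
// every layer not matched by an earlier condition (18..58 here).
QuantType pick_type_after(int i_layer, QuantType new_type) {
    if      (i_layer < 12) new_type = QuantType::Q3_K;
    else if (i_layer < 18) new_type = QuantType::IQ2_XXS;
    else if (i_layer > 58) new_type = QuantType::IQ2_XXS;
    else                   new_type = QuantType::IQ3_S;
    return new_type;
}
```

Under these assumptions, calling both functions with `i_layer = 30` returns the incoming type from `pick_type_before` and `IQ3_S` from `pick_type_after`, while layers below 18 and above 58 get the same result from both.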