Skip to content

Commit cb47421

Browse files
committed
feat: Auto-fill hparams.recurrent_layer_arr based on whether the model is recurrent
Branch: GraniteFour Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>
1 parent 442af2f commit cb47421

File tree

1 file changed

+4
-0
lines changed

1 file changed

+4
-0
lines changed

src/llama-model.cpp

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -469,6 +469,10 @@ void llama_model::load_hparams(llama_model_loader & ml) {
469469
std::fill(hparams.n_head_arr.begin(), hparams.n_head_arr.end(), 0);
470470
std::fill(hparams.n_head_kv_arr.begin(), hparams.n_head_kv_arr.end(), 0);
471471
std::fill(hparams.n_ff_arr.begin(), hparams.n_ff_arr.end(), 0);
472+
std::fill(
473+
hparams.recurrent_layer_arr.begin(),
474+
hparams.recurrent_layer_arr.end(),
475+
llm_arch_is_recurrent(ml.get_arch()));
472476

473477
std::fill(hparams.rope_sections.begin(), hparams.rope_sections.end(), 0);
474478

0 commit comments

Comments
 (0)