Commit 7fb0ed3

feat: Auto-fill hparams.recurrent_layer_arr based on whether the model is recurrent

Branch: GraniteFour

Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>
1 parent 2d31516 commit 7fb0ed3

1 file changed: 4 additions, 0 deletions

src/llama-model.cpp

Lines changed: 4 additions & 0 deletions
@@ -470,6 +470,10 @@ void llama_model::load_hparams(llama_model_loader & ml) {
     std::fill(hparams.n_head_arr.begin(), hparams.n_head_arr.end(), 0);
     std::fill(hparams.n_head_kv_arr.begin(), hparams.n_head_kv_arr.end(), 0);
     std::fill(hparams.n_ff_arr.begin(), hparams.n_ff_arr.end(), 0);
+    std::fill(
+        hparams.recurrent_layer_arr.begin(),
+        hparams.recurrent_layer_arr.end(),
+        llm_arch_is_recurrent(ml.get_arch()));
 
     std::fill(hparams.rope_sections.begin(), hparams.rope_sections.end(), 0);
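For readers unfamiliar with the pattern, here is a minimal standalone sketch of what the added std::fill call appears to do: every per-layer flag defaults to the architecture-wide answer. All names in it (arch_kind, hparams_sketch, arch_is_recurrent, fill_recurrent_defaults, mark_layer, MAX_LAYERS) are hypothetical stand-ins, not the llama.cpp definitions, and the element type of the array is assumed to be bool. The per-layer override at the end is an assumption about how a hybrid architecture might use the array; it is not part of this commit.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical stand-ins for illustration; not the real llama.cpp types.
enum class arch_kind { transformer, recurrent };

constexpr std::size_t MAX_LAYERS = 8; // illustrative size only

struct hparams_sketch {
    // Assumed layout: one flag per layer.
    std::array<bool, MAX_LAYERS> recurrent_layer_arr{};
};

// Hypothetical analogue of llm_arch_is_recurrent(): one answer per architecture.
bool arch_is_recurrent(arch_kind arch) {
    return arch == arch_kind::recurrent;
}

// Mirrors the added lines: default every layer's flag to the
// architecture-wide answer.
void fill_recurrent_defaults(hparams_sketch & hp, arch_kind arch) {
    std::fill(
        hp.recurrent_layer_arr.begin(),
        hp.recurrent_layer_arr.end(),
        arch_is_recurrent(arch));
}

// Assumption, not in the commit: a hybrid model could override
// individual layers after the defaults are in place.
void mark_layer(hparams_sketch & hp, std::size_t il, bool recurrent) {
    assert(il < MAX_LAYERS);
    hp.recurrent_layer_arr[il] = recurrent;
}
```

Under these assumptions, a fully recurrent architecture ends up with every flag set, a plain transformer with every flag cleared, and only a hybrid would need to touch individual entries afterwards.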