Add granite #1882

Open · BBC-Esq wants to merge 1 commit into master

Conversation

@BBC-Esq commented on Apr 12, 2025

Should add the really cool Granite models by IBM.

@BBC-Esq changed the title from "add granite" to "Add granite" on Apr 12, 2025
@gabe-l-hart left a comment

Thanks for putting together this PR for Granite! I'm one of the Granite team leads for open source integrations, so I've got a couple of notes inline. Also, in addition to pulling head_dim from config and applying the embedding_multiplier, there are three more scalars used in the GraniteForCausalLM architecture that differentiate it from llama (a rough sketch of where each one lands follows the list):

  • attention_multiplier: This should be applied as a scaling after the self attention block of each layer (here)
  • residual_multiplier: This should be applied when recombining with the residual after the self attention block (here) and the FFN (here) in each layer
  • logits_scaling: This should be applied to the output logits before returning them from the model forward pass (here)
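
A minimal, illustrative sketch of where these three scalars would land in a llama-like forward pass, based only on the descriptions above. The block and attribute names (self_attn, ffn, norm1, norm2, final_norm, lm_head) are placeholders, not CTranslate2 or transformers internals, and logits_scaling is shown as a division following the transformers Granite implementation, which is worth double-checking against the reference:

    def granite_decoder_layer(hidden_states, layer, config):
        # attention_multiplier: scales the output of the self-attention block.
        residual = hidden_states
        attn_out = layer.self_attn(layer.norm1(hidden_states)) * config.attention_multiplier
        # residual_multiplier: scales the block output when recombining with the residual.
        hidden_states = residual + attn_out * config.residual_multiplier

        # The FFN output is recombined with its residual the same way.
        residual = hidden_states
        ffn_out = layer.ffn(layer.norm2(hidden_states))
        hidden_states = residual + ffn_out * config.residual_multiplier
        return hidden_states

    def granite_logits(hidden_states, final_norm, lm_head, config):
        # logits_scaling: applied to the output logits before they are returned.
        logits = lm_head(final_norm(hidden_states))
        return logits / config.logits_scaling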

rotary_interleave=False,
rotary_base=getattr(model.config, "rope_theta", 5000000.0),
num_heads_kv=num_heads_kv,
head_dim=model.config.hidden_size // model.config.num_attention_heads,

In Granite, the head_dim can be pulled directly from config with the fallback being this calculation (here)
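
For example, a hedged sketch of the suggested change, assuming the Hugging Face Granite config exposes a head_dim field and keeping the current calculation as the fallback:

    # Prefer head_dim from the config when it is set, otherwise fall back to the
    # derived value already used in this diff.
    head_dim = getattr(model.config, "head_dim", None)
    if head_dim is None:
        head_dim = model.config.hidden_size // model.config.num_attention_heads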

self.set_linear(spec.decoder.projection, model.lm_head)

if hasattr(model.config, "embedding_multiplier") and model.config.embedding_multiplier:
spec.decoder.embeddings.multiply_by_sqrt_depth = model.config.embedding_multiplier

Without digging into the repo further, I can't tell exactly whether this is the right place for embedding_multiplier, but it looks right. It should simply be applied as a scaling after the embeddings are computed (here).
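
For reference, the runtime behaviour being described is roughly the following (an illustrative sketch with placeholder names, not the CTranslate2 internals; whether multiply_by_sqrt_depth maps to an arbitrary scale factor in the spec is the part that would need verifying):

    def embed_with_multiplier(input_ids, embed_tokens, config):
        # embedding_multiplier scales the token embeddings right after lookup,
        # before the hidden states enter the first decoder layer.
        hidden_states = embed_tokens(input_ids)
        multiplier = getattr(config, "embedding_multiplier", None)
        if multiplier:
            hidden_states = hidden_states * multiplier
        return hidden_states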
