Sharded weights type error #2296

laxmareddyp · 2025-06-11T21:52:04Z

Description of the change

Fix: Handle lists in weight_map for sharded weights

The _get_sharded_filenames method in preset_utils.py was raising a
TypeError when weight_map.values() contained lists. This occurred because
lists are unhashable and cannot be added directly to a set.

Reference

Colab Notebook

Checklist

I have added all the necessary unit tests for my change.
I have verified that my change does not break existing code and works with all backends (TensorFlow, JAX, and PyTorch).
My PR is based on the latest changes of the main branch (if unsure, rebase the code).
I have followed the Keras Hub Model contribution guidelines in making these changes.
I have followed the Keras Hub API design guidelines in making these changes.
I have signed the Contributor License Agreement.

james77777778 · 2025-06-12T00:55:06Z

@laxmareddyp could we add a test for this to prevent future breakage?

Here is an example of testing sharded weights:
https://github.com/keras-team/keras-hub/blob/master/keras_hub/src/utils/preset_utils_test.py#L22

sachinprasadhs · 2025-06-13T20:42:25Z

keras_hub/src/utils/preset_utils_test.py

+        init_kwargs = {
+            "vocabulary_size": 1024,
+            "num_layers": 12,
+            "num_query_heads": 8,
+            "num_key_value_heads": 4,
+            "hidden_dim": 32,
+            "intermediate_dim": 64,
+            "head_dim": 4,
+            "sliding_window_size": 5,
+            "attention_logit_soft_cap": 50,
+            "final_logit_soft_cap": 30,
+            "layer_norm_epsilon": 1e-6,
+            "query_head_dim_normalize": False,
+            "use_post_ffw_norm": True,
+            "use_post_attention_norm": True,
+            "use_sliding_window_attention": True,
+        }
+        backbone = GemmaBackbone(**init_kwargs)  # ~422KB


Can you move this to setUp and use it for all 3 test cases which we are doing here, so that our test setup will be lot more cleaner.

…l three relevant test cases to use the shared setup.

sachinprasadhs

LGTM

fix-sharded-weights-typeerror

ad06f07

laxmareddyp changed the title ~~fix-sharded-weights-typeerror~~ fix sharded weights typeerror Jun 11, 2025

laxmareddyp changed the title ~~fix sharded weights typeerror~~ Sharded weights type error Jun 11, 2025

laxmareddyp requested a review from sachinprasadhs June 11, 2025 21:59

sachinprasadhs added the kokoro:force-run Runs Tests on GPU label Jun 11, 2025

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Jun 11, 2025

laxmareddyp added 2 commits June 12, 2025 10:42

Merge branch 'keras-team:master' into flatten_weight_map_values

7440d2f

Add unit test case

51843dc

laxmareddyp added the kokoro:force-run Runs Tests on GPU label Jun 13, 2025

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Jun 13, 2025

sachinprasadhs reviewed Jun 13, 2025

View reviewed changes

Moved common initialization code to setUp for cleaner and Updated al…

7a74ee1

…l three relevant test cases to use the shared setup.

laxmareddyp added the kokoro:force-run Runs Tests on GPU label Jun 13, 2025

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Jun 13, 2025

sachinprasadhs approved these changes Jun 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Sharded weights type error #2296

Sharded weights type error #2296

laxmareddyp commented Jun 11, 2025 •

edited

Loading

Uh oh!

james77777778 commented Jun 12, 2025

Uh oh!

sachinprasadhs Jun 13, 2025

Uh oh!

sachinprasadhs left a comment

Uh oh!

Uh oh!

Sharded weights type error #2296

Are you sure you want to change the base?

Sharded weights type error #2296

Conversation

laxmareddyp commented Jun 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description of the change

Reference

Colab Notebook

Checklist

Uh oh!

james77777778 commented Jun 12, 2025

Uh oh!

sachinprasadhs Jun 13, 2025

Choose a reason for hiding this comment

Uh oh!

sachinprasadhs left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

laxmareddyp commented Jun 11, 2025 •

edited

Loading