Deprecate `iter_named_leaf_modules` and `iter_named_quantizable_modules` #1628
Conversation
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review. Note: this is required to complete the testing suite, so please only add the label once the PR is code complete and local testing has been performed.
Summary of Changes
Hello @kylesayrs, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request primarily refactors the codebase by removing custom module iteration utilities and replacing them with standard PyTorch `Module` iteration methods. This change aims to simplify module traversal logic and align with PyTorch's native API. Additionally, the `Observer` base class has been updated to inherit from `compressed_tensors.InternalModule`, and minor code formatting and logic improvements have been applied.
Highlights
- Code Simplification: Replaced the custom module iteration functions (`iter_named_leaf_modules`, `iter_named_quantizable_modules`) with standard PyTorch `model.modules()` or `module.named_modules()` across the codebase. This streamlines module traversal and aligns with the native PyTorch API (see the sketch after this list).
- Base Class Update: The `Observer` class in `src/llmcompressor/observers/base.py` now inherits from `compressed_tensors.InternalModule` instead of `torch.nn.Module`, indicating a shift towards a more specialized base class within the `compressed_tensors` library.
- Sparsity Check Logic Refinement: Improved the `is_sparse24_bitmask_supported` function in `src/llmcompressor/transformers/compression/sparsity_metadata_config.py` to explicitly skip already quantized modules, preventing redundant checks and potentially improving efficiency.
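As a rough illustration of the first and third highlights, here is a minimal, hypothetical sketch of the replacement pattern: PyTorch's built-in `named_modules()` plus caller-side filtering stands in for the deprecated iterators. The `find_quantizable` helper and the `quantization_scheme` attribute check are illustrative assumptions, not the repository's actual code.

```python
from torch import nn

def find_quantizable(model: nn.Module):
    """Illustrative stand-in for the deprecated iterators: walk every
    submodule with PyTorch's built-in named_modules() and let the
    caller decide what counts as quantizable."""
    for name, module in model.named_modules():
        # skip modules that already carry a quantization scheme, mirroring
        # the refined sparsity check (attribute name is an assumption)
        if getattr(module, "quantization_scheme", None) is not None:
            continue
        # in this sketch, only Linear layers carry weights to quantize
        if isinstance(module, nn.Linear):
            yield name, module

model = nn.Sequential(nn.Linear(4, 4), nn.ReLU(), nn.Linear(4, 2))
print([name for name, _ in find_quantizable(model)])  # ['0', '2']
```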
Using Gemini Code Assist
The full guide for Gemini Code Assist can be found on our documentation page; here are some quick tips.
Invoking Gemini
You can request assistance from Gemini at any point in your pull request by creating an issue comment (i.e. a comment on the pull request page) using either `/gemini <command>` or `@gemini-code-assist <command>`. Below is a summary of the supported commands.
| Feature | Command | Description |
|---|---|---|
| Code Review | `/gemini review` | Performs a code review for the current pull request in its current state. |
| Pull Request Summary | `/gemini summary` | Provides a summary of the current pull request in its current state. |
| Comment | `@gemini-code-assist` | Responds in comments when explicitly tagged, both in issue comments and review comments. |
| Help | `/gemini help` | Displays a list of available commands. |
Customization
To customize the Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by adding files to a `.gemini/` folder in the base of the repository. Detailed instructions can be found here.
Limitations & Feedback
Gemini Code Assist is currently in preview and may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments to provide feedback.
You can also get AI-powered code generation, chat, and code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.
Footnotes
1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double-check it and use code with caution.
Code Review
The changes replace custom module iterators with standard PyTorch iteration. However, the replacements introduce bugs by failing to replicate the original filtering logic, particularly regarding leaf modules and specific quantizable modules. Critical issues in `sparsity_metadata_config.py` and the `kv_cache` tests need to be addressed.
Review thread on `src/llmcompressor/transformers/compression/sparsity_metadata_config.py` (outdated; resolved)
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
This reverts commit 62e93a7.
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
🚀
Discussed with @kylesayrs that mapping all `iter_named_leaf_modules` uses, regardless of whether `include_attn`/`include_children` are False or True, to the torch built-in `.named_modules()` method should not affect overall behavior.
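A toy check of that claim (a sketch, not the project's code): `named_modules()` yields a superset of any leaf-only iteration, so callers that filter by module type see the same modules either way.

```python
from torch import nn

model = nn.Sequential(nn.Linear(8, 8), nn.Sequential(nn.Linear(8, 4), nn.ReLU()))

# named_modules() yields containers *and* leaves...
all_named = dict(model.named_modules())

# ...so a leaf-only view is recoverable by filtering, which is why swapping
# a leaf-only iterator for named_modules() preserves behavior for callers
# that already filter by module type.
leaves = {n: m for n, m in all_named.items() if len(list(m.children())) == 0}
print(sorted(leaves))  # ['0', '1.0', '1.1']
```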
Purpose
- Support `TransformBase` modules

Prerequisites
- neuralmagic/compressed-tensors#381 (deprecate `iter_named_leaf_modules` and `iter_named_quantizable_modules`)

Changes
- Remove uses of `iter_named_leaf_modules` and `iter_named_quantizable_modules`
- Make `Observer` inherit from `InternalModule` (sketched below)
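For reference, a minimal sketch of that inheritance change, assuming `compressed_tensors` exposes `InternalModule` as named in the summary above; the body shown is illustrative, not the actual `Observer` implementation.

```python
from compressed_tensors import InternalModule  # import path per the summary above

class Observer(InternalModule):  # previously inherited from torch.nn.Module
    """Illustrative skeleton only: InternalModule presumably lets standard
    traversal utilities recognize and skip helper modules like observers."""

    def __init__(self):
        super().__init__()
```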
Testing