Update MXQuant doc #2309

Kaihui-intel · 2025-10-11T03:17:48Z

User description

Type of Change

documentation

Description

Transfer the MXQuant documentation to the AutoRound quantization API.

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

PR Type

Documentation, Enhancement

Description

Updated documentation for AutoRound Quantization API
Added example using Hugging Face models
Included code snippet for model quantization and inference

Diagram Walkthrough

flowchart LR
  A["MXQuantConfig"] -- "updated to" --> B["AutoRoundConfig"]
  B -- "added example" --> C["Hugging Face models"]
  C -- "included code" --> D["Model quantization and inference"]

File Walkthrough

Relevant files

Documentation

docs/source/3x/PT_MXQuant.md (+30/-6): `Updated to AutoRound Quantization API`
Updated API usage from MXQuant to AutoRound
Added example with Hugging Face models
Included code for quantization and inference

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>

PRAgent4INC · 2025-10-11T03:18:20Z

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 2 🔵🔵⚪⚪⚪
🧪 No relevant tests
🔒 No security concerns identified
⚡ Recommended focus areas for review

Typo
There is a typo in the comment `# quantize the model and save to output_dir`. It should be `# quantize the model and save to output_dir`.
# quantize the model and save to output_dir

Example Link
The link provided in the examples section points to a non-existent path. Ensure the path is correct and the example exists.
- PyTorch [huggingface models](/examples/pytorch/multimodal-modeling/quantization/auto_round/llama4)

PRAgent4INC · 2025-10-11T03:18:33Z

PR Code Suggestions ✨

Explore these optional code suggestions:

Category: General
Impact: Medium
Suggestion: Update API name consistency
Correct the API name to match the description.
docs/source/3x/PT_MXQuant.md [86]
-To get a model quantized with Microscaling Data Types, users can use the Microscaling Quantization API as follows.
+To get a model quantized with AutoRound Data Types, users can use the AutoRound Quantization API as follows.
Suggestion importance[1-10]: 7
Why: The suggestion correctly updates the API name to match the description, improving clarity and consistency. However, it does not address a critical issue and offers a minor improvement.


thuang6 · 2025-10-11T03:41:00Z

The sentence "It adapts a granularity falling between per-channel and per-tensor to balance accuracy and memory consumption." in the introduction section does not look right: a block size of 32 is normally smaller than the channel dimension. @mengniwang95, should we remove this sentence?


thuang6 · 2025-10-11T05:31:48Z

Also, the formula "The exponent (exp) is equal to torch.floor(torch.log2(amax))" in the introduction section is not right. According to the recipe document, the formula is clamp(floor(log2(amax)) - maxExp, -127, 127), where maxExp is the exponent of the largest power of two representable in the element data type, e.g. maxExp is 8 for FP8 E4M3 and 2 for FP4 E2M1. @mengniwang95, please double-check whether this is the default option used in auto-round.


mengniwang95 · 2025-10-11T06:50:48Z

> The sentence "It adapts a granularity falling between per-channel and per-tensor to balance accuracy and memory consumption." in the introduction section does not look right: a block size of 32 is normally smaller than the channel dimension. @mengniwang95, should we remove this sentence?

Yes, you are right.
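
The granularity point can be checked by counting scales. The following is a minimal sketch, assuming one scale per quantization group and a hypothetical 4096 x 4096 weight; `scale_counts` is an illustrative helper, not part of neural-compressor or auto-round.

```python
def scale_counts(rows: int, cols: int, block_size: int = 32) -> dict:
    """Count how many scales each quantization granularity needs.

    Assumes a 2-D weight of shape (rows, cols), one scale per group,
    and blocks taken along the column (input) dimension.
    """
    if cols % block_size != 0:
        raise ValueError("cols must be a multiple of block_size in this sketch")
    return {
        "per_tensor": 1,                              # one scale for the whole weight
        "per_channel": rows,                          # one scale per output channel
        "per_block": rows * (cols // block_size),     # one scale per block of 32 elements
    }


# Hypothetical weight shape, chosen only for illustration.
counts = scale_counts(4096, 4096)
print(counts)
```

With a channel dimension larger than 32, the per-block count exceeds the per-channel count, so block granularity is finer than per-channel rather than lying between per-channel and per-tensor.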

mengniwang95 · 2025-10-11T06:51:11Z

> Also, the formula "The exponent (exp) is equal to torch.floor(torch.log2(amax))" in the introduction section is not right. According to the recipe document, the formula is clamp(floor(log2(amax)) - maxExp, -127, 127), where maxExp is the exponent of the largest power of two representable in the element data type, e.g. maxExp is 8 for FP8 E4M3 and 2 for FP4 E2M1. @mengniwang95, please double-check whether this is the default option used in auto-round.

clamp(floor(log2(amax)) - maxExp, -127, 127) is used in autoround
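
The formula quoted above can be sketched in plain Python. This is an illustration of the expression from the thread only, not the auto-round implementation: the function name, the dtype keys, and the handling of `amax <= 0` are assumptions, and the maxExp values are the two given in the review comment.

```python
import math

# maxExp values as quoted in the review thread.
MAX_EXP = {"fp8_e4m3": 8, "fp4_e2m1": 2}


def shared_exponent(amax: float, elem_dtype: str) -> int:
    """Compute clamp(floor(log2(amax)) - maxExp, -127, 127)."""
    if amax <= 0:
        # Assumption: log2 is undefined here, so fall back to the lower clamp bound.
        return -127
    exp = math.floor(math.log2(amax)) - MAX_EXP[elem_dtype]
    return max(-127, min(127, exp))


print(shared_exponent(448.0, "fp8_e4m3"))  # floor(log2(448)) = 8, minus 8
print(shared_exponent(6.0, "fp4_e2m1"))    # floor(log2(6)) = 2, minus 2
print(shared_exponent(1.0, "fp8_e4m3"))    # floor(log2(1)) = 0, minus 8
```

Compared with the original `torch.floor(torch.log2(amax))`, the difference is the subtraction of maxExp and the clamp to the range [-127, 127].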


thuang6 · 2025-10-11T08:54:36Z

> > Also, the formula "The exponent (exp) is equal to torch.floor(torch.log2(amax))" in the introduction section is not right. According to the recipe document, the formula is clamp(floor(log2(amax)) - maxExp, -127, 127), where maxExp is the exponent of the largest power of two representable in the element data type, e.g. maxExp is 8 for FP8 E4M3 and 2 for FP4 E2M1. @mengniwang95, please double-check whether this is the default option used in auto-round.
>
> clamp(floor(log2(amax)) - maxExp, -127, 127) is used in autoround

@Kaihui-intel, please help update the formula as well.

Kaihui-intel · 2025-10-11T08:59:26Z

> > > Also, the formula "The exponent (exp) is equal to torch.floor(torch.log2(amax))" in the introduction section is not right. According to the recipe document, the formula is clamp(floor(log2(amax)) - maxExp, -127, 127), where maxExp is the exponent of the largest power of two representable in the element data type, e.g. maxExp is 8 for FP8 E4M3 and 2 for FP4 E2M1. @mengniwang95, please double-check whether this is the default option used in auto-round.
> >
> > clamp(floor(log2(amax)) - maxExp, -127, 127) is used in autoround
>
> @Kaihui-intel, please help update the formula as well.

2d9c95d

update MXQuant doc

4bad514

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>

Kaihui-intel requested a review from mengniwang95 October 11, 2025 03:17

Kaihui-intel assigned thuang6 Oct 11, 2025

Kaihui-intel added this to the 3.6 milestone Oct 11, 2025

PRAgent4INC added the Review effort 2/5 label Oct 11, 2025

Kaihui-intel unassigned thuang6 Oct 11, 2025

Kaihui-intel requested a review from thuang6 October 11, 2025 03:19

[pre-commit.ci] auto fixes from pre-commit.com hooks

6d6f14b

for more information, see https://pre-commit.ci

Kaihui-intel added 2 commits October 11, 2025 12:43

rename example link

b317f05

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>

Merge branch 'kaihui/mx_doc' of https://github.com/intel/neural-compr…

6b19312

…essor into kaihui/mx_doc

thuang6 reviewed Oct 11, 2025


docs/source/3x/PT_MXQuant.md (outdated)

remove wrong sentence

b1ea183

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>

update formular

2d9c95d

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>

thuang6 approved these changes Oct 11, 2025


Kaihui-intel merged commit e36230e into master Oct 13, 2025
11 checks passed

Kaihui-intel deleted the kaihui/mx_doc branch October 13, 2025 01:10
