Skip to content

Support Autotune FP16 Mix-precision on torch 3.0 new API #1793

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 18 commits into from
May 17, 2024
Merged

Conversation

zehao-intel
Copy link
Contributor

Type of Change

feature

Description

support fp16 by using torch.amp

How has this PR been tested?

PreCI

Dependency Change?

No

Signed-off-by: zehao-intel <zehao.huang@intel.com>
Signed-off-by: zehao-intel <zehao.huang@intel.com>
Signed-off-by: zehao-intel <zehao.huang@intel.com>
Copy link

github-actions bot commented May 15, 2024

⛈️ Required checks status: Has failure 🔴

Warning
If you do not have the access to re-run the Probot, please contact XuehaoSun for help. If you push a new commit, all of the workflow will be re-triggered.

Groups summary

🟢 Code Scan Tests workflow
Check ID Status Error details
Code-Scan success
Code-Scan (Bandit Code Scan Bandit) success
Code-Scan (DocStyle Code Scan DocStyle) success
Code-Scan (Pylint Code Scan Pylint) success

These checks are required after the changes to neural_compressor/common/utils/constants.py, neural_compressor/torch/algorithms/mix_precision/__init__.py, neural_compressor/torch/algorithms/mix_precision/half_precision_convert.py, neural_compressor/torch/algorithms/mix_precision/module_wrappers.py, neural_compressor/torch/quantization/__init__.py, neural_compressor/torch/quantization/algorithm_entry.py, neural_compressor/torch/quantization/config.py.

🔴 Model Tests 3x workflow
Check ID Status Error details
Model-Test-3x failure
Model-Test-3x (Generate Report GenerateReport) failure download
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4) success
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_bnb) success
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_ggml) success

These checks are required after the changes to neural_compressor/common/utils/constants.py, neural_compressor/torch/algorithms/mix_precision/__init__.py, neural_compressor/torch/algorithms/mix_precision/half_precision_convert.py, neural_compressor/torch/algorithms/mix_precision/module_wrappers.py, neural_compressor/torch/quantization/__init__.py, neural_compressor/torch/quantization/algorithm_entry.py, neural_compressor/torch/quantization/config.py.

🟢 Unit Tests 3x-TensorFlow workflow
Check ID Status Error details
UT-3x-TensorFlow success
UT-3x-TensorFlow (Coverage Compare CollectDatafiles) success
UT-3x-TensorFlow (Unit Test 3x TensorFlow Unit Test 3x TensorFlow) success
UT-3x-TensorFlow (Unit Test 3x TensorFlow baseline Unit Test 3x TensorFlow baseline) success

These checks are required after the changes to neural_compressor/common/utils/constants.py.

🟢 Unit Tests 3x-PyTorch workflow
Check ID Status Error details
UT-3x-Torch success
UT-3x-Torch (Coverage Compare CollectDatafiles) success
UT-3x-Torch (Unit Test 3x Torch Unit Test 3x Torch) success
UT-3x-Torch (Unit Test 3x Torch baseline Unit Test 3x Torch baseline) success

These checks are required after the changes to neural_compressor/common/utils/constants.py, neural_compressor/torch/algorithms/mix_precision/__init__.py, neural_compressor/torch/algorithms/mix_precision/half_precision_convert.py, neural_compressor/torch/algorithms/mix_precision/module_wrappers.py, neural_compressor/torch/quantization/__init__.py, neural_compressor/torch/quantization/algorithm_entry.py, neural_compressor/torch/quantization/config.py, test/3x/torch/test_autotune.py.

🟢 Unit Tests 3x-ONNXRT workflow
Check ID Status Error details
UT-3x-ONNXRT success
UT-3x-ONNXRT (Coverage Compare CollectDatafiles) success
UT-3x-ONNXRT (Unit Test 3x ONNXRT Unit Test 3x ONNXRT) success
UT-3x-ONNXRT (Unit Test 3x ONNXRT baseline Unit Test 3x ONNXRT baseline) success

These checks are required after the changes to neural_compressor/common/utils/constants.py.


Thank you for your contribution! 💜

Note
This comment is automatically generated and will be updates every 180 seconds within the next 6 hours. If you have any other questions, contact chensuyue or XuehaoSun for help.

pre-commit-ci bot and others added 4 commits May 15, 2024 06:38
Signed-off-by: zehao-intel <zehao.huang@intel.com>
Signed-off-by: zehao-intel <zehao.huang@intel.com>
@zehao-intel zehao-intel added python Pull requests that update Python code INC3.X labels May 15, 2024
@zehao-intel zehao-intel requested a review from yiliu30 May 15, 2024 07:30
Signed-off-by: zehao-intel <zehao.huang@intel.com>
zehao-intel and others added 2 commits May 16, 2024 13:21
Signed-off-by: zehao-intel <zehao.huang@intel.com>
@yiliu30 yiliu30 removed the python Pull requests that update Python code label May 16, 2024
Signed-off-by: zehao-intel <zehao.huang@intel.com>
@yiliu30 yiliu30 added the PyTorch Related to PyTorch F/W label May 16, 2024
@yiliu30 yiliu30 requested review from xin3he and ftian1 May 16, 2024 06:56
zehao-intel and others added 6 commits May 17, 2024 11:49
Signed-off-by: zehao-intel <zehao.huang@intel.com>
Signed-off-by: zehao-intel <zehao.huang@intel.com>
Signed-off-by: zehao-intel <zehao.huang@intel.com>
@chensuyue chensuyue merged commit 2e1cdc5 into master May 17, 2024
35 of 37 checks passed
@chensuyue chensuyue deleted the zehao/fp16 branch May 17, 2024 12:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
INC3.X PyTorch Related to PyTorch F/W
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants