Add attribute output_dtype to QuantizeLinear #5956
Conversation
@gramalingam following up on our discussion in the Operators SIG meeting yesterday, here are the changes for #5943.
Codecov Report

```
@@           Coverage Diff           @@
##             main    #5956   +/-   ##
=======================================
  Coverage   56.79%   56.79%
=======================================
  Files         506      506
  Lines       30308    30349     +41
  Branches     4580     4589      +9
=======================================
+ Hits        17214    17238     +24
- Misses      12267    12283     +16
- Partials      827      828      +1
```
```cpp
ONNX_ASSERTM(
    false,
    "Attribute output_dtype is not supported for Opset Version %d, supply a zero-point tensor instead",
    target_version().version());
```
Check notice
Code scanning / CodeQL
Too many arguments to formatting function
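The version-converter error above reflects the equivalence the new attribute relies on: for symmetric quantization, `output_dtype` and an explicit all-zeros zero-point tensor describe the same computation, so older opsets need the zero-point spelled out. A minimal NumPy sketch of that equivalence (the `quantize_linear` helper here is illustrative, not the ONNX reference implementation):

```python
import numpy as np

def quantize_linear(x, scale, zero_point=None, output_dtype=np.int8):
    # Symmetric case: no zero-point input; the target type comes from
    # output_dtype and the zero point is implicitly all zeros.
    info = np.iinfo(output_dtype)
    zp = np.zeros_like(scale, dtype=output_dtype) if zero_point is None else zero_point
    y = np.round(x / scale) + zp.astype(np.int64)
    return np.clip(y, info.min, info.max).astype(output_dtype)

x = np.array([1.0, -2.5, 3.0], dtype=np.float32)
scale = np.float32(0.5)

# output_dtype form and explicit all-zeros zero-point form agree.
a = quantize_linear(x, scale, output_dtype=np.int8)
b = quantize_linear(x, scale, zero_point=np.zeros((), dtype=np.int8))
```

For asymmetric quantization the two forms are not interchangeable, which is why the converter asks for an explicit zero-point tensor when targeting opsets that lack the attribute.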
LGTM, thanks for the quick PR, greatly appreciate it!
The purpose of this change is to allow setting the quantized type without providing the zero-point tensor. This reduces model size, most importantly for block quantization where the zero-point tensor dimensions are large. It also simplifies the creation of symmetric quantization nodes. Signed-off-by: Gal Hubara Agam <ghubaraagam@nvidia.com>
Force-pushed from 1e26f28 to 05b222a
The purpose of this change is to allow setting the quantized type without providing the zero-point tensor for symmetric quantization.
This reduces model size, most importantly for block quantization where the zero-point tensor dimensions are large, and reduces backend runtime.
This implements issue #5943.