Add a pass to insert QDQ nodes before residual connection #59009

leo0519 · 2023-11-14T17:30:14Z

PR types

Bug fixes

PR changes

Others

Description

This pull request adds a new pass, AddQuantDequantForResidual, to quantization_pass.py.
With this pass, quant_aware can insert QDQ nodes on residual connections so that INT8 inference runs entirely in low precision. Without them, some kernels may still run in floating-point precision and produce floating-point intermediate tensors.
This PR is an example for issue #58989, "The model quantized by QAT API should have QDQ nodes before skip connection."
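
To make the idea concrete, here is a minimal toy sketch of what such a pass does to a graph. It is not the code from quantization_pass.py: the `Op` class, the `insert_qdq_before_residual` function, and the `quantize_dequantize` op name are hypothetical, and the sketch assumes a residual connection appears as an `elementwise_add` op whose inputs are not yet quantized.

```python
from dataclasses import dataclass


@dataclass
class Op:
    """Toy graph op (hypothetical): reads `inputs`, writes `output`."""
    type: str
    inputs: list
    output: str


def insert_qdq_before_residual(ops):
    """Return a new op list with a QDQ op in front of each residual-add input.

    Assumption: a residual connection is an `elementwise_add` op.
    Inputs already produced by a QDQ op are left untouched.
    """
    producer = {op.output: op for op in ops}
    qdq_for = {}  # tensor name -> name of its QDQ output, to avoid duplicates
    result = []
    for op in ops:
        if op.type != "elementwise_add":
            result.append(op)
            continue
        new_inputs = []
        for name in op.inputs:
            src = producer.get(name)
            if src is not None and src.type == "quantize_dequantize":
                new_inputs.append(name)  # already quantized
                continue
            if name not in qdq_for:
                qdq_for[name] = f"{name}.qdq"
                result.append(Op("quantize_dequantize", [name], qdq_for[name]))
            new_inputs.append(qdq_for[name])
        result.append(Op(op.type, new_inputs, op.output))
    return result


# Toy residual block: y = conv2d(x) + x
graph = [
    Op("conv2d", ["x"], "conv_out"),
    Op("elementwise_add", ["conv_out", "x"], "y"),
]
rewritten = insert_qdq_before_residual(graph)
for op in rewritten:
    print(op.type, op.inputs, "->", op.output)
```

After the rewrite, both inputs of the add (the conv output and the skip branch) pass through a QDQ op, which is what lets an INT8 backend keep the whole block in low precision. Running the function a second time on its own output changes nothing, since the inputs are then already produced by QDQ ops.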

leo0519 · 2023-11-23T03:11:49Z

This PR currently inserts QDQ nodes before skip connections only through the distributed optimizer, but this should be implemented in the QAT API (e.g. quant_aware or quantization_pass).

I will provide a complete version for this.

Marked as draft.


leo0519 added the NVIDIA label Nov 14, 2023

leo0519 mentioned this pull request Nov 14, 2023

The model quantized by QAT API should have QDQ nodes before skip connection. #58989

Closed

onecatcn assigned wanghaoshuang Nov 15, 2023

leo0519 marked this pull request as ready for review November 16, 2023 02:51

wanghaoshuang assigned yghstill Nov 16, 2023

yghstill previously approved these changes Nov 23, 2023


leo0519 marked this pull request as draft November 23, 2023 03:12

Add a pass to insert QDQ nodes before skip connection

9519267

leo0519 dismissed yghstill’s stale review via 9519267 November 29, 2023 01:44

leo0519 force-pushed the qdq-skip branch from 98cb164 to 9519267 November 29, 2023 01:44

leo0519 marked this pull request as ready for review November 29, 2023 01:45

leo0519 changed the title ~~Add QDQ into skip-connection in qat meta optimizer~~ Add a pass to insert QDQ nodes before residual connection Nov 29, 2023

AdamzNV requested a review from yghstill November 30, 2023 02:38

wanghaoshuang approved these changes Dec 4, 2023


wanghaoshuang merged commit de9d407 into PaddlePaddle:develop Dec 4, 2023

SigureMo pushed a commit to gouzil/Paddle that referenced this pull request Dec 5, 2023

Add a pass to insert QDQ nodes before skip connection (PaddlePaddle#5…

1104b4d

…9009)
