[quant][pt2e][bc-breaking] Set fold_quantize to True in convert_pt2e #118701
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/118701
Note: Links to docs will display an error until the docs builds have been completed.
❌ 1 New Failure as of commit df455cb with merge base 46ef735.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This pull request was exported from Phabricator. Differential Revision: D53247301
Summary: X-link: pytorch/pytorch#118701. This is a follow-up to pytorch/pytorch#118605 to remove the `fold_quantize` flag from `convert_pt2e`. Differential Revision: D53247301
Force-pushed from ed5f7ed to ebf1404.
This pull request was exported from Phabricator. Differential Revision: D53247301
Summary: X-link: pytorch/executorch#1766. This is a follow-up to pytorch#118605 to remove the `fold_quantize` flag from `convert_pt2e`. Test Plan: CI. Reviewed By: andrewor14. Differential Revision: D53247301
Summary: X-link: pytorch/pytorch#118701. This is a follow-up to pytorch/pytorch#118605 to remove the `fold_quantize` flag from `convert_pt2e`. Reviewed By: andrewor14. Differential Revision: D53247301
Force-pushed from ebf1404 to df455cb.
This pull request was exported from Phabricator. Differential Revision: D53247301
Summary: Pull Request resolved: #1766. X-link: pytorch/pytorch#118701. This is a follow-up to pytorch/pytorch#118605 to set the `fold_quantize` flag to True in `convert_pt2e`. Reviewed By: andrewor14, digantdesai. Differential Revision: D53247301. fbshipit-source-id: 5b2dbbc76487a8779f30c483b5ff4f054ba1ae8c
@pytorchbot merge -f 'Landed internally' (Initiating merge automatically since Phabricator Diff has merged, using force because this PR might not pass merge_rules.json but landed internally)
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
@pytorchbot revert -m="Diff reverted internally" -c="ghfirst" This Pull Request has been reverted by a revert inside Meta. To re-land this change, please open another pull request, assign the same reviewers, fix the CI failures that caused the revert, and make sure that the failing CI runs on the PR by applying the proper ciflow label (e.g., ciflow/trunk).
@pytorchbot successfully started a revert job. Check the current status here.
This reverts commit 482d952. Reverted #118701 on behalf of https://github.com/facebook-github-bot because the diff was reverted internally (see comment on #118701).
@jerryzh168 your PR has been successfully reverted.
Summary: This is a follow-up to #118605 to set the `fold_quantize` flag to True in `convert_pt2e`.

Test Plan: CI

Differential Revision: D53247301

BC-Breaking Note: the `fold_quantize` flag of `convert_pt2e` now defaults to True, so the quantize op on weights is folded by default and users will see a model size reduction by default after pt2e quantization.

2.2
```
folded_model = convert_pt2e(model, fold_quantize=True)
non_folded_model = convert_pt2e(model)
```

2.3
```
folded_model = convert_pt2e(model)
non_folded_model = convert_pt2e(model, fold_quantize=False)
```

Pull Request resolved: #118701
Approved by: https://github.com/andrewor14, https://github.com/leslie-fang-intel
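To make the effect of folding concrete, here is a minimal sketch (not part of this PR) of inspecting a model returned by `convert_pt2e`: with folding enabled, weights are stored already quantized, so only dequantize ops should remain on the weight branches. The helper name and the substring match on op names are illustrative assumptions.

```python
import torch

def count_quantize_ops(m: torch.fx.GraphModule) -> int:
    # Hypothetical helper: count call_function nodes whose target is a
    # quantize op (e.g. quantized_decomposed.quantize_per_tensor); with
    # fold_quantize=True these should no longer appear for weights.
    return sum(
        1
        for n in m.graph.nodes
        if n.op == "call_function"
        and "quantize_per" in str(n.target)
        and "dequantize" not in str(n.target)
    )
```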
Summary:
This is a follow-up to #118605 to set the `fold_quantize` flag to True in `convert_pt2e`.

Test Plan: CI

Differential Revision: D53247301

BC-Breaking Note:
The `fold_quantize` flag of `convert_pt2e` is now set to True by default, so the quantize op on weights is folded by default and users will see a model size reduction by default after pt2e quantization. The 2.2 and 2.3 call patterns are shown in the snippets above.
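For readers migrating, here is a minimal end-to-end sketch of the pt2e flow this default affects, assuming the XNNPACKQuantizer setup from the pt2e tutorials; the toy model, input shapes, and calibration step are placeholders, not taken from this PR.

```python
import torch
from torch._export import capture_pre_autograd_graph
from torch.ao.quantization.quantize_pt2e import prepare_pt2e, convert_pt2e
from torch.ao.quantization.quantizer.xnnpack_quantizer import (
    XNNPACKQuantizer,
    get_symmetric_quantization_config,
)

model = torch.nn.Sequential(torch.nn.Linear(16, 16)).eval()  # placeholder model
example_inputs = (torch.randn(1, 16),)

# Export to a pre-autograd ATen graph.
m = capture_pre_autograd_graph(model, example_inputs)

# Annotate with a quantizer and insert observers.
quantizer = XNNPACKQuantizer().set_global(get_symmetric_quantization_config())
m = prepare_pt2e(m, quantizer)

# Calibrate with representative data.
m(*example_inputs)

# As of 2.3, quantize ops on weights are folded by default (fold_quantize=True);
# pass fold_quantize=False to keep the 2.2 default behavior.
m = convert_pt2e(m)
```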
cc @ezyang @gchanan @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler