Use symmetric weights for convs and int8 in the default quantizer #8344

Merged
facebook-github-bot merged 1 commit into main from export-D69405797 on Feb 12, 2025

Conversation

mcremon-meta (Contributor)

Summary:
As titled. int8 should give better performance with Cadence kernels, since the uint8 kernels are no longer being improved.
The upcoming quantized convolution kernel also requires symmetric weights, so that change is made here as well.

Differential Revision: D69405797
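
For context, here is a minimal sketch (in torch.ao.quantization terms) of what switching a default weight spec from asymmetric uint8 to symmetric int8 can look like. The variable names (act_qspec, wgt_qspec), observer choices, and ranges are illustrative assumptions, not the exact code changed by this PR:

```python
# Illustrative sketch only -- not the actual diff from this PR.
# Variable names and observer choices below are assumptions.
import torch
from torch.ao.quantization.observer import HistogramObserver, MinMaxObserver
from torch.ao.quantization.quantizer import QuantizationSpec

# Activations: 8-bit affine (asymmetric) quantization, expressed here in int8.
act_qspec = QuantizationSpec(
    dtype=torch.int8,
    quant_min=-128,
    quant_max=127,
    qscheme=torch.per_tensor_affine,
    is_dynamic=False,
    observer_or_fake_quant_ctr=HistogramObserver.with_args(eps=2**-12),
)

# Weights, old style: asymmetric uint8.
# wgt_qspec = QuantizationSpec(
#     dtype=torch.uint8,
#     quant_min=0,
#     quant_max=255,
#     qscheme=torch.per_tensor_affine,
#     is_dynamic=False,
#     observer_or_fake_quant_ctr=MinMaxObserver,
# )

# Weights, new style: symmetric int8, so the weight zero point is fixed at 0,
# which is what a symmetric-weight quantized convolution kernel expects.
wgt_qspec = QuantizationSpec(
    dtype=torch.int8,
    quant_min=-128,
    quant_max=127,
    qscheme=torch.per_tensor_symmetric,
    is_dynamic=False,
    observer_or_fake_quant_ctr=MinMaxObserver,
)
```

With per-tensor symmetric weights the kernel can assume a zero point of 0 for the weight tensor, which removes one correction term from the int8 convolution inner loop.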

pytorch-bot bot commented Feb 10, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/8344

❌ 2 New Failures, 5 Unrelated Failures

As of commit 4b02da3 with merge base ee7d388:

NEW FAILURES - The following jobs have failed:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the CLA Signed label on Feb 10, 2025
facebook-github-bot (Contributor)

This pull request was exported from Phabricator. Differential Revision: D69405797

facebook-github-bot pushed a commit that referenced this pull request on Feb 12, 2025

Use symmetric weights for convs and int8 in the default quantizer (#8344)

Summary:
As titled. int8 should give better performance with Cadence kernels, since the uint8 kernels are no longer being improved.
The upcoming quantized convolution kernel also requires symmetric weights, so that change is made here as well.

Reviewed By: zonglinpeng

Differential Revision: D69405797
facebook-github-bot merged commit 3681588 into main on Feb 12, 2025 (39 of 48 checks passed)
facebook-github-bot deleted the export-D69405797 branch on February 12, 2025 at 04:35
Labels: CLA Signed, fb-exported, topic: not user facing
Projects: None yet
3 participants