VulkanQuantizer for weight-only quantization on linear #4707

nathanaelsee · 2024-08-14T00:05:49Z

Summary:
Using XNNPACKQuantizer as a base.
VulkanQuantizer only annotates for 8-bit weight-only static quantization on linear nodes for now, as we only currently implement 8-bit weight quantized linear in the form of weight_int8packed_mm.

Differential Revision: D61243540

pytorch-bot · 2024-08-14T00:05:51Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/4707

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 6d1a572 with merge base 35da5bf ():

NEW FAILURE - The following job has failed:

pull / test-llama-runner-qnn-linux (fp32, cmake, qnn) / linux-job (gh)
RuntimeError: Command docker exec -t a21178c28c9702f2e080ef5114f98a55d4dc7897f6b7bec4a04ffa8ae73f1f56 /exec failed with exit code 2

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-08-14T00:05:58Z

This pull request was exported from Phabricator. Differential Revision: D61243540

Summary: Using XNNPACKQuantizer as a base. VulkanQuantizer only annotates for 8-bit weight-only static quantization on linear nodes for now, as we only currently implement 8-bit weight quantized linear in the form of weight_int8packed_mm. Differential Revision: D61243540

facebook-github-bot · 2024-08-14T20:03:22Z

This pull request was exported from Phabricator. Differential Revision: D61243540

Summary: Using XNNPACKQuantizer as a base. VulkanQuantizer only annotates for 8-bit weight-only static quantization on linear nodes for now, as we only currently implement 8-bit weight quantized linear in the form of weight_int8packed_mm. Reviewed By: copyrightly Differential Revision: D61243540

facebook-github-bot · 2024-08-15T03:59:28Z

This pull request was exported from Phabricator. Differential Revision: D61243540

Summary: Using XNNPACKQuantizer as a base. VulkanQuantizer only annotates for 8-bit weight-only static quantization on linear nodes for now, as we only currently implement 8-bit weight quantized linear in the form of weight_int8packed_mm. Reviewed By: copyrightly Differential Revision: D61243540

facebook-github-bot · 2024-08-15T04:50:48Z

This pull request was exported from Phabricator. Differential Revision: D61243540

Differential Revision: D61243540 Pull Request resolved: pytorch#4707

pytorch-bot bot added ciflow/periodic module: vulkan Issues related to the Vulkan delegate and code under backends/vulkan/ labels Aug 14, 2024

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 14, 2024

facebook-github-bot added the fb-exported label Aug 14, 2024

nathanaelsee force-pushed the export-D61243540 branch from 1282332 to 5cdda6e Compare August 14, 2024 20:03

copyrightly approved these changes Aug 15, 2024

View reviewed changes

nathanaelsee force-pushed the export-D61243540 branch from 5cdda6e to 4a97670 Compare August 15, 2024 03:59

nathanaelsee force-pushed the export-D61243540 branch from 4a97670 to 6d1a572 Compare August 15, 2024 04:50

facebook-github-bot merged commit caadd81 into pytorch:main Aug 15, 2024
75 of 78 checks passed

kirklandsign pushed a commit to kirklandsign/executorch that referenced this pull request Aug 15, 2024

VulkanQuantizer for weight-only quantization on linear

7010b69

Differential Revision: D61243540 Pull Request resolved: pytorch#4707

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

VulkanQuantizer for weight-only quantization on linear #4707

VulkanQuantizer for weight-only quantization on linear #4707

Uh oh!

nathanaelsee commented Aug 14, 2024

Uh oh!

pytorch-bot bot commented Aug 14, 2024 •

edited

Loading

Uh oh!

facebook-github-bot commented Aug 14, 2024

Uh oh!

facebook-github-bot commented Aug 14, 2024

Uh oh!

facebook-github-bot commented Aug 15, 2024

Uh oh!

facebook-github-bot commented Aug 15, 2024

Uh oh!

Uh oh!

Uh oh!

VulkanQuantizer for weight-only quantization on linear #4707

VulkanQuantizer for weight-only quantization on linear #4707

Uh oh!

Conversation

nathanaelsee commented Aug 14, 2024

Uh oh!

pytorch-bot bot commented Aug 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/4707

❌ 1 New Failure

Uh oh!

facebook-github-bot commented Aug 14, 2024

Uh oh!

facebook-github-bot commented Aug 14, 2024

Uh oh!

facebook-github-bot commented Aug 15, 2024

Uh oh!

facebook-github-bot commented Aug 15, 2024

Uh oh!

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 14, 2024 •

edited

Loading