Support unlift=false for per_channel qparams #868

digantdesai · 2023-10-12T17:06:03Z

Summary: This was a TODO, is now needed to support Saliency model with unlift=False

Differential Revision: D50197184

netlify · 2023-10-12T17:06:09Z

✅ Deploy Preview for resplendent-gnome-14e531 canceled.

Name	Link
🔨 Latest commit	`fc6dfa9`
🔍 Latest deploy log	https://app.netlify.com/sites/resplendent-gnome-14e531/deploys/65299b45058868000970b9e9

facebook-github-bot · 2023-10-12T17:06:34Z

This pull request was exported from Phabricator. Differential Revision: D50197184

Summary: Rationale, * PyTorch, weights = [out_channels, in_channels/group, kernel_h, kernel_w], per_channel quant axis = 0 * XNNPACK, weights = [in_channels/group, kernel_h, kernel_w, out_channels], per_channel quant axis = 3 This diff fixes the axis value (i.e. weight dim) and convert from 0 --> 3 before passing it on to XNNPACK just like we do weights already Differential Revision: https://internalfb.com/D50195930 fbshipit-source-id: 21c2f94d0dfaf62ca3c527297e59c19a2c4c351d

Summary: Pull Request resolved: pytorch/executorch#868 This was a TODO, is now needed to support Saliency model with `unlift=False` Reviewed By: kimishpatel Differential Revision: D50197184 fbshipit-source-id: 0610e8318be96d84d83ace29c7a95dc9a31b1296

facebook-github-bot · 2023-10-13T19:32:15Z

This pull request was exported from Phabricator. Differential Revision: D50197184

facebook-github-bot · 2023-10-13T20:39:41Z

This pull request has been merged in c33932c.

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6 resulting in some of our CI breaking This updates our CI to match llama.cpp's schema

* Initial Creation of a quantization directory * Moving qops * updating import * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867) * Update Quant call using llama.cpp (#868) llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6 resulting in some of our CI breaking This updates our CI to match llama.cpp's schema * Updating torch nightly to pick up aoti improvements in 128339 (#862) * Updating torch nightly to pick up aoti improvements in 128339 * Update the torch version to 2.5 * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867)

* Removing all references to HQQ * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867) * Update Quant call using llama.cpp (#868) llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6 resulting in some of our CI breaking This updates our CI to match llama.cpp's schema * Updating torch nightly to pick up aoti improvements in 128339 (#862) * Updating torch nightly to pick up aoti improvements in 128339 * Update the torch version to 2.5 * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867) * Creating an initial Quantization Directory (#863) * Initial Creation of a quantization directory * Moving qops * updating import * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867) * Update Quant call using llama.cpp (#868) llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6 resulting in some of our CI breaking This updates our CI to match llama.cpp's schema * Updating torch nightly to pick up aoti improvements in 128339 (#862) * Updating torch nightly to pick up aoti improvements in 128339 * Update the torch version to 2.5 * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867)

* Removing GPTQ from all of torchchat * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867) * Rebase + Add back accidental deletion * Update Quant call using llama.cpp (#868) llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6 resulting in some of our CI breaking This updates our CI to match llama.cpp's schema * Updating torch nightly to pick up aoti improvements in 128339 (#862) * Updating torch nightly to pick up aoti improvements in 128339 * Update the torch version to 2.5 * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867) * Creating an initial Quantization Directory (#863) * Initial Creation of a quantization directory * Moving qops * updating import * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867) * Update Quant call using llama.cpp (#868) llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6 resulting in some of our CI breaking This updates our CI to match llama.cpp's schema * Updating torch nightly to pick up aoti improvements in 128339 (#862) * Updating torch nightly to pick up aoti improvements in 128339 * Update the torch version to 2.5 * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867) * Removing all references to HQQ (#869) * Removing all references to HQQ * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867) * Update Quant call using llama.cpp (#868) llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6 resulting in some of our CI breaking This updates our CI to match llama.cpp's schema * Updating torch nightly to pick up aoti improvements in 128339 (#862) * Updating torch nightly to pick up aoti improvements in 128339 * Update the torch version to 2.5 * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867) * Creating an initial Quantization Directory (#863) * Initial Creation of a quantization directory * Moving qops * updating import * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867) * Update Quant call using llama.cpp (#868) llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6 resulting in some of our CI breaking This updates our CI to match llama.cpp's schema * Updating torch nightly to pick up aoti improvements in 128339 (#862) * Updating torch nightly to pick up aoti improvements in 128339 * Update the torch version to 2.5 * Updating lm_eval version (#865) Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets * Pinning numpy to under 2.0 (#867)

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 12, 2023

facebook-github-bot added the fb-exported label Oct 12, 2023

digantdesai and others added 2 commits October 12, 2023 14:14

digantdesai force-pushed the export-D50197184 branch from 3930509 to fc6dfa9 Compare October 13, 2023 19:32

facebook-github-bot closed this in c33932c Oct 13, 2023

facebook-github-bot added the Merged label Oct 13, 2023

dbort mentioned this pull request Oct 14, 2023

[release/0.1] Cherrypick docs-only commits from main #923

Merged

Gasoonjia pushed a commit that referenced this pull request Jul 30, 2024

Update Quant call using llama.cpp (#868)

103a17b

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6 resulting in some of our CI breaking This updates our CI to match llama.cpp's schema

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support unlift=false for per_channel qparams #868

Support unlift=false for per_channel qparams #868

Uh oh!

digantdesai commented Oct 12, 2023

Uh oh!

netlify bot commented Oct 12, 2023 •

edited

Loading

Uh oh!

facebook-github-bot commented Oct 12, 2023

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

Uh oh!

Support unlift=false for per_channel qparams #868

Support unlift=false for per_channel qparams #868

Uh oh!

Conversation

digantdesai commented Oct 12, 2023

Uh oh!

netlify bot commented Oct 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for resplendent-gnome-14e531 canceled.

Uh oh!

facebook-github-bot commented Oct 12, 2023

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

Uh oh!

netlify bot commented Oct 12, 2023 •

edited

Loading