Skip to content

Support unlift=false for per_channel qparams #868

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 2 commits into from

Conversation

digantdesai
Copy link
Contributor

Summary: This was a TODO, is now needed to support Saliency model with unlift=False

Differential Revision: D50197184

@netlify
Copy link

netlify bot commented Oct 12, 2023

Deploy Preview for resplendent-gnome-14e531 canceled.

Name Link
🔨 Latest commit fc6dfa9
🔍 Latest deploy log https://app.netlify.com/sites/resplendent-gnome-14e531/deploys/65299b45058868000970b9e9

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 12, 2023
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D50197184

digantdesai and others added 2 commits October 12, 2023 14:14
Summary:
Rationale,
* PyTorch, weights = [out_channels, in_channels/group, kernel_h, kernel_w], per_channel quant axis = 0
* XNNPACK, weights = [in_channels/group, kernel_h, kernel_w, out_channels], per_channel quant axis = 3

This diff fixes the axis value (i.e. weight dim) and convert from 0 --> 3 before passing it on to XNNPACK just like we do weights already

Differential Revision: https://internalfb.com/D50195930

fbshipit-source-id: 21c2f94d0dfaf62ca3c527297e59c19a2c4c351d
Summary:
Pull Request resolved: pytorch/executorch#868

This was a TODO, is now needed to support Saliency model with `unlift=False`

Reviewed By: kimishpatel

Differential Revision: D50197184

fbshipit-source-id: 0610e8318be96d84d83ace29c7a95dc9a31b1296
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D50197184

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in c33932c.

Gasoonjia pushed a commit that referenced this pull request Jul 30, 2024
llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema
Gasoonjia pushed a commit that referenced this pull request Jul 30, 2024
* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)
Gasoonjia pushed a commit that referenced this pull request Jul 30, 2024
* Removing all references to HQQ

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)
Gasoonjia pushed a commit that referenced this pull request Jul 30, 2024
* Removing GPTQ from all of torchchat

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Rebase + Add back accidental deletion

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Removing all references to HQQ (#869)

* Removing all references to HQQ

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants