Skip to content

Embedding quantization per backend #402

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 6 commits into from
Apr 23, 2024
Merged

Conversation

mikekgfb
Copy link
Contributor

Embedding quantization per backend

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Apr 23, 2024
Copy link
Contributor

@malfet malfet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How do we test for anything like that?

@mikekgfb mikekgfb merged commit 052fb1a into main Apr 23, 2024
malfet added a commit that referenced this pull request Apr 23, 2024
mikekgfb added a commit that referenced this pull request Apr 23, 2024
mikekgfb added a commit that referenced this pull request Apr 23, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* 4b and 8b embedding table quantization

* minor changes

* remove extra et workflow
mikekgfb added a commit that referenced this pull request Apr 24, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* merge GGUF tests into pull.yml
@malfet malfet deleted the embedding_quantization_per_backend branch April 30, 2024 16:51
malfet pushed a commit that referenced this pull request Jul 17, 2024
* ET or AOTI backend logic

* use args, not builder_args

* typo

* typo

* typo
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* 4b and 8b embedding table quantization

* minor changes

* remove extra et workflow
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* merge GGUF tests into pull.yml
malfet pushed a commit that referenced this pull request Jul 17, 2024
* ET or AOTI backend logic

* use args, not builder_args

* typo

* typo

* typo
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* 4b and 8b embedding table quantization

* minor changes

* remove extra et workflow
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* merge GGUF tests into pull.yml
malfet pushed a commit that referenced this pull request Jul 17, 2024
* ET or AOTI backend logic

* use args, not builder_args

* typo

* typo

* typo
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* 4b and 8b embedding table quantization

* minor changes

* remove extra et workflow
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* merge GGUF tests into pull.yml
malfet pushed a commit that referenced this pull request Jul 17, 2024
* ET or AOTI backend logic

* use args, not builder_args

* typo

* typo

* typo
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* 4b and 8b embedding table quantization

* minor changes

* remove extra et workflow
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* merge GGUF tests into pull.yml
malfet pushed a commit that referenced this pull request Jul 17, 2024
* ET or AOTI backend logic

* use args, not builder_args

* typo

* typo

* typo
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* 4b and 8b embedding table quantization

* minor changes

* remove extra et workflow
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* merge GGUF tests into pull.yml
malfet pushed a commit that referenced this pull request Jul 17, 2024
* ET or AOTI backend logic

* use args, not builder_args

* typo

* typo

* typo
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* 4b and 8b embedding table quantization

* minor changes

* remove extra et workflow
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Revert "Revert "Embedding quantization per backend (#402)" (#411)"

This reverts commit 8b35acd.

* merge GGUF tests into pull.yml
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Meta Open Source bot.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants