Skip to content

Removing GPTQ from all of torchchat #864

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 9 commits into from
Jun 21, 2024
Merged

Removing GPTQ from all of torchchat #864

merged 9 commits into from
Jun 21, 2024

Conversation

Jack-Khuu
Copy link
Contributor

GPTQ is currently experimental in torchao and not an MVP target

Will be called out in future works

Copy link

pytorch-bot bot commented Jun 18, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/864

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit be5b9b4 with merge base a51389c (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jun 18, 2024
python3 -W ignore generate.py --dtype ${DTYPE} --checkpoint-path "$CHECKPOINT_PATH" --temperature 0 --dso-path ${MODEL_DIR}/${MODEL_NAME}.so --device "$TARGET_DEVICE" > "$MODEL_DIR/output_aoti" || exit 1
.ci/scripts/check_gibberish "$MODEL_DIR/output_aoti"
if [ "$DTYPE" != "float16" ]; then
python3 -W ignore export.py --dtype ${DTYPE} --quant '{"linear:int4" : {"groupsize": 32}}' --checkpoint-path "$CHECKPOINT_PATH" --output-dso-path ${MODEL_DIR}/${MODEL_NAME}.so --device "$TARGET_DEVICE" || exit 1
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this one includes int4, it's not just gptq

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good catch, will fix

python3 generate.py [ --pte-path ${MODEL_OUT}/${MODEL_NAME}_gptq.pte | ...dso...] --prompt "Hello my name is"
```

#### Adding additional quantization schemes (hqq)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are you planning to remove hqq as well

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yup I have a separate PR for that

Jack-Khuu and others added 5 commits June 20, 2024 13:23
Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets
llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema
* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)
* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)
* Removing all references to HQQ

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)
@Jack-Khuu Jack-Khuu merged commit d71783c into main Jun 21, 2024
51 checks passed
@Jack-Khuu Jack-Khuu deleted the remove_gptq branch June 21, 2024 23:38
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Removing GPTQ from all of torchchat

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Rebase + Add back accidental deletion

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Removing all references to HQQ (#869)

* Removing all references to HQQ

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Removing GPTQ from all of torchchat

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Rebase + Add back accidental deletion

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Removing all references to HQQ (#869)

* Removing all references to HQQ

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Removing GPTQ from all of torchchat

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Rebase + Add back accidental deletion

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Removing all references to HQQ (#869)

* Removing all references to HQQ

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Removing GPTQ from all of torchchat

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Rebase + Add back accidental deletion

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Removing all references to HQQ (#869)

* Removing all references to HQQ

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Removing GPTQ from all of torchchat

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Rebase + Add back accidental deletion

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Removing all references to HQQ (#869)

* Removing all references to HQQ

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)
malfet pushed a commit that referenced this pull request Jul 17, 2024
* Removing GPTQ from all of torchchat

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Rebase + Add back accidental deletion

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Removing all references to HQQ (#869)

* Removing all references to HQQ

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Creating an initial Quantization Directory (#863)

* Initial Creation of a quantization directory

* Moving qops

* updating import

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)

* Update Quant call using llama.cpp (#868)

llama.cpp did a BC breaking refactor: ggml-org/llama.cpp@1c641e6
resulting in some of our CI breaking

This updates our CI to match llama.cpp's schema

* Updating torch nightly to pick up aoti improvements in 128339 (#862)

* Updating torch nightly to pick up aoti improvements in 128339

* Update the torch version to 2.5

* Updating lm_eval version (#865)

Fixing CI related to EleutherAI/wikitext_document_level change requirements from using HF Datasets

* Pinning numpy to under 2.0 (#867)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Meta Open Source bot.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants