Use int4mm weight packing MPS kernel #866

manuelcandales · 2024-06-18T19:11:59Z

This should land after pytorch/pytorch#128965 lands, so that it can use the int4mm weight packing MPS kernel introduced there.
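For context on "weight packing": 4-bit quantized weights are usually stored several to a wider integer rather than one per byte. The snippet below is only a generic plain-Python sketch of that idea, packing two 4-bit values per byte. It is not the layout used by the MPS kernel in pytorch/pytorch#128965, and `pack_int4` / `unpack_int4` are hypothetical names that do not appear in this PR.

```python
def pack_int4(values):
    """Pack unsigned 4-bit ints (0..15) into bytes, two values per byte.

    Illustrative only: real int4mm kernels use their own tiled layouts.
    """
    if len(values) % 2 != 0:
        raise ValueError("expected an even number of values")
    if any(not 0 <= v <= 15 for v in values):
        raise ValueError("each value must fit in 4 bits (0..15)")
    # Even-indexed value goes in the low nibble, odd-indexed in the high nibble.
    return bytes(lo | (hi << 4) for lo, hi in zip(values[0::2], values[1::2]))


def unpack_int4(packed):
    """Inverse of pack_int4: recover the list of 4-bit values."""
    out = []
    for b in packed:
        out.append(b & 0x0F)  # low nibble
        out.append(b >> 4)    # high nibble
    return out
```

Packing this way halves the storage compared with one value per byte; a round trip through `unpack_int4(pack_int4(...))` returns the original values.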

pytorch-bot · 2024-06-18T19:12:02Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/866

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 0131b58 with merge base d71783c:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Jack-Khuu · 2024-06-18T21:23:18Z

Heads up that I have some PRs that'll land to fix most of these tests (CI is just broken atm)

malfet

LGTM, but you'll need to update the PyTorch pin (otherwise your changes will not be visible)

jerryzh168 · 2024-06-28T00:35:13Z

so int4mm path for mps is enabled now?

* Use int4mm weight packing mps kernel
* update torch nightly

Updating torch nightly to pick up int4mm groupsize 256 fix (#874)


facebook-github-bot added the CLA Signed label Jun 18, 2024

Jack-Khuu self-requested a review June 18, 2024 21:22

Use int4mm weight packing mps kernel

219fe79

manuelcandales force-pushed the int4mm branch from 3179e94 to 219fe79 June 25, 2024 18:49

manuelcandales requested review from malfet and kimishpatel June 25, 2024 18:50

malfet approved these changes Jun 25, 2024


update torch nightly

0131b58

manuelcandales merged commit 81fce9c into main Jun 26, 2024
51 checks passed

Jack-Khuu mentioned this pull request Jun 26, 2024

4-bit quantization on MPS generates etSocketAddressDIRECTORYDIRECTORY instead of tokens #870

Closed

malfet pushed a commit that referenced this pull request Jul 17, 2024

Use int4mm weight packing MPS kernel (#866)

2972d6a

* Use int4mm weight packing mps kernel
* update torch nightly

Updating torch nightly to pick up int4mm groupsize 256 fix (#874)

malfet pushed a commit that referenced this pull request Jul 17, 2024

Use int4mm weight packing MPS kernel (#866)

343534f

* Use int4mm weight packing mps kernel
* update torch nightly

malfet pushed a commit that referenced this pull request Jul 17, 2024

Use int4mm weight packing MPS kernel (#866)

b100130

* Use int4mm weight packing mps kernel
* update torch nightly

malfet pushed a commit that referenced this pull request Jul 17, 2024

Use int4mm weight packing MPS kernel (#866)

47273e6

* Use int4mm weight packing mps kernel
* update torch nightly

malfet pushed a commit that referenced this pull request Jul 17, 2024

Use int4mm weight packing MPS kernel (#866)

2cf283d

* Use int4mm weight packing mps kernel
* update torch nightly

Updating torch nightly to pick up int4mm groupsize 256 fix (#874)

malfet pushed a commit that referenced this pull request Jul 17, 2024

Use int4mm weight packing MPS kernel (#866)

a6e47fd

* Use int4mm weight packing mps kernel
* update torch nightly

Updating torch nightly to pick up int4mm groupsize 256 fix (#874)
