update quant docs #425
Conversation
README.md (Outdated)

@@ -63,7 +63,6 @@ python3 torchchat.py download llama3
 * in Chat mode
 * in Generate mode
 * [Exporting for mobile via ExecuTorch](#export-executorch)
-* in Chat mode
What does that mean? This is a function of the token stream received from the driver. @JacobSzwejbka has been overhauling the chat mode; please work through the application scenario with @JacobSzwejbka to ensure it works. If it does not, it's a bug. Ditto for AOTI-generated models.
Hmm, are we supporting chat mode for exported ExecuTorch models as well? I think @JacobSzwejbka's changes are for eager mode (in generate.py).
> What does that mean? This is a function of the token stream received from the driver. @JacobSzwejbka has been overhauling the chat mode; please work through the application scenario with @JacobSzwejbka to ensure it works. If it does not, it's a bug. Ditto for AOTI-generated models.

There is a 0% chance I will have time to make chat mode work for anything but eager by the deadline. If that feature is truly required, someone else needs to be working on it.
I need to dig into this. Please remove this change and I'll approve to land the rest.
Sorry, missed this comment. Updated.
Thank you!
update quant docs: JSON properties must be in double quotes, and groupsize 7 doesn't work (the groupsize must divide the weight dimension, e.g. 4096, evenly).
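
For illustration, an invocation that satisfies both constraints might look like the sketch below (this assumes torchchat's --quantize flag with the linear:int4 scheme; the scheme names actually available depend on the target backend). Single quotes around the config keep the shell from stripping the double quotes that the JSON parser requires, and 256 divides 4096 evenly, whereas 7 does not:

python3 torchchat.py generate llama3 --quantize '{"linear:int4": {"groupsize": 256}}'

By contrast, a config such as {"linear:int4": {"groupsize": 7}} would fail on a model with hidden dimension 4096, since 4096 % 7 != 0.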