Fixing frequency penalty #1811

martinigoyanes · 2024-04-25T22:01:40Z

Thank you so much for the work you are doing, this is my little contribution to this great thing you have built. I hope it is useful and helpful, please don't hesitate to discuss any matters that are not clear!

I am basing my implementation of frequency penalty on OpenAI's implementation: https://platform.openai.com/docs/guides/text-generation/parameter-details

The problem I see with TGI's current implementation is that is not taking into account the frequency of tokens which have already been sampled in the current generation stream. Also, the scaling is of the adjusted token logits is done differently for positive and negative logits. While in OpenAI's implementation token frequency is taking into account and the scaling is always done with a subtraction (if penalty is positive) or add operation (if penalty is negative).

This leads to corrupt generations as I mentioned in issue #1810 . Moreover, after my tests, other issues are also gone like the one about some request's with penalty_frequency = 1.0 overruling other requests (with frequency_penalty = 0.0) in the same batch and therefore corrupting all generations in the batch. Basically, padding does not affect this implementation so I believe this score *= input_ids.ne(0) is not needed anymore.

Frequency penalty	-1.0	0.0	1.0
Before my change	https://paste.mozilla.org/JxqGJkWY	https://paste.mozilla.org/hrztJ56h	https://paste.mozilla.org/pBSEH2zw
After my change	https://paste.mozilla.org/7gXCi7zo	https://paste.mozilla.org/ZR9rJ92g	https://paste.mozilla.org/gHaD2YnC

martinigoyanes · 2024-04-29T10:38:57Z

Hey I noticed the links to the generation examples were broken, so I have updated them!

martinigoyanes · 2024-04-30T07:49:47Z

Hey @drbh I saw you created a new branch off of this one, why is that?

Btw, I have rebased and fixed the styling! Could you guys maybe take a look and give me your feedback? @drbh @Narsil @OlivierDehaene

This is the commit with my changes 949b889

Are you guys comfortable with merging this branch?

Thank you!

Narsil · 2024-04-30T09:03:05Z

We create branches to run the CI because our secrets will not be available on forks (and we need them to run the integration tests unfortunately).

The rebase created a lot of bogus commits, any way you could remove them ? I'm happy to help with the rebase if you want.

Thanks a lot for the fix !

… when apply freq penalty

Narsil · 2024-04-30T10:13:19Z

I took the liberty of doing the rebase so your fix could be included in the upcoming release.

martinigoyanes · 2024-04-30T11:07:28Z

Thank you so much @Narsil ! Okay now I understand why the fork happened, keep up the great work you guys are doing with TGI!

(and sorry for the mess I created with my rebase :/ )

Narsil · 2024-04-30T11:58:52Z

No worries :) Cheers

Thank you so much for the work you are doing, this is my little contribution to this great thing you have built. I hope it is useful and helpful, please don't hesitate to discuss any matters that are not clear! I am basing my implementation of frequency penalty on OpenAI's implementation: https://platform.openai.com/docs/guides/text-generation/parameter-details The problem I see with TGI's current implementation is that is not taking into account the frequency of tokens which have already been sampled in the current generation stream. Also, the scaling is of the adjusted token logits is done differently for positive and negative logits. While in OpenAI's implementation token frequency is taking into account and the scaling is always done with a subtraction (if penalty is positive) or add operation (if penalty is negative). This leads to corrupt generations as I mentioned in issue huggingface#1810 . Moreover, after my tests, other issues are also gone like the one about some request's with ``penalty_frequency = 1.0`` overruling other requests (with ``frequency_penalty = 0.0``) in the same batch and therefore corrupting all generations in the batch. Basically, padding does not affect this implementation so I believe this ``score *= input_ids.ne(0)`` is not needed anymore. Frequency penalty | -1.0 | 0.0 | 1.0 -- | -- | -- | -- Before my change | https://paste.mozilla.org/JxqGJkWY | https://paste.mozilla.org/hrztJ56h | https://paste.mozilla.org/pBSEH2zw After my change | https://paste.mozilla.org/7gXCi7zo | https://paste.mozilla.org/ZR9rJ92g | https://paste.mozilla.org/gHaD2YnC --------- Co-authored-by: martini <[email protected]>

drbh self-assigned this Apr 26, 2024

drbh mentioned this pull request Apr 29, 2024

Martinigoyanes fix frequency penalty #1830

Closed

martinigoyanes added 2 commits April 30, 2024 12:11

fix: take into account logits frequency so far in a generation stream…

fcbd7fc

… when apply freq penalty

chore: rebase and fix formatting

21ec539

Narsil force-pushed the fix-frequency-penalty branch from fa9713d to 21ec539 Compare April 30, 2024 10:12

Narsil merged commit 9192de5 into huggingface:main Apr 30, 2024
4 of 8 checks passed

tgaddair mentioned this pull request Nov 11, 2024

Fix frequency_penalty and presence_penalty predibase/lorax#672

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fixing frequency penalty #1811

Fixing frequency penalty #1811

Uh oh!

martinigoyanes commented Apr 25, 2024 •

edited

Loading

Uh oh!

martinigoyanes commented Apr 29, 2024

Uh oh!

martinigoyanes commented Apr 30, 2024 •

edited

Loading

Uh oh!

Narsil commented Apr 30, 2024

Uh oh!

Narsil commented Apr 30, 2024

Uh oh!

Uh oh!

martinigoyanes commented Apr 30, 2024

Uh oh!

Narsil commented Apr 30, 2024

Uh oh!

Uh oh!

Fixing frequency penalty #1811

Fixing frequency penalty #1811

Uh oh!

Conversation

martinigoyanes commented Apr 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

martinigoyanes commented Apr 29, 2024

Uh oh!

martinigoyanes commented Apr 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Narsil commented Apr 30, 2024

Uh oh!

Narsil commented Apr 30, 2024

Uh oh!

Uh oh!

martinigoyanes commented Apr 30, 2024

Uh oh!

Narsil commented Apr 30, 2024

Uh oh!

Uh oh!

martinigoyanes commented Apr 25, 2024 •

edited

Loading

martinigoyanes commented Apr 30, 2024 •

edited

Loading