Include "<pad>" as possible padding token #15

Open
wants to merge 1 commit into
base: main
from

Conversation


@DennisCraandijk DennisCraandijk commented Jan 6, 2025

Various prominent models use "<pad>" as the padding token rather than "[PAD]". This change checks which of the two is used and gets the corresponding padding id before passing it to the StaticModelFineTuner.
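
For reference, a minimal sketch of the lookup described above. It assumes the vocabulary is available as a plain token-to-id mapping. The helper name `resolve_pad_token_id` and the `PAD_CANDIDATES` tuple are hypothetical, chosen for illustration, and are not necessarily what this PR's diff contains.

```python
from collections.abc import Mapping, Sequence

# Candidate padding tokens in order of preference (assumed for illustration).
PAD_CANDIDATES = ("[PAD]", "<pad>")


def resolve_pad_token_id(
    vocab: Mapping[str, int],
    candidates: Sequence[str] = PAD_CANDIDATES,
) -> int:
    """Return the id of the first candidate padding token present in vocab."""
    for token in candidates:
        if token in vocab:
            return vocab[token]
    # Fail loudly instead of silently falling back to an arbitrary id.
    raise ValueError(
        f"None of the padding tokens {list(candidates)} are in the vocabulary."
    )


# Toy vocabularies standing in for a "[PAD]"-style and a "<pad>"-style tokenizer.
bracket_style_vocab = {"[PAD]": 0, "[UNK]": 1, "hello": 2}
angle_style_vocab = {"<unk>": 0, "<pad>": 1, "hello": 2}

print(resolve_pad_token_id(bracket_style_vocab))
print(resolve_pad_token_id(angle_style_vocab))
```

Raising an error when neither token is found is a design choice in this sketch: a wrong padding id would otherwise only show up later as degraded training results.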

Member

Pringled commented Jan 9, 2025

Hi @DennisCraandijk, thanks for creating this PR! We will do a bit of testing ourselves and then merge and release this feature (might take a bit longer than usual as we are finishing up a release for one of our other projects).
