
Adding UTF-8 support to linenoise.cpp #12111


Merged: 1 commit merged on Mar 3, 2025

Conversation

ericcurtin
Collaborator

For emojis, non-alpha characters, etc.

ngxson changed the title from "Adding UTF-8 support to llama.cpp" to "Adding UTF-8 support to linenoise.cpp" on Feb 28, 2025
@ericcurtin
Collaborator Author

@ngxson PTAL

For emojis, non-alpha characters, etc.

Signed-off-by: Eric Curtin <[email protected]>
Collaborator

@ngxson ngxson left a comment

I don't have enough knowledge to review vendor changes; I'm only giving approval so you can merge it.

@ericcurtin ericcurtin merged commit c950a1f into master Mar 3, 2025
47 checks passed
@ericcurtin ericcurtin deleted the llama-run-utf-8 branch March 3, 2025 12:45
@ericcurtin
Collaborator Author

ericcurtin commented Mar 3, 2025

@yhirose this was the end goal for the UTF-8 work, in case you are interested. It is meant to be used in RamaLama to implement this properly:

containers/ramalama#872

llama.cpp in general also seems to be benefiting greatly from your work through its use of cpp-httplib 😄

@ericcurtin
Collaborator Author

ericcurtin commented Mar 3, 2025

There were also some people trying to input Japanese characters and the like who complained that the input wasn't being sent to llama.cpp properly from linenoise.cpp; hopefully this is in a better place now.

@yhirose

yhirose commented Mar 4, 2025

@ericcurtin sounds good! If you find any problems with the UTF-8 support, please let me know.

mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Mar 19, 2025

3 participants