Fixing race condition in server and partial stream handling in frontend. #2391

snichols · 2023-07-25T17:04:59Z

This PR fixes two problems:

1. A race condition in server.cpp.
2. completion.js did not handle partial stream results.

The race condition was caused by the unique_lock going out of scope while a streaming completion was still being processed, so the mutex was released too early. My fix was to handle unlocking of the mutex manually in that case. This bug caused segfaults when handling multiple streaming requests on the completion endpoint.

Partial stream results happen regularly on slower connections, or when the server is generating messages very quickly. The fix was to handle leftover data in the llama generator function. This bug caused completion messages to be garbled in several cases.
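To illustrate the leftover-data idea, here is a minimal sketch, not the actual code from completion.js. It assumes the server sends newline-terminated `data: <json>` lines and that chunks arrive as already-decoded strings. The name `parseEventStream` and the `content` field used in the note below are hypothetical.

```javascript
// Hypothetical helper: reassembles "data: <json>" lines from chunks that may
// be split at arbitrary positions (assumption: chunks are decoded strings).
async function* parseEventStream(chunks) {
  // Incomplete trailing line carried over from the previous chunk.
  let leftover = "";

  for await (const chunk of chunks) {
    const text = leftover + chunk;
    const lines = text.split("\n");

    // The last element is either "" (text ended with a newline) or an
    // incomplete line. Either way, hold it back until more data arrives.
    leftover = lines.pop();

    for (const line of lines) {
      if (line.startsWith("data: ")) {
        yield JSON.parse(line.slice("data: ".length));
      }
    }
  }
  // Anything still in `leftover` here never received its newline and is
  // dropped in this sketch.
}
```

With this approach, a message split across two chunks, such as `data: {"content":"Hel` followed by `lo"}` and a newline, is parsed once as a whole instead of being parsed as two broken fragments. When reading raw bytes from a fetch response, decoding with `TextDecoder` and `{ stream: true }` would similarly avoid splitting multi-byte characters across chunks.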

Azeirah · 2023-07-31T15:34:24Z

Ah great! I had many segfaults. I will use this branch tomorrow to check if they're fixed.

slaren · 2023-08-04T11:37:17Z

From what I understand, the .hpp files will need to be regenerated to see the changes in completion.js in the binary. However, I think that fixing this is important enough to merge this now. Hopefully the .hpp files can be regenerated in one of the other server PRs.

…nd. (ggml-org#2391) * Fixing race condition in server.cpp and partial stream handling in completion.js * Reverting assert edits. * Adding newline to eof

snichols and others added 4 commits July 25, 2023 11:58

Fixing race condition in server.cpp and partial stream handling in completion.js

3e3f38a

Reverting assert edits.

3811c0a

Adding newline to eof

0509a68

Merge branch 'ggerganov:master' into develop

592594f

slaren mentioned this pull request Jul 31, 2023

[User] Unreliable response from server using Wireguard when tokens are generated too fast #2467

Closed

slaren linked an issue Jul 31, 2023 that may be closed by this pull request

[User] Unreliable response from server using Wireguard when tokens are generated too fast #2467

Closed

slaren approved these changes Jul 31, 2023

slaren merged commit 5f631c2 into ggml-org:master Aug 4, 2023

cebtenzzre mentioned this pull request Aug 4, 2023

server: regenerate completion.js.hpp #2515

Merged

ml13390 mentioned this pull request Aug 7, 2023

bad display of answer in server chat client #2513

Closed
