Skip to content

Hacky func streaming #1

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 7 commits into from
Apr 5, 2024
Merged

Hacky func streaming #1

merged 7 commits into from
Apr 5, 2024

Conversation

tybalex
Copy link
Collaborator

@tybalex tybalex commented Apr 5, 2024

  • Streaming function calls.
  • Fixed a make bug in linux/ubuntu.
  • Fixed input function name with hyphen(-) --> by mapping - to _ and convert back when post processing.

@tybalex tybalex merged commit 60a01b3 into master Apr 5, 2024
@tybalex tybalex deleted the hacky_func_streaming branch April 16, 2024 21:22
tybalex added a commit that referenced this pull request Apr 17, 2024
* hacky function call streaming

* remove

* minor fix to take care of case that the input function has no description or arguments is null

* test parser

* fix makefile to make sure the order of file linking works for ubuntu gcc/g++ 11.4

* add function name mapping to take care of input function name with hyphen-

* add a comment TODO for streaming chunks.
tybalex added a commit that referenced this pull request Apr 18, 2024
* hacky function call streaming

* remove

* minor fix to take care of case that the input function has no description or arguments is null

* test parser

* fix makefile to make sure the order of file linking works for ubuntu gcc/g++ 11.4

* add function name mapping to take care of input function name with hyphen-

* add a comment TODO for streaming chunks.
tybalex added a commit that referenced this pull request Apr 18, 2024
* hacky function call streaming

* remove

* minor fix to take care of case that the input function has no description or arguments is null

* test parser

* fix makefile to make sure the order of file linking works for ubuntu gcc/g++ 11.4

* add function name mapping to take care of input function name with hyphen-

* add a comment TODO for streaming chunks.
tybalex pushed a commit that referenced this pull request Oct 23, 2024
* [example] batched-bench "segmentation fault"

When `llama-batched-bench` is invoked _without_ setting `-npl`, "number
of parallel prompts", it segfaults.

The segfault is caused by invoking `max_element()` on a zero-length
vector, `n_pl`

This commit addresses that by first checking to see if the number of
parallel prompts is zero, and if so sets the maximum sequence size to 1;
otherwise, sets it to the original, the result of `max_element()`.

Fixes, when running `lldb build/bin/llama-batched-bench -- -m models/Meta-Llama-3-8B.gguf`

```
* thread #1, queue = 'com.apple.main-thread', stop reason = EXC_BAD_ACCESS (code=1, address=0x0)
    frame #0: 0x000000010000366c llama-batched-bench`main(argc=3, argv=0x000000016fdff268) at batched-bench.cpp:72:28
   69  	    llama_context_params ctx_params = llama_context_params_from_gpt_params(params);
   70
   71  	    // ensure enough sequences are available
-> 72  	    ctx_params.n_seq_max = *std::max_element(n_pl.begin(), n_pl.end());
```

* Update examples/batched-bench/batched-bench.cpp

Co-authored-by: compilade <[email protected]>

---------

Co-authored-by: Georgi Gerganov <[email protected]>
Co-authored-by: compilade <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant