Commit ab028cb

and

authored

Migrate inference to llama_batch and llama_decode api (ggml-org#795)

* Add low-level batching notebook * fix: tokenization of special characters: (ggml-org#850) It should behave like llama.cpp, where most out of the box usages treat special characters accordingly * Update CHANGELOG * Cleanup * Fix runner label * Update notebook * Use llama_decode and batch api * Support logits_all parameter --------- Co-authored-by: Antoine Lizee <[email protected]>

1 parent f436e0c commit ab028cbCopy full SHA for ab028cb

3 files changed

+753

-8

lines changed

examples/notebooks
- Batching.ipynb
llama_cpp
- llama.py
tests
- test_llama.py

3 files changed

+753

-8

lines changed

Comments

(0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Commit ab028cb

3 files changed

3 files changed

File tree

3 files changed

3 files changed

0 commit comments