Releases: ggml-org/llama.cpp
b4384
server : fix missing model id in /model endpoint (#10957)
* server : fix missing model id in /model endpoint
* fix ci
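For context, a minimal sketch of reading a model id back from a running llama-server. It assumes a local server on port 8080 and the OpenAI-compatible /v1/models listing route; the exact route touched by this fix may differ (the PR title refers to the /model endpoint).

```python
import requests

# Minimal sketch: query a locally running llama-server for its model listing.
# Assumes localhost:8080 and the OpenAI-compatible /v1/models route; the exact
# endpoint referenced by the fix (/model) may differ from this assumption.
resp = requests.get("http://localhost:8080/v1/models", timeout=10)
resp.raise_for_status()

for model in resp.json().get("data", []):
    # After the fix, each entry should carry its model id.
    print(model.get("id"))
```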
b4383
server : add system_fingerprint to chat/completion (#10917)
* server : add system_fingerprint to chat/completion
* update README
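A minimal sketch of where the new field shows up, assuming a local llama-server on port 8080 serving the OpenAI-compatible /v1/chat/completions route; the field placement follows the OpenAI response schema.

```python
import requests

# Minimal sketch: issue a chat completion against a local llama-server and
# read back the system_fingerprint field added by this release. The server
# address and the /v1/chat/completions route are assumptions for illustration.
payload = {
    "messages": [{"role": "user", "content": "Say hello in one word."}],
    "max_tokens": 8,
}
resp = requests.post("http://localhost:8080/v1/chat/completions", json=payload, timeout=60)
resp.raise_for_status()

body = resp.json()
print(body.get("system_fingerprint"))           # identifies the build/backend configuration
print(body["choices"][0]["message"]["content"])  # the generated reply
```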
b4382
rpc-server : add support for the SYCL backend (#10934)
b4381
llama : support InfiniAI Megrez 3b (#10893)
* Support InfiniAI Megrez 3b
* Fix tokenizer_clean_spaces for megrez
b4380
llama : support for Llama-3_1-Nemotron-51B (#10669)
* conflict resolution
* move comments after bracket to its own line
b4379
llama-run : include temperature option (#10899)
This commit updates the `examples/run/README.md` file to include a new option for setting the temperature and updates the `run.cpp` file to parse this option.
Signed-off-by: Eric Curtin <[email protected]>
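A minimal usage sketch for the new option. The flag spelling (`--temp`) and the model argument are assumptions for illustration; consult `examples/run/README.md` for the exact option name and model syntax.

```python
import subprocess

# Minimal sketch: invoke llama-run with a sampling temperature.
# The --temp flag spelling and the model path are illustrative assumptions,
# not confirmed by the release note itself.
subprocess.run(
    ["llama-run", "--temp", "0.8", "my-model.gguf", "Write a one-line greeting"],
    check=True,
)
```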
b4378
ggml : fix run-time on FreeBSD in get_executable_path() (#10948)
b4376
llama : add Falcon3 support (#10883)
* Add Falcon3 model support
* Add fix for adding bos to added special tokens
* Add comment explaining the logic behind the if statement
* Add a log message to better track when the following line of code is triggered
* Update log to only print when input and output characters are different
* Fix handling pre-normalized tokens
* Refactoring
b4375
vulkan: build fixes for 32b (#10927)
* vulkan: build fixes for 32b (should fix #10923)
* vulkan: initialize some buffer/offset variables
b4372
ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0…