Releases · ggml-org/llama.cpp
b5513
cmake : add llama-cparams.cpp to build (#13832)
b5512
SYCL: add gelu_erf kernel (#13749)
* SYCL: add gelu_erf kernel
* refactor code
* Use scope_op_debug_print
Co-authored-by: Atharva Dubey <[email protected]>
b5510
ggml : add ggml_repeat_4d (#13824)
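The entry above adds a new public ggml operator. As a minimal sketch of how it might be used, assuming the signature mirrors ggml_new_tensor_4d (context, source tensor, then explicit ne0..ne3 target dimensions), the example below builds a graph node that repeats a 2D tensor into a larger 4D shape without the template tensor that plain ggml_repeat requires:

```cpp
// Minimal sketch, assuming ggml_repeat_4d(ctx, a, ne0, ne1, ne2, ne3) takes
// the target shape directly; verify against ggml.h before relying on it.
#include "ggml.h"

int main(void) {
    struct ggml_init_params params = {
        /*.mem_size   =*/ 16 * 1024 * 1024,
        /*.mem_buffer =*/ nullptr,
        /*.no_alloc   =*/ false,
    };
    struct ggml_context * ctx = ggml_init(params);

    // source tensor: 4 x 3
    struct ggml_tensor * a = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, 4, 3);

    // repeat it out to an explicit 4D shape (8 x 6 x 2 x 1); each target
    // dimension is assumed to be a multiple of the source dimension
    struct ggml_tensor * b = ggml_repeat_4d(ctx, a, 8, 6, 2, 1);

    // record the op in a compute graph (no backend execution shown here)
    struct ggml_cgraph * gf = ggml_new_graph(ctx);
    ggml_build_forward_expand(gf, b);

    ggml_free(ctx);
    return 0;
}
```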
b5509
ggml : riscv: add xtheadvector support (#13720)
* ggml : riscv: add xtheadvector support
* ggml : clean up some macro usage
b5508
mtmd : support Qwen 2.5 Omni (input audio+vision, no audio output) (#…
b5506
ggml-cpu: x86 feature detection is specific to x86 (#13811)
b5505
ggml : allow CUDA graphs when using pipeline parallelism (#13814)
b5504
kv-cells : track min/max used cells and per-sequence positions (#13808)
* kv-cells : track min/max used cells and per-sequence positions
* kv-cells : fix pos-modification updates for seq_pos
* kv-cells : add comments
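The commit titles above describe bookkeeping inside the KV-cache cells. The sketch below illustrates the general idea rather than the actual llama.cpp internals: remember the lowest and highest used cell index so scans can be bounded, and keep a min/max position per sequence. All names here (cell_tracker, seq_range) are hypothetical:

```cpp
// Hedged sketch of min/max cell and per-sequence position tracking.
// Not the real llama.cpp data structure; removal/compaction is omitted.
#include <algorithm>
#include <cstdint>
#include <map>
#include <vector>

// per-sequence position range
struct seq_range {
    int64_t pos_min = INT64_MAX;
    int64_t pos_max = INT64_MIN;
};

class cell_tracker {
    std::vector<bool>            used;   // which cells currently hold data
    uint32_t                     i_min;  // lowest used cell index (valid if n_used > 0)
    uint32_t                     i_max;  // highest used cell index
    size_t                       n_used = 0;
    std::map<int32_t, seq_range> seqs;   // seq_id -> position range

public:
    explicit cell_tracker(uint32_t n) : used(n, false), i_min(n), i_max(0) {}

    void set(uint32_t i, int32_t seq_id, int64_t pos) {
        if (!used[i]) { used[i] = true; n_used++; }
        i_min = std::min(i_min, i);
        i_max = std::max(i_max, i);
        seq_range & r = seqs[seq_id];
        r.pos_min = std::min(r.pos_min, pos);
        r.pos_max = std::max(r.pos_max, pos);
    }

    // bound scans to [begin, end) instead of walking the whole buffer
    uint32_t scan_begin() const { return n_used ? i_min : 0; }
    uint32_t scan_end()   const { return n_used ? i_max + 1 : 0; }
};
```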
b5503
sampling : make sure samplers return at least 1 token (#13822)
* sampling : min-p should always return at least one token
* sampling : same for typical sampling
* tests : sampling tests use min_keep == 0
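The entries above tighten a sampler invariant: filtering must never leave an empty candidate set. A self-contained sketch of the min-p idea with that guarantee, not the actual llama.cpp sampler code, might look like this (token_prob and apply_min_p are illustrative names):

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

struct token_prob { int id; float p; };

// keep tokens with p >= min_p * p_max, but never fewer than min_keep,
// and clamp min_keep to at least 1 so the result is never empty
void apply_min_p(std::vector<token_prob> & cand, float min_p, size_t min_keep) {
    if (cand.empty()) {
        return;
    }
    min_keep = std::max<size_t>(min_keep, 1);

    // sort by probability, highest first
    std::sort(cand.begin(), cand.end(),
              [](const token_prob & a, const token_prob & b) { return a.p > b.p; });

    const float threshold = cand.front().p * min_p;

    size_t k = 0;
    while (k < cand.size() && (cand[k].p >= threshold || k < min_keep)) {
        k++;
    }
    cand.resize(k);
}
```

With min_keep == 0 (as the new tests exercise), the clamp still guarantees the highest-probability token survives.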
b5502
llama : validate seq id batch input (#13809)
* llama : validate seq id batch input
* cont : fix the fix
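The fix above adds up-front validation of sequence ids in a batch. A hedged sketch of that kind of check, with illustrative names rather than the real llama.cpp entry point, might look like:

```cpp
#include <cstdint>
#include <cstdio>

// return false (instead of failing later) if any token references a
// sequence id outside [0, n_seq_max)
bool validate_seq_ids(const int32_t * seq_id, int32_t n_tokens, int32_t n_seq_max) {
    for (int32_t i = 0; i < n_tokens; i++) {
        if (seq_id[i] < 0 || seq_id[i] >= n_seq_max) {
            fprintf(stderr, "%s: invalid seq_id %d at token %d (n_seq_max = %d)\n",
                    __func__, seq_id[i], i, n_seq_max);
            return false;
        }
    }
    return true;
}
```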