
Releases: ggml-org/llama.cpp

b5513

27 May 18:23
34b7c04
cmake : add llama-cparams.cpp to build (#13832)

b5512

27 May 18:01
f3101a8
SYCL: add gelu_erf kernel (#13749)

* SYCL: add gelu_erf kernel

* refactor code

Co-authored-by: Atharva Dubey

* Use scope_op_debug_print
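
For readers unfamiliar with the name, `gelu_erf` presumably refers to the exact, erf-based form of GELU, GELU(x) = 0.5 · x · (1 + erf(x / √2)), as opposed to the tanh approximation. The following is a minimal scalar sketch of that formula in Python. It is not the SYCL kernel from this release, and the function name is only borrowed for illustration.

```python
import math

def gelu_erf(x: float) -> float:
    """Exact GELU: x * Phi(x), where Phi is the standard normal CDF.

    Illustrative scalar reference only; the actual SYCL kernel operates
    on tensors and is not reproduced here.
    """
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))

# Large positive inputs pass through almost unchanged, large negative
# inputs are pushed towards zero, and zero maps to zero.
for value in (-10.0, 0.0, 1.0, 10.0):
    print(value, gelu_erf(value))
```

A scalar reference like this can be handy when checking a backend kernel's output element by element.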


b5510

27 May 16:05
a8ea03d
ggml : add ggml_repeat_4d (#13824)

b5509

27 May 14:04
05f6ac6
ggml : riscv: add xtheadvector support (#13720)

* ggml : riscv: add xtheadvector support

* ggml : clean up some macro usage

b5508

27 May 13:09
bc583e3
mtmd : support Qwen 2.5 Omni (input audio+vision, no audio output) (#…

b5506

27 May 12:43
7fe03e7
ggml-cpu: x86 feature detection is specific to x86 (#13811)

b5505

27 May 12:01
952f395
ggml : allow CUDA graphs when using pipeline parallelism (#13814)

b5504

27 May 11:53
8171312
kv-cells : track min/max used cells and per-sequence positions (#13808)

* kv-cells : track min/max used cells and per-sequence positions

ggml-ci

* kv-cells : fix pos-modification updates for seq_pos

ggml-ci

* kv-cells : add comments

ggml-ci

b5503

27 May 09:43
f9cd683
sampling : make sure samplers return at least 1 token (#13822)

* sampling : min-p should always return at least one token

ggml-ci

* sampling : same for typical sampling

* tests : sampling tests use min_keep == 0

ggml-ci
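
The idea behind this change can be sketched as follows. Min-p sampling keeps only tokens whose probability is at least `p` times the probability of the most likely token, and the fix guarantees that the filter never returns an empty candidate list, even when `min_keep` is 0. The Python below is a simplified illustration under that assumption. It is not llama.cpp's sampler code, and `min_p_filter` is a hypothetical name.

```python
def min_p_filter(probs, p, min_keep=0):
    """Return indices of tokens surviving a min-p filter, most probable first.

    Hypothetical sketch: tokens with prob >= p * max(prob) are kept, and at
    least one token (the most probable) is always returned.
    """
    if not probs:
        return []
    # Sort token indices by descending probability.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    threshold = p * probs[order[0]]
    kept = [i for i in order if probs[i] >= threshold]
    # Never return fewer than one token, even if min_keep == 0 and the
    # threshold excludes everything (for example when p > 1).
    n_keep = max(len(kept), min_keep, 1)
    return order[:min(n_keep, len(order))]

print(min_p_filter([0.5, 0.3, 0.2], 0.5))   # keeps the tokens at or above 0.25
print(min_p_filter([0.5, 0.3, 0.2], 2.0))   # threshold excludes all; top token kept
```

In this sketch the lower bound of one token is applied regardless of `min_keep`, which is why a test can pass `min_keep == 0` and still expect a non-empty result.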

b5502

27 May 07:00
4f81b33
llama : validate seq id batch input (#13809)

* llama : validate seq id batch input

ggml-ci

* cont : fix the fix

ggml-ci