Releases · ggml-org/llama.cpp
b5529
ggml: aarch64: Implement SVE F32 kernels for vector functions (#13843)
* F32-Mamba-SVE
* F32-Mamba-SVE
* Resolve test errors-1
* Resolve test errors-2
* F32-vec-SVE
* F32-vec-SVE
* F32-vec-SVE
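SVE kernels typically use predicated loops so one code path handles full vectors and the tail alike. Below is a minimal sketch of that pattern for an F32 axpy-style update; the function name and the exact operation are illustrative assumptions, not the kernels added in #13843.

```c
// Illustrative sketch of a predicated SVE F32 loop (not the PR's code).
// Build on aarch64 with SVE enabled, e.g. -march=armv8-a+sve.
#include <arm_sve.h>
#include <stddef.h>
#include <stdint.h>

// y[i] += a * x[i] for i in [0, n)
void vec_mad_f32(float * y, const float * x, float a, size_t n) {
    for (size_t i = 0; i < n; i += svcntw()) {
        // the predicate masks off lanes past n, so no scalar tail loop is needed
        svbool_t    pg = svwhilelt_b32_u64((uint64_t) i, (uint64_t) n);
        svfloat32_t vx = svld1_f32(pg, x + i);
        svfloat32_t vy = svld1_f32(pg, y + i);
        vy = svmla_n_f32_x(pg, vy, vx, a);   // vy + vx * a
        svst1_f32(pg, y + i, vy);
    }
}
```

Because the vector length is queried at run time via svcntw(), the same binary runs unchanged on hardware with 128-bit through 2048-bit SVE registers.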
b5527
llama : fix KV shift for qwen2vl (#13870)
* llama : fix KV shift for qwen2vl
* add ref to the PR
b5526
mtmd : move helpers to dedicated library (⚠️ breaking change) (#13866)
* mtmd : move helpers to dedicated library
* fix server build
* rm leftover cmakelist code
b5524
llama : add support for BertForSequenceClassification reranker (#13858)
* convert: add support for BertForSequenceClassification
* add support for reranking using BertForSequenceClassification
* merge checks of eos and sep
* fix lint

Co-authored-by: dinhhuy <[email protected]>
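With a BertForSequenceClassification model converted to GGUF, reranking is usually exercised through llama-server's rerank endpoint. The libcurl sketch below shows such a request, assuming a server instance is already serving the reranker on localhost:8080; the endpoint path, JSON field names, and model name are assumptions based on the server's existing Jina-style rerank convention, not details stated in this release note.

```c
// Hypothetical rerank request against a local llama-server (sketch, assumed API).
#include <curl/curl.h>
#include <stdio.h>

int main(void) {
    curl_global_init(CURL_GLOBAL_DEFAULT);
    CURL *curl = curl_easy_init();
    if (!curl) return 1;

    // Query plus candidate documents; the server is expected to return
    // a relevance score per document (field names are an assumption).
    const char *body =
        "{\"model\":\"reranker\","
        "\"query\":\"what is a panda?\","
        "\"documents\":[\"pandas are bears native to china\","
        "\"the capital of france is paris\"]}";

    struct curl_slist *hdrs = curl_slist_append(NULL, "Content-Type: application/json");
    curl_easy_setopt(curl, CURLOPT_URL, "http://localhost:8080/v1/rerank");
    curl_easy_setopt(curl, CURLOPT_HTTPHEADER, hdrs);
    curl_easy_setopt(curl, CURLOPT_POSTFIELDS, body);

    // The default write callback prints the JSON response to stdout.
    CURLcode res = curl_easy_perform(curl);
    if (res != CURLE_OK) {
        fprintf(stderr, "request failed: %s\n", curl_easy_strerror(res));
    }

    curl_slist_free_all(hdrs);
    curl_easy_cleanup(curl);
    curl_global_cleanup();
    return res == CURLE_OK ? 0 : 1;
}
```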
b5522
server: fix remove 'image_url'/'input_audio' json-object effectively fo…
b5519
CUDA: fix FA tg at long context for CC >= 8.9 (#13852)
b5517
CANN: Add SOC TYPE printing in cmake configuration (#13837)
b5516
opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, …
b5515
opencl: mark `mul_mat` `f32f32` as supporting non-contiguous tensors …
b5514
vulkan: use timestamp queries for GGML_VULKAN_PERF (#13817)
Also change it to be controlled by an env var rather than cmake flag
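Moving the switch from a CMake flag to an environment variable means the perf reporting can be toggled per run without rebuilding. A minimal sketch of that kind of check, assuming the variable is read once at backend initialization (the helper name is illustrative):

```c
// Sketch of an env-var toggle; only the variable name comes from the release note.
#include <stdbool.h>
#include <stdlib.h>

// Returns true when GGML_VULKAN_PERF is set to a non-empty, non-"0" value.
static bool vk_perf_enabled(void) {
    const char * v = getenv("GGML_VULKAN_PERF");
    return v != NULL && v[0] != '\0' && v[0] != '0';
}
```

In use, running a binary with GGML_VULKAN_PERF=1 in the environment would enable the timestamp-query based reporting, while unset or "0" leaves it off.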