Skip to content

Releases: ggml-org/llama.cpp

b5601

06 Jun 12:13
487a5e0
Compare
Choose a tag to compare
context : fix SWA-related warning for multiple sequences (#14045)

b5600

06 Jun 08:02
d17a809
Compare
Choose a tag to compare
llama : support multiple classifier outputs and labels (#13940)

b5598

05 Jun 14:39
669c13e
Compare
Choose a tag to compare
vulkan: Enable VK_KHR_cooperative_matrix extension for Intel Xe2 GPUs…

b5596

05 Jun 13:13
7f37b6c
Compare
Choose a tag to compare
memory : migrate from llama_kv_cache to more generic llama_memory (#1…

b5595

05 Jun 10:16
3a07714
Compare
Choose a tag to compare
llama : allow using mmap without PrefetchVirtualMemory, apply GGML_WI…

b5593

05 Jun 07:48
9f47fa5
Compare
Choose a tag to compare
vocab : warn about missing mask token (#14022)

b5592

05 Jun 06:43
9e31bec
Compare
Choose a tag to compare
context : fix pos_min initialization upon error decode (#14008)

ggml-ci

b5591

05 Jun 05:35
5a8ae30
Compare
Choose a tag to compare
vulkan: automatically deduce size of push constants (#13936)

b5590

04 Jun 20:23
0d39844
Compare
Choose a tag to compare
ggml-vulkan: adds support for op CONV_TRANSPOSE_1D (#13813)

* * ggml-vulkan: adds op CONV_TRANSPOSE_1D

* test-backend-ops: adds more spohisticated tests for CONV_TRANSPOSE_1D

* Missing barrier added to shader.
Number of additional tests reduced to 108.

* * Fixes typo in variable name.

* Removes extra whitespaces.

* Adds int64->int32 casts to prevent possible warnings.

* Problem size reduced in tests to pass tests with llvmpipe.

* supports_op condition moved from unintended position

b5589

04 Jun 16:34
3e63a58
Compare
Choose a tag to compare
kv-cache : refactor the update/defrag mechanism (#13988)

* kv-cache : refactor update mechanism

ggml-ci

* memory : improve status handling

* defrag : reset head + add comments

ggml-ci

* cont : minor fixes

ggml-ci