Releases · ggml-org/llama.cpp
b5601
context : fix SWA-related warning for multiple sequences (#14045)
b5600
llama : support multiple classifier outputs and labels (#13940)
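With multiple classifier outputs, each sequence can yield one raw score per label. As a minimal sketch of consuming such scores, the snippet below maps them to labeled probabilities with a softmax; the scores and label names are hypothetical placeholders, not the output of a specific llama.cpp call:

```cpp
// Conceptual sketch: convert N raw classifier scores into labeled
// probabilities with a numerically stable softmax. Scores/labels are
// hypothetical placeholders.
#include <algorithm>
#include <cmath>
#include <cstdio>
#include <string>
#include <vector>

int main() {
    std::vector<float>       scores = { 1.2f, -0.3f, 0.7f };              // hypothetical raw outputs
    std::vector<std::string> labels = { "positive", "negative", "neutral" }; // hypothetical labels

    // Subtract the max score before exponentiating for numerical stability.
    float max_score = scores[0];
    for (float s : scores) max_score = std::max(max_score, s);

    float sum = 0.0f;
    std::vector<float> probs(scores.size());
    for (size_t i = 0; i < scores.size(); ++i) {
        probs[i] = std::exp(scores[i] - max_score);
        sum += probs[i];
    }
    for (size_t i = 0; i < probs.size(); ++i) {
        printf("%-8s : %.3f\n", labels[i].c_str(), probs[i] / sum);
    }
    return 0;
}
```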
b5598
vulkan: Enable VK_KHR_cooperative_matrix extension for Intel Xe2 GPUs…
b5596
memory : migrate from llama_kv_cache to more generic llama_memory (#1…
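The direction here, roughly, is to decouple callers from one concrete KV-cache type so other memory implementations can sit behind the same API. A conceptual C++ sketch of such an abstraction; the names (memory_i, kv_cache_impl) are hypothetical, not llama.cpp's actual types:

```cpp
// Conceptual illustration only: a generic memory interface of which a
// KV cache is one implementation. Names are hypothetical.
#include <cstdint>
#include <memory>

struct memory_i {
    virtual ~memory_i() = default;
    // Drop all state for one sequence (e.g. when a slot is reused).
    virtual void seq_rm(int32_t seq_id) = 0;
    // Release everything.
    virtual void clear() = 0;
};

struct kv_cache_impl : memory_i {
    void seq_rm(int32_t /*seq_id*/) override { /* evict the sequence's cells */ }
    void clear() override { /* reset all cells */ }
};

// Callers depend on the interface, so recurrent or hybrid memory types
// can be swapped in without touching call sites.
std::unique_ptr<memory_i> make_memory() {
    return std::make_unique<kv_cache_impl>();
}
```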
b5595
llama : allow using mmap without PrefetchVirtualMemory, apply GGML_WI…
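This follows a common Win32 pattern: PrefetchVirtualMemory only exists on Windows 8 and later, so resolving it at runtime lets an mmap-based loader still run where it is absent. A hedged sketch of that pattern (assumes building against a Windows 8+ SDK so the types are declared; not llama.cpp's exact code):

```cpp
// Sketch: resolve PrefetchVirtualMemory at runtime so mmap-based loading
// still works on Windows versions that lack it. Error handling is
// reduced to the essentials.
#include <windows.h>
#include <cstdio>

typedef BOOL (WINAPI *PrefetchVirtualMemory_t)(
    HANDLE, ULONG_PTR, PWIN32_MEMORY_RANGE_ENTRY, ULONG);

void prefetch_if_available(void * addr, SIZE_T size) {
    HMODULE kernel32 = GetModuleHandleW(L"kernel32.dll");
    auto prefetch = (PrefetchVirtualMemory_t)
        GetProcAddress(kernel32, "PrefetchVirtualMemory");
    if (prefetch == nullptr) {
        // Older Windows: skip prefetching; the mapping itself still works.
        fprintf(stderr, "PrefetchVirtualMemory unavailable, skipping\n");
        return;
    }
    WIN32_MEMORY_RANGE_ENTRY range;
    range.VirtualAddress = addr;
    range.NumberOfBytes  = size;
    prefetch(GetCurrentProcess(), 1, &range, 0);
}
```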
b5593
vocab : warn about missing mask token (#14022)
b5592
context : fix pos_min initialization upon decode error (#14008) ggml-ci
b5591
vulkan: automatically deduce size of push constants (#13936)
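For context, the push-constant size appears in two places in the Vulkan API, and a mismatch with the block declared in the shader is an easy bug; deducing the size from the shader removes that hand-maintained constant. A minimal sketch of where the size goes (the block layout below is a hypothetical example, not one of ggml's shaders):

```cpp
// Illustrates the push-constant size that gets deduced: the size passed
// to the pipeline layout and to vkCmdPushConstants must match the block
// the shader declares. A snippet, not a complete Vulkan program.
#include <vulkan/vulkan.h>
#include <cstdint>

// Hypothetical push-constant block matching a compute shader's layout.
struct push_constants {
    uint32_t ne0;
    uint32_t ne1;
    float    scale;
};

VkPushConstantRange make_range() {
    VkPushConstantRange range{};
    range.stageFlags = VK_SHADER_STAGE_COMPUTE_BIT;
    range.offset     = 0;
    range.size       = sizeof(push_constants);  // the size being deduced
    return range;
}

void push(VkCommandBuffer cmd, VkPipelineLayout layout, const push_constants & pc) {
    vkCmdPushConstants(cmd, layout, VK_SHADER_STAGE_COMPUTE_BIT,
                       0, sizeof(pc), &pc);
}
```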
b5590
ggml-vulkan: adds support for op CONV_TRANSPOSE_1D (#13813)
* ggml-vulkan: adds op CONV_TRANSPOSE_1D
* test-backend-ops: adds more sophisticated tests for CONV_TRANSPOSE_1D
* Missing barrier added to shader. Number of additional tests reduced to 108.
* Fixes typo in variable name.
* Removes extra whitespaces.
* Adds int64->int32 casts to prevent possible warnings.
* Problem size reduced in tests to pass tests with llvmpipe.
* supports_op condition moved from unintended position
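To make the new op concrete, here is a minimal sketch of building a CONV_TRANSPOSE_1D node through the public ggml API; the shapes and stride are illustrative, and tensor data is left uninitialized for brevity:

```cpp
// Sketch: build and compute a CONV_TRANSPOSE_1D node with the ggml API.
// Shapes are illustrative: kernel [K=3, C_out=8, C_in=4], input [L=16, C_in=4],
// stride 2. ggml's conv_transpose_1d currently expects p0 == 0 and d0 == 1.
#include "ggml.h"

int main() {
    ggml_init_params params = {
        /*.mem_size   =*/ 16 * 1024 * 1024,
        /*.mem_buffer =*/ nullptr,
        /*.no_alloc   =*/ false,
    };
    ggml_context * ctx = ggml_init(params);

    ggml_tensor * kernel = ggml_new_tensor_3d(ctx, GGML_TYPE_F32, 3, 8, 4);
    ggml_tensor * input  = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, 16, 4);

    // Output length per channel is (L - 1) * s0 + K = 33.
    ggml_tensor * out = ggml_conv_transpose_1d(ctx, kernel, input,
                                               /*s0 =*/ 2, /*p0 =*/ 0, /*d0 =*/ 1);

    ggml_cgraph * gf = ggml_new_graph(ctx);
    ggml_build_forward_expand(gf, out);
    ggml_graph_compute_with_ctx(ctx, gf, /*n_threads =*/ 4);

    ggml_free(ctx);
    return 0;
}
```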
b5589
kv-cache : refactor the update/defrag mechanism (#13988)
* kv-cache : refactor update mechanism ggml-ci
* memory : improve status handling
* defrag : reset head + add comments ggml-ci
* cont : minor fixes ggml-ci
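As a rough mental model of the defrag pass (not llama.cpp's actual code, which also has to move the corresponding K/V tensor data on the backend), a compaction over a hypothetical cell array:

```cpp
// Conceptual sketch of KV-cache defragmentation: occupied cells are
// compacted toward the head so free space becomes one contiguous block.
// The cell type and layout are hypothetical, not llama.cpp's internals.
#include <cstdint>
#include <vector>

struct cell {
    int32_t pos    = -1;   // -1 marks a free cell
    int32_t seq_id = -1;
};

// Returns the new head (first free index) after compaction.
size_t defrag(std::vector<cell> & cells) {
    size_t head = 0;
    for (size_t i = 0; i < cells.size(); ++i) {
        if (cells[i].pos == -1) continue;   // skip holes
        if (i != head) {
            cells[head] = cells[i];         // move occupied cell down
            cells[i] = cell{};              // old slot becomes free
        }
        ++head;
    }
    return head;                            // "reset head" from the notes above
}
```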