Releases · ngxson/llama.cpp
b5591
vulkan: automatically deduce size of push constants (#13936)
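This change infers the size of the push-constant block instead of hard-coding a byte count per pipeline. A minimal sketch of the general technique in C++, deducing the `VkPushConstantRange` size from the struct type with `sizeof` (the struct and helper names here are illustrative, not the actual llama.cpp code):

```cpp
#include <vulkan/vulkan.h>
#include <cstdint>

// Hypothetical push-constant block for a compute shader.
struct PushConstants {
    uint32_t ne00, ne01;   // input dimensions
    uint32_t ne10, ne11;   // output dimensions
    float    scale;
};

// Deduce the push-constant range from the struct type instead of
// hard-coding a byte count that can drift out of sync with the shader.
template <typename T>
VkPushConstantRange make_push_constant_range(VkShaderStageFlags stages) {
    VkPushConstantRange range{};
    range.stageFlags = stages;
    range.offset     = 0;
    range.size       = sizeof(T);   // size deduced automatically
    return range;
}

// Usage when building a pipeline layout:
//   VkPushConstantRange pcr =
//       make_push_constant_range<PushConstants>(VK_SHADER_STAGE_COMPUTE_BIT);
```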
b5590
ggml-vulkan: adds support for op CONV_TRANSPOSE_1D (#13813)
* ggml-vulkan: adds op CONV_TRANSPOSE_1D
* test-backend-ops: adds more sophisticated tests for CONV_TRANSPOSE_1D
* Missing barrier added to shader. Number of additional tests reduced to 108.
* Fixes typo in variable name.
* Removes extra whitespaces.
* Adds int64->int32 casts to prevent possible warnings.
* Problem size reduced in tests to pass tests with llvmpipe.
* supports_op condition moved from unintended position
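For context, the op being added is ggml's 1-D transposed convolution. A minimal sketch of building such a node with the ggml graph API (the tensor shapes and the stride value are illustrative assumptions; ggml currently requires padding 0 and dilation 1 for this op):

```cpp
#include "ggml.h"

int main() {
    struct ggml_init_params params = {
        /*.mem_size   =*/ 16 * 1024 * 1024,
        /*.mem_buffer =*/ NULL,
        /*.no_alloc   =*/ false,
    };
    struct ggml_context * ctx = ggml_init(params);

    // Illustrative shapes: kernel is [K, C_out, C_in], input is [L, C_in].
    struct ggml_tensor * kernel = ggml_new_tensor_3d(ctx, GGML_TYPE_F32, 3, 4, 2);
    struct ggml_tensor * input  = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, 8, 2);

    // stride s0 = 2, padding p0 = 0, dilation d0 = 1
    struct ggml_tensor * out = ggml_conv_transpose_1d(ctx, kernel, input, 2, 0, 1);

    struct ggml_cgraph * gf = ggml_new_graph(ctx);
    ggml_build_forward_expand(gf, out);
    // ... dispatch the graph on a backend (e.g. Vulkan) and read back `out`

    ggml_free(ctx);
    return 0;
}
```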
b5589
kv-cache : refactor the update/defrag mechanism (#13988)
* kv-cache : refactor update mechanism
* memory : improve status handling
* defrag : reset head + add comments
* cont : minor fixes
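Defragmentation here means compacting the occupied KV cells toward the start of the buffer so the free space becomes contiguous, after which the write head can be reset. A toy sketch of that idea (deliberately not the actual llama.cpp data structures, which also move the K/V tensor data):

```cpp
#include <vector>
#include <cstdint>

// Toy stand-in for a KV cache slot; seq_id < 0 marks a free cell.
struct kv_cell {
    int32_t seq_id = -1;
    bool is_free() const { return seq_id < 0; }
};

// Compact used cells toward the front so free space is contiguous,
// then return the new write-head position (first free cell).
static size_t defrag(std::vector<kv_cell> & cells) {
    size_t head = 0;
    for (size_t i = 0; i < cells.size(); ++i) {
        if (!cells[i].is_free()) {
            if (i != head) {
                cells[head] = cells[i];   // the real code also moves K/V data
                cells[i]    = kv_cell{};
            }
            ++head;
        }
    }
    return head;
}
```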
b5588
ci : remove cuda 11.7 releases, switch runner to windows 2022 (#13997)
b5587
releases : use dl backend for linux release, remove arm64 linux relea…
b5586
llama-graph : use ggml_repeat_4d (#13998)
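`ggml_repeat` broadcasts a tensor to the shape of a second "template" tensor, whereas `ggml_repeat_4d` takes the four target dimensions directly, so no throwaway shape tensor has to be allocated. A minimal sketch of the difference (the shapes are illustrative, and the exact call sites in llama-graph are assumed):

```cpp
#include "ggml.h"

// Broadcast a [4, 1] tensor to [4, 3] without allocating a template tensor.
struct ggml_tensor * repeat_rows(struct ggml_context * ctx, struct ggml_tensor * a) {
    // Before: a dummy tensor was needed just to describe the target shape:
    //   struct ggml_tensor * shape = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, 4, 3);
    //   return ggml_repeat(ctx, a, shape);

    // After: pass the target dimensions directly.
    return ggml_repeat_4d(ctx, a, 4, 3, 1, 1);
}
```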
b5585
CUDA: fix FTZ in FA for Gemma 3 (#13991)
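FTZ (flush-to-zero) is a floating-point mode in which subnormal values are replaced by zero, and FA is the flash-attention path. A small host-side C++ illustration of what a subnormal is and what flushing it does; this is purely explanatory and unrelated to the actual CUDA kernel change:

```cpp
#include <cstdio>
#include <cmath>
#include <limits>

int main() {
    // Smallest positive subnormal float: ~1.4e-45.
    float denorm = std::numeric_limits<float>::denorm_min();

    // With FTZ enabled, hardware treats this value as 0.0f, so any
    // expression that relies on it being nonzero changes its result.
    float ftz = (std::fpclassify(denorm) == FP_SUBNORMAL) ? 0.0f : denorm;

    std::printf("subnormal: %g  after FTZ: %g\n", denorm, ftz);
    return 0;
}
```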
b5584
kv-cache : fix unified::seq_rm to work with seq_id < 0 (#13985)
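The convention in the llama.cpp KV-cache API is that a negative `seq_id` matches all sequences (and negative `p0`/`p1` denote an open-ended range); this fix makes the unified cache honor that. A hedged usage sketch, assuming the `llama_kv_self_seq_rm` entry point from `llama.h` at this point in the series:

```cpp
#include "llama.h"

// Remove cached tokens in positions [p0, p1) for every sequence:
// seq_id < 0 matches all sequences.
void clear_range_all_seqs(struct llama_context * ctx, llama_pos p0, llama_pos p1) {
    llama_kv_self_seq_rm(ctx, /*seq_id=*/ -1, p0, p1);
}
```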
b5581
opencl: add `backend_synchronize` (#13939)
* This is not needed by the normal use where the result is read using `tensor_get`, but it allows perf mode of `test-backend-ops` to properly measure performance.
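On an asynchronous backend, enqueueing a graph can return before the device finishes, so a synchronize is needed for a wall-clock timer to measure the actual compute; that is what the perf mode of `test-backend-ops` relies on. A hedged sketch using the generic ggml backend API (graph construction elided):

```cpp
#include "ggml-backend.h"
#include <chrono>

// Time one graph evaluation on a possibly-asynchronous backend.
double time_graph_ms(ggml_backend_t backend, struct ggml_cgraph * graph) {
    ggml_backend_synchronize(backend);                 // drain prior work

    auto t0 = std::chrono::steady_clock::now();
    ggml_backend_graph_compute(backend, graph);        // may only enqueue
    ggml_backend_synchronize(backend);                 // wait for completion
    auto t1 = std::chrono::steady_clock::now();

    return std::chrono::duration<double, std::milli>(t1 - t0).count();
}
```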
b5580
OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (#13840)
* add concat, pad, repeat, tsembd, tanh, upscale
* small fixes
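These are existing ggml ops that the OpenCL backend now covers. A minimal sketch chaining a few of them with the ggml API (the shapes and pad amounts are illustrative assumptions):

```cpp
#include "ggml.h"

// Build a tiny graph using some of the newly supported ops.
struct ggml_tensor * build(struct ggml_context * ctx,
                           struct ggml_tensor * a,    // e.g. [8, 4]
                           struct ggml_tensor * b) {  // e.g. [8, 4]
    struct ggml_tensor * t = ggml_concat(ctx, a, b, /*dim=*/ 1); // -> [8, 8]
    t = ggml_tanh(ctx, t);                                       // elementwise
    t = ggml_pad(ctx, t, /*p0=*/ 2, /*p1=*/ 0, 0, 0);            // -> [10, 8]
    return t;
}
```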