Releases · ggml-org/llama.cpp
b4231
build: update Makefile comments for C++ version change (#10598)
b4230
ggml-cpu: replace AArch64 NEON assembly with intrinsics in ggml_gemv_…
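This entry replaces hand-written AArch64 assembly with compiler intrinsics. As an illustration of the style of change only, here is a minimal int8 dot-product kernel written with NEON intrinsics; the function name is hypothetical and this is not the actual ggml_gemv code.

```cpp
// Illustrative only: a hypothetical int8 dot product written with AArch64
// NEON intrinsics rather than inline assembly (not the actual ggml kernel).
// Assumes n is a multiple of 16.
#include <arm_neon.h>
#include <cstdint>

int32_t dot_i8_neon(const int8_t *a, const int8_t *b, int n) {
    int32x4_t acc = vdupq_n_s32(0);
    for (int i = 0; i < n; i += 16) {
        int8x16_t va = vld1q_s8(a + i);
        int8x16_t vb = vld1q_s8(b + i);
        // widening multiplies on the low and high 8-lane halves
        int16x8_t lo = vmull_s8(vget_low_s8(va),  vget_low_s8(vb));
        int16x8_t hi = vmull_s8(vget_high_s8(va), vget_high_s8(vb));
        // pairwise widen to 32-bit lanes and accumulate
        acc = vaddq_s32(acc, vpaddlq_s16(lo));
        acc = vaddq_s32(acc, vpaddlq_s16(hi));
    }
    return vaddvq_s32(acc); // horizontal sum of the four 32-bit lanes
}
```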
b4227
vulkan: Dynamic subgroup size support for Q6_K mat_vec (#10536)
* subgroup 64 version with subgroup add; 15% faster scalable version, tested for subgroup sizes 16-128
* check that the subgroup size is a multiple of 16 and greater than 16
* subgroup sizes are always a power of 2 (https://github.com/KhronosGroup/GLSL/issues/45)
* force 16 sequential threads per block
* make the subgroup size of 16 a constant
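A rough sketch of the host-side check these constraints imply, using the standard Vulkan 1.1 subgroup query; the helper name is illustrative and this is not the actual llama.cpp dispatch logic.

```cpp
// Hypothetical sketch (not the actual llama.cpp code): query the device
// subgroup size via Vulkan 1.1 properties and decide whether the
// subgroup-add Q6_K mat_vec variant applies.
#include <vulkan/vulkan.h>
#include <cstdint>

bool q6k_subgroup_variant_ok(VkPhysicalDevice dev) {
    VkPhysicalDeviceSubgroupProperties sub = {};
    sub.sType = VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_SUBGROUP_PROPERTIES;

    VkPhysicalDeviceProperties2 props = {};
    props.sType = VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_PROPERTIES_2;
    props.pNext = &sub;
    vkGetPhysicalDeviceProperties2(dev, &props);

    const uint32_t s = sub.subgroupSize;
    // subgroup sizes are always powers of 2, so together with the
    // multiple-of-16 check this covers the tested range 16..128
    return s >= 16 && s % 16 == 0 && (s & (s - 1)) == 0;
}
```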
b4226
ggml : move AMX to the CPU backend (#10570)
Co-authored-by: Georgi Gerganov <[email protected]>
b4224
imatrix : support combine-only (#10492)
* imatrix-combine-only idea
* ensured that the behavior is consistent with the log
b4222
ggml : fix I8MM Q4_1 scaling factor conversion (#10562)
ggml-ci
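For orientation, a Q4_1 block stores an fp16 scale d and an fp16 min m alongside 32 packed 4-bit quants, and dequantization converts both to fp32 before computing x = d*q + m. A simplified reference sketch follows; the names are illustrative and this is not the fixed I8MM path itself.

```cpp
// Simplified Q4_1 dequantization reference (illustrative names; the real
// code lives in ggml's quantization sources). The fp16 -> fp32 scale
// conversion shown here is the kind of step the fix above concerns.
#include <cstdint>
#include <cstring>

#define QK4_1 32

struct block_q4_1 {
    uint16_t d;             // scale, stored as fp16
    uint16_t m;             // min, stored as fp16
    uint8_t  qs[QK4_1 / 2]; // two 4-bit quants per byte
};

// minimal fp16 -> fp32 for normal values (stand-in for ggml_fp16_to_fp32)
static float fp16_to_fp32(uint16_t h) {
    const uint32_t sign = (uint32_t)(h & 0x8000) << 16;
    const uint32_t exp  = (h >> 10) & 0x1F;
    const uint32_t mant = h & 0x3FF;
    uint32_t bits;
    if (exp == 0) {
        bits = sign;                              // zero; denormals dropped
    } else if (exp == 31) {
        bits = sign | 0x7F800000u | (mant << 13); // inf / NaN
    } else {
        bits = sign | ((exp + 112) << 23) | (mant << 13); // rebias 15 to 127
    }
    float out;
    std::memcpy(&out, &bits, sizeof out);
    return out;
}

void dequantize_q4_1(const block_q4_1 *b, float *y) {
    const float d = fp16_to_fp32(b->d); // the scale conversion step
    const float m = fp16_to_fp32(b->m);
    for (int j = 0; j < QK4_1 / 2; ++j) {
        y[j]             = d * (float)(b->qs[j] & 0x0F) + m; // low nibble
        y[j + QK4_1 / 2] = d * (float)(b->qs[j] >>   4) + m; // high nibble
    }
}
```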
b4221
ggml-cpu: fix typo in gemv/gemm iq4_nl_4_4 (#10580)
b4220
sycl : offload of get_rows set to 0 (#10432)
b4219
sycl : Reroute permuted mul_mats through oneMKL (#10408)
This PR fixes the failing MUL_MAT tests for the SYCL backend.
b4218
CANN: RoPE operator optimization (#10563)
* [cann] RoPE operator optimization
* [CANN] code formatting
Co-authored-by: noemotiovon <[email protected]>
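As background on the operator being optimized, here is a naive scalar reference of rotary position embedding (RoPE) in its textbook form; this is a sketch, not the CANN kernel.

```cpp
// Naive scalar RoPE reference: rotates consecutive pairs of an embedding
// of size n_dims by a position-dependent angle (textbook form only).
#include <cmath>

void rope_ref(float *x, int n_dims, int pos, float freq_base = 10000.0f) {
    for (int i = 0; i < n_dims; i += 2) {
        const float theta = pos * std::pow(freq_base, -(float)i / n_dims);
        const float c = std::cos(theta);
        const float s = std::sin(theta);
        const float x0 = x[i];
        const float x1 = x[i + 1];
        x[i]     = x0 * c - x1 * s;
        x[i + 1] = x0 * s + x1 * c;
    }
}
```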