Releases: ggml-org/llama.cpp

b4231

01 Dec 03:57
43957ef
build: update Makefile comments for C++ version change (#10598)

b4230

30 Nov 17:49
0c39f44
ggml-cpu: replace AArch64 NEON assembly with intrinsics in ggml_gemv_…

b4227

30 Nov 07:34
0533e7f
vulkan: Dynamic subgroup size support for Q6_K mat_vec (#10536)

* subgroup 64 version with subgroup add. 15% faster

scalable version

tested for subgroup sizes 16-128

* check for subgroup multiple of 16 and greater than 16

* subgroup sizes are always a power of 2 (https://github.com/KhronosGroup/GLSL/issues/45)

* force 16 sequential threads per block

* make 16 subgroup size a constant

b4226

29 Nov 21:28
7cc2d2c
ggml : move AMX to the CPU backend (#10570)

* ggml : move AMX to the CPU backend

---------

Co-authored-by: Georgi Gerganov

b4224

29 Nov 18:04
3a8e9af
imatrix : support combine-only (#10492)

* imatrix-combine-only idea

* ensured that behavior is consistent with log

b4222

29 Nov 15:00
f0678c5
ggml : fix I8MM Q4_1 scaling factor conversion (#10562)

ggml-ci

b4221

29 Nov 14:59
4b3242b
ggml-cpu: fix typo in gemv/gemm iq4_nl_4_4 (#10580)

b4220

29 Nov 13:22
0f77aae
sycl : offload of get_rows set to 0 (#10432)

b4219

29 Nov 10:25
266b851
sycl : Reroute permuted mul_mats through oneMKL (#10408)

This PR fixes the failing MUL_MAT tests for the sycl backend.

b4218

29 Nov 07:26
938f608
CANN: RoPE operator optimization (#10563)

* [cann] RoPE operator optimization

* [CANN] Code Formatting

---------

Co-authored-by: noemotiovon