Skip to content

Releases: ngxson/llama.cpp

b4209

28 Nov 13:53
76b27d2
Compare
Choose a tag to compare
ggml : fix row condition for i8mm kernels (#10561)

ggml-ci

b4206

28 Nov 12:47
2025fa6
Compare
Choose a tag to compare
kompute : improve backend to pass test_backend_ops (#10542)

* kompute: op_unary: reject unsupported parameters

Signed-off-by: Sergio Lopez <[email protected]>

* kompute: softmax: implement ALiBi support

Signed-off-by: Sergio Lopez <[email protected]>

* kompute: rope: implement neox and phi3 support

Signed-off-by: Sergio Lopez <[email protected]>

* kompute: op_mul_mat_q4_k permutted support

Signed-off-by: Sergio Lopez <[email protected]>

* kompute: op_mul_mat_[q4_0|q4_1|q8_0] permutted support

Signed-off-by: Sergio Lopez <[email protected]>

* kompute: op_mul_mat_f16 permutted support

Signed-off-by: Sergio Lopez <[email protected]>

* kompute: op_mul_mat_q6_k permutted support

Signed-off-by: Sergio Lopez <[email protected]>

---------

Signed-off-by: Sergio Lopez <[email protected]>

b4205

28 Nov 08:50
c6bc739
Compare
Choose a tag to compare
CANN: Update cann.md to display correctly in CLion (#10538)

b4203

28 Nov 07:47
b742013
Compare
Choose a tag to compare
CANN: ROPE operator optimization (#10540)

* [cann] ROPE operator optimization

Co-authored-by: noemotiovon <[email protected]>

b4202

27 Nov 22:45
9f91251
Compare
Choose a tag to compare
common : fix duplicated file name with hf_repo and hf_file (#10550)

b4201

27 Nov 17:51
3ad5451
Compare
Choose a tag to compare
Add some minimal optimizations for CDNA (#10498)

* Add some minimal optimizations for CDNA

* ggml_cuda: set launch bounds also for GCN as it helps there too

b4200

27 Nov 10:53
46c69e0
Compare
Choose a tag to compare
ci : faster CUDA toolkit installation method and use ccache (#10537)

* ci : faster CUDA toolkit installation method and use ccache

* remove fetch-depth

* only pack CUDA runtime on master

b4196

27 Nov 08:57
c31ed2a
Compare
Choose a tag to compare
vulkan: define all quant data structures in types.comp (#10440)

b4177

26 Nov 12:04
811872a
Compare
Choose a tag to compare
speculative : simplify the implementation (#10504)

ggml-ci

b4175

26 Nov 11:21
7066b4c
Compare
Choose a tag to compare
CANN: RoPE and CANCAT operator optimization (#10488)

Co-authored-by: noemotiovon <[email protected]>