Releases · ggml-org/llama.cpp
b4445
llama : avoid hardcoded QK_K (#11061) ggml-ci
b4443
ggml : allow loading backend with env variable (ggml/1059) ref: #1058
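For context: ggml exposes a dynamic backend-loading API in ggml-backend.h, and this change makes the loader consult an environment variable as well. Below is a minimal sketch of how that mechanism can be used; `ggml_backend_load()`, `ggml_backend_load_all()`, and `ggml_backend_reg_count()` are the public API, while the variable name `GGML_BACKEND_PATH` is an assumption for illustration, not necessarily the name this commit uses.

```cpp
// Sketch: load a ggml backend from a shared library chosen at runtime.
// The GGML_BACKEND_PATH variable name is an assumption for illustration.
#include <cstdio>
#include <cstdlib>
#include "ggml-backend.h"

int main() {
    // If the user pointed us at a specific backend library, load just that one...
    if (const char * path = std::getenv("GGML_BACKEND_PATH")) {
        ggml_backend_reg_t reg = ggml_backend_load(path);
        if (!reg) {
            std::fprintf(stderr, "failed to load backend from %s\n", path);
            return 1;
        }
    } else {
        // ...otherwise register every backend found in the default search paths.
        ggml_backend_load_all();
    }
    std::printf("registered %zu backend(s)\n", ggml_backend_reg_count());
    return 0;
}
```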
b4440
llamafile : ppc64le MMA INT8 implementation (#10912) This change upstreams llamafile's CPU matrix multiplication kernels for ppc64le, using MMA builtins for the quantised int8 datatype. It results in a 10%-70% improvement in total speed (i.e. all tokens / total time) across various batch sizes. The patch was tested with the Meta-Llama-3-8B, Mistral-7B, and Llama-2-7B-chat-hf models on an IBM POWER10 machine. Signed-off-by: Amrita H S <[email protected]>
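For background, the kernels in this patch are built on the POWER10 Matrix-Multiply Assist (MMA) builtins. Here is a minimal standalone sketch (not code from the PR) of the primitive such kernels tile over: a single rank-4 int8 outer-product accumulate into a 4x4 int32 tile. It assumes GCC targeting POWER10.

```cpp
// Build with: g++ -mcpu=power10 -O2 mma_demo.cpp  (MMA requires POWER10)
#include <altivec.h>
#include <cstdio>

int main() {
    // 16 int8 values per operand; xvi8ger4 consumes each as a 4x4 tile,
    // accumulating 4 byte-products into every int32 result element.
    vector unsigned char a = vec_splats((unsigned char)2);
    vector unsigned char b = vec_splats((unsigned char)3);

    __vector_quad acc;                      // 512-bit accumulator register
    __builtin_mma_xxsetaccz(&acc);          // zero the 4x4 int32 tile
    __builtin_mma_xvi8ger4pp(&acc, a, b);   // acc += rank-4 outer product of a, b

    vector signed int rows[4];
    __builtin_mma_disassemble_acc(rows, &acc);
    std::printf("acc[0][0] = %d (expect 2*3*4 = 24)\n", rows[0][0]);
    return 0;
}
```

Each `xvi8ger4pp` call folds four int8 products into every element of the accumulator tile, which is what makes the instruction attractive for quantised int8 matmul.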
b4439
ci : fix cmake option (#11125)
b4438
Disable GL_KHR_cooperative_matrix Vulkan extension if not available. …
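The GLSL extension GL_KHR_cooperative_matrix can only be used when the device exposes the corresponding Vulkan device extension, VK_KHR_cooperative_matrix. A generic sketch of such a runtime check (illustrative only, not the llama.cpp implementation):

```cpp
// Query the device's extension list and only enable cooperative-matrix
// shader variants when VK_KHR_cooperative_matrix is actually present.
#include <vulkan/vulkan.h>
#include <cstring>
#include <vector>

bool has_cooperative_matrix(VkPhysicalDevice dev) {
    uint32_t count = 0;
    vkEnumerateDeviceExtensionProperties(dev, nullptr, &count, nullptr);
    std::vector<VkExtensionProperties> exts(count);
    vkEnumerateDeviceExtensionProperties(dev, nullptr, &count, exts.data());
    for (const auto & e : exts) {
        if (std::strcmp(e.extensionName, "VK_KHR_cooperative_matrix") == 0) {
            return true;  // safe to compile/dispatch coopmat shader variants
        }
    }
    return false;         // fall back to plain matrix-multiply shaders
}
```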
b4437
fix: Vulkan shader gen binary path when cross-compiling (#11096)
b4435
ggml-backend : only offload from host buffers (fix) (#11124)
b4434
ggml-backend : only offload from host buffers (#11120)
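Together with the follow-up fix in b4435 above, the point of this change is that the scheduler should only offload tensor data that actually resides in a host (CPU-addressable) buffer. A hedged sketch of the kind of guard this implies, using the public `ggml_backend_buffer_is_host()` predicate; the wrapper function itself is illustrative only:

```cpp
#include "ggml.h"
#include "ggml-backend.h"

// Illustrative guard: tensors already placed in device memory (or with no
// buffer assigned yet) must not be treated as host data to copy from.
static bool can_offload(const struct ggml_tensor * t) {
    return t->buffer != nullptr && ggml_backend_buffer_is_host(t->buffer);
}
```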
b4433
rpc : code cleanup (#11107) Remove duplicated macros and use GGML_LOG_ERROR for errors.
b4432
SYCL: Use get_multi_ptr instead of deprecated get_pointer in wkv6 (#1…
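For context: `accessor::get_pointer()` is deprecated in SYCL 2020 in favor of the explicitly decorated `get_multi_ptr<>()`. A before/after sketch of this migration (illustrative only, not the wkv6 kernel itself):

```cpp
#include <sycl/sycl.hpp>

// Scale an array in place on the device, showing the SYCL 2020 pointer API.
void scale_in_place(sycl::queue & q, float * data, size_t n) {
    sycl::buffer<float, 1> buf(data, sycl::range<1>(n));
    q.submit([&](sycl::handler & cgh) {
        sycl::accessor acc(buf, cgh, sycl::read_write);
        cgh.parallel_for(sycl::range<1>(n), [=](sycl::id<1> i) {
            // Deprecated SYCL 1.2.1 style:  float * p = acc.get_pointer();
            // SYCL 2020 replacement, with an explicit decoration choice:
            float * p = acc.get_multi_ptr<sycl::access::decorated::no>().get();
            p[i] *= 2.0f;
        });
    });
    q.wait();
}
```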