Releases · ggml-org/llama.cpp
b5573
sycl: quantize and reorder the input to q8_1 when reorder is enabled …
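For context on what this touches: in q8_1, each block of 32 floats is stored as int8 quants plus a scale and a precomputed sum, which is what makes a pre-quantized, reordered q8_1 input cheap to dot against. A minimal sketch of that quantization step, assuming the usual 32-wide block layout (the struct stores the scale and sum as plain floats for clarity; this is not the SYCL backend's actual code):

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>

// Illustrative q8_1 block: 32 int8 quants plus a scale d and
// s = d * sum(quants), so dot products can fold in offset terms
// cheaply. (Sketch only; ggml stores d and s as fp16.)
struct block_q8_1 {
    float  d;       // scale
    float  s;       // d * sum of the 32 quants
    int8_t qs[32];  // quantized values
};

static block_q8_1 quantize_block_q8_1(const float * x) {
    float amax = 0.0f;
    for (int i = 0; i < 32; ++i) {
        amax = std::max(amax, std::fabs(x[i]));
    }
    block_q8_1 b{};
    b.d = amax / 127.0f;
    const float id = b.d != 0.0f ? 1.0f / b.d : 0.0f;
    int sum = 0;
    for (int i = 0; i < 32; ++i) {
        b.qs[i] = (int8_t) std::lround(x[i] * id);
        sum += b.qs[i];
    }
    b.s = b.d * sum;
    return b;
}
```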
b5572
gguf: fix failure on version == 0 (#13956)
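The GGUF header carries a uint32 version directly after the magic, and valid versions start at 1, so a loader can reject version 0 up front with a clear message instead of failing later. A minimal sketch of that kind of guard (the function is illustrative, not the actual gguf reader):

```cpp
#include <cstdint>
#include <cstdio>

// Hypothetical header check: version 0 has never been a valid GGUF
// version, so treat it as a corrupt or non-GGUF file explicitly.
static bool gguf_check_version(uint32_t version) {
    if (version == 0) {
        std::fprintf(stderr,
            "gguf: invalid version 0 (file is corrupt or not GGUF)\n");
        return false;
    }
    return true;
}
```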
b5571
convert : fix nomic-bert-moe mask token (#13757)
b5569
ggml: check if non-native endian model is being loaded (#13943)
* gguf: prevent non-native endian models from being loaded
* gguf: update error message
* gguf: make the non-native endian check more verbose
* ggml: move ggml_assert location
* ggml: reword the endianness check error message
Signed-off-by: Aaron Teo <[email protected]>
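A byte-swapped GGUF file can be spotted cheaply from the header's version field: real versions are small integers, so a value with bits set only in its high bytes almost certainly came from a machine of the opposite endianness. A minimal sketch of that heuristic (illustrative; the loader's real check and error wording differ):

```cpp
#include <cstdint>
#include <cstdio>

// Byte-swap helper for a 32-bit value.
static uint32_t bswap32(uint32_t v) {
    return (v >> 24) | ((v >> 8) & 0x0000FF00u) |
           ((v << 8) & 0x00FF0000u) | (v << 24);
}

// Heuristic endianness check on the GGUF version field: valid
// versions are small (1, 2, 3, ...), so a nonzero value whose low
// 16 bits are zero is almost certainly a byte-swapped file.
static bool gguf_version_native(uint32_t version) {
    if (version != 0 && (version & 0xFFFFu) == 0) {
        std::fprintf(stderr,
            "gguf: model appears to be byte-swapped (version reads as %u, "
            "%u after swapping); convert it to this machine's endianness "
            "before loading\n", version, bswap32(version));
        return false;
    }
    return true;
}
```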
b5568
sync : ggml ggml-ci
b5560
parallel : fix n_junk == 0 (#13952)
b5559
kv-cache : split implementation in separate sources (#13920) ggml-ci
b5558
threading: support for GGML_SCHED_PRIO_LOW, update thread info on Win…
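A low scheduler priority maps naturally onto the Win32 thread priority API. A minimal sketch of that mapping, assuming an enum in the spirit of ggml's GGML_SCHED_PRIO_* levels (the enum and function here are illustrative, not ggml's actual threading code, which also covers realtime levels and non-Windows platforms):

```cpp
#ifdef _WIN32
#include <windows.h>

// Illustrative priority levels in the style of GGML_SCHED_PRIO_*.
enum sched_prio { PRIO_LOW, PRIO_NORMAL, PRIO_HIGH };

// Map a scheduler priority onto SetThreadPriority for the calling
// thread; returns false if the Win32 call fails.
static bool set_current_thread_priority(sched_prio p) {
    int win_prio = THREAD_PRIORITY_NORMAL;
    switch (p) {
        case PRIO_LOW:    win_prio = THREAD_PRIORITY_BELOW_NORMAL; break;
        case PRIO_NORMAL: win_prio = THREAD_PRIORITY_NORMAL;       break;
        case PRIO_HIGH:   win_prio = THREAD_PRIORITY_ABOVE_NORMAL; break;
    }
    return SetThreadPriority(GetCurrentThread(), win_prio) != 0;
}
#endif
```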
b5556
server: allow unclosed thinking tags (#13931)
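Reasoning models wrap chain-of-thought in <think>...</think> tags, but a truncated or still-streaming generation may never emit the closing tag, so a tolerant parser treats everything after an unmatched opener as reasoning content rather than rejecting the response. A minimal sketch of that behavior, assuming these tag strings (the splitting function is illustrative, not the server's actual parser):

```cpp
#include <string>
#include <utility>

// Split model output into (reasoning, content). If </think> is
// missing -- e.g. generation was cut off -- treat everything after
// <think> as reasoning instead of erroring out.
static std::pair<std::string, std::string> split_thinking(const std::string & out) {
    const std::string open  = "<think>";
    const std::string close = "</think>";
    const size_t b = out.find(open);
    if (b == std::string::npos) {
        return {"", out}; // no thinking block at all
    }
    const size_t start = b + open.size();
    const size_t e = out.find(close, start);
    if (e == std::string::npos) {
        // unclosed tag: accept it as reasoning-only output
        return {out.substr(start), ""};
    }
    return {out.substr(start, e - start), out.substr(e + close.size())};
}
```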
b5555
llama : deprecate explicit kv_self defrag/update calls (#13921) ggml-ci