
Releases: ggml-org/llama.cpp

b5571 (01 Jun 16:43, commit 5e1c3ae)
convert : fix nomic-bert-moe mask token (#13757)

b5569 (01 Jun 15:10, commit e57bb87)
ggml: check if non-native endian model is being loaded (#13943)

* gguf: prevent non-native endian models from being loaded
* gguf: update error message
* gguf: make the non-native endian check more verbose
* ggml: move ggml_assert location
* ggml: reword the endianness check error message

Signed-off-by: Aaron Teo <[email protected]>
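The guard hinges on the fact that GGUF header fields only decode to sensible values in the file's own byte order. A minimal sketch of such a check follows, assuming a hypothetical check_gguf_endianness helper rather than the actual gguf loader code:

```cpp
// Illustrative only: a GGUF version is a small integer, so a value that
// only looks small after byte-swapping suggests the file was written on
// an opposite-endian machine.
#include <cstdint>
#include <cstdio>
#include <cstdlib>

static uint32_t byteswap32(uint32_t v) {
    return (v >> 24) | ((v >> 8) & 0x0000FF00u) |
           ((v << 8) & 0x00FF0000u) | (v << 24);
}

static void check_gguf_endianness(uint32_t version) { // hypothetical helper
    const uint32_t swapped = byteswap32(version);
    if (version > 0xFFFFu && swapped <= 0xFFFFu) {
        fprintf(stderr,
            "fatal: model endianness does not match this machine "
            "(version reads as %u, byte-swapped %u); refusing to load\n",
            version, swapped);
        exit(1);
    }
}
```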

b5568 (01 Jun 12:27)
sync : ggml

ggml-ci

b5560 (01 Jun 10:42, commit c046217)
parallel : fix n_junk == 0 (#13952)
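The title suggests a degenerate-input guard. A hedged guess at the class of bug, not taken from the patch itself: selecting a random junk token with a modulo divides by zero when n_junk is 0.

```cpp
// Hypothetical illustration of the failure class, not the actual fix:
// `rand() % n_junk` is undefined when n_junk == 0, so guard that case.
#include <cstdlib>

static int pick_junk_token(int n_junk) {
    if (n_junk <= 0) {
        return -1; // nothing to pick; previously this path divided by zero
    }
    return rand() % n_junk;
}
```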

b5559 (01 Jun 09:32, commit 0fc16b4)
kv-cache : split implementation in separate sources (#13920)

ggml-ci

b5558 (31 May 23:57, commit 053b153)
threading: support for GGML_SCHED_PRIO_LOW, update thread info on Win…
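The title names a new GGML_SCHED_PRIO_LOW priority level. Below is a sketch of how a low priority typically maps to OS primitives, as an assumption about the usual approach rather than ggml's actual code:

```cpp
// Assumed mapping of a "low" scheduling priority to OS calls; not ggml's
// actual implementation.
#ifdef _WIN32
#include <windows.h>
static void set_low_thread_priority(void) {
    SetThreadPriority(GetCurrentThread(), THREAD_PRIORITY_BELOW_NORMAL);
}
#else
#include <sys/resource.h>
static void set_low_thread_priority(void) {
    setpriority(PRIO_PROCESS, 0, 10); // higher nice value = lower priority
}
#endif
```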

b5556 (31 May 15:55, commit e15898d)
server: allow unclosed thinking tags (#13931)
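A sketch of the tolerant-parsing idea: when the closing tag never arrives, treat the remainder as reasoning content instead of rejecting the output. The tag name and helper below are illustrative assumptions, not the server's actual parser.

```cpp
// Illustrative: split model output into (reasoning, content), accepting an
// unclosed <think> tag.
#include <string>
#include <utility>

static std::pair<std::string, std::string> split_thinking(const std::string & out) {
    const std::string open  = "<think>";
    const std::string close = "</think>";
    const size_t b = out.find(open);
    if (b == std::string::npos) {
        return {"", out}; // no reasoning block at all
    }
    const size_t start = b + open.size();
    const size_t e = out.find(close, start);
    if (e == std::string::npos) {
        // unclosed tag: treat the rest as reasoning instead of failing
        return {out.substr(start), ""};
    }
    return {out.substr(start, e - start), out.substr(e + close.size())};
}
```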

b5555 (31 May 13:47, commit 803f8ba)
llama : deprecate explicit kv_self defrag/update calls (#13921)

ggml-ci
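A hedged sketch of the usage change; the llama_kv_self_* names come from the commit title and are assumed to match llama.h at this revision.

```cpp
// before (explicit maintenance, now deprecated):
//   llama_kv_self_defrag(ctx);
//   llama_kv_self_update(ctx);
//   llama_decode(ctx, batch);
//
// after (maintenance handled implicitly inside decode):
//   llama_decode(ctx, batch);
```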

b5554 (31 May 13:37, commit 3600cc2)
llama : use n_swa + n_ubatch cells for SWA cache (#13833)

* llama : use n_swa + n_ubatch cells for SWA cache

ggml-ci

* llama : add warning about multi-sequence SWA contexts
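A back-of-the-envelope illustration of why this shrinks the cache; the numbers are hypothetical:

```cpp
// Hypothetical sizes showing the effect of allocating n_swa + n_ubatch
// cells for the SWA cache instead of the full context.
#include <cstdio>

int main() {
    const int n_ctx    = 32768; // full context length
    const int n_swa    = 4096;  // sliding-window size
    const int n_ubatch = 512;   // micro-batch size
    printf("cells if sized to full context: %d\n", n_ctx);            // 32768
    printf("cells with this change:         %d\n", n_swa + n_ubatch); // 4608
    return 0;
}
```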

b5552 (31 May 10:56, commit 3f55f78)
llama : auto-batch preparation (#13845)

* llama : auto-batch

ggml-ci

* context : simplify if branching
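A hedged sketch of what "auto-batch" preparation can mean: if a batch cannot be placed as-is, retry with progressively smaller micro-batches. process_ubatch below is a hypothetical stand-in for the actual placement step.

```cpp
#include <algorithm>

// hypothetical placement step; returns false when the ubatch does not fit
static bool process_ubatch(int /*start*/, int /*n_tokens*/) { return true; }

// split the batch into micro-batches; on failure, halve the micro-batch
// size and retry instead of failing outright
static bool decode_with_auto_batch(int n_tokens, int n_ubatch) {
    while (n_ubatch >= 1) {
        bool ok = true;
        for (int i = 0; i < n_tokens && ok; i += n_ubatch) {
            ok = process_ubatch(i, std::min(n_ubatch, n_tokens - i));
        }
        if (ok) {
            return true;
        }
        n_ubatch /= 2; // no room at this size: halve and try again
    }
    return false;
}
```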