Releases · ngxson/llama.cpp

27 May 20:28

a3c3084

b5516

opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, …

Assets 18

27 May 20:23

github-actions

b5515

1701d4c

b5515

opencl: mark `mul_mat` `f32f32` as supporting non-contiguous tensors …

Assets 18

27 May 17:13

github-actions

b5514

bef8176

b5514

vulkan: use timestamp queries for GGML_VULKAN_PERF (#13817)

Also change it to be controlled by an env var rather than cmake flag

Assets 18

27 May 16:36

github-actions

b5513

34b7c04

b5513

cmake : add llama-cparams.cpp to build (#13832)

Assets 18

27 May 15:50

github-actions

b5512

f3101a8

b5512

SYCL: add gelu_erf kernel (#13749)

* SYCL: add gelu_erf kernel

* refactor code

Co-authored-by: Atharva Dubey <[email protected]>

* Use scope_op_debug_print

---------

Co-authored-by: Atharva Dubey <[email protected]>

Assets 18

27 May 15:01

github-actions

b5510

a8ea03d

b5510

ggml : add ggml_repeat_4d (#13824)

Assets 18

27 May 14:02

github-actions

b5509

05f6ac6

b5509

ggml : riscv: add xtheadvector support (#13720)

* ggml : riscv: add xtheadvector support

* ggml : clean up some macro usage

Assets 18

27 May 12:23

github-actions

b5508

bc583e3

b5508

mtmd : support Qwen 2.5 Omni (input audio+vision, no audio output) (#…

Assets 18

27 May 11:43

github-actions

b5506

7fe03e7

b5506

ggml-cpu: x86 feature detection is specific to x86 (#13811)

Assets 18

27 May 11:35

github-actions

b5505

952f395

b5505

ggml : allow CUDA graphs when using pipeline parallelism (#13814)

Assets 18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Releases: ngxson/llama.cpp

b5516

Uh oh!

b5515

Uh oh!

b5514

Uh oh!

b5513

Uh oh!

b5512

Uh oh!

b5510

Uh oh!

b5509

Uh oh!

b5508

Uh oh!

b5506

Uh oh!

b5505

Uh oh!