SYCL: Disable mul_mat kernels for noncontiguous tensor b #13308

qnixsynapse · 2025-05-05T02:49:42Z

Tests for a non-contiguous tensor b were added in b0ecbd4, which the SYCL mul_mat kernels do not seem to support.

Disable the kernels for this case for now, until we work on a fix.
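To illustrate what "non-contiguous tensor b" means here, the following is a minimal sketch and not the actual diff of this PR. It assumes a simplified, hypothetical tensor descriptor (`TensorDesc`) with ggml-style per-dimension element counts (`ne`) and byte strides (`nb`); the helper names `make_contiguous`, `permute01`, `is_contiguous`, and `backend_supports_mul_mat` are invented for illustration and are not ggml or SYCL backend APIs.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <utility>

// Hypothetical, simplified stand-in for a ggml-style tensor descriptor.
struct TensorDesc {
    std::array<int64_t, 4> ne;        // number of elements in each dimension
    std::array<size_t, 4>  nb;        // stride in bytes for each dimension
    size_t                 elem_size; // size of one element in bytes
};

// Build a densely packed descriptor: each stride is the previous stride
// multiplied by the previous dimension's element count.
inline TensorDesc make_contiguous(std::array<int64_t, 4> ne, size_t elem_size) {
    TensorDesc t{ne, {}, elem_size};
    t.nb[0] = elem_size;
    for (int i = 1; i < 4; ++i) {
        t.nb[i] = t.nb[i - 1] * static_cast<size_t>(ne[i - 1]);
    }
    return t;
}

// Swap dimensions 0 and 1 without moving data, as a permuted view would.
inline TensorDesc permute01(TensorDesc t) {
    std::swap(t.ne[0], t.ne[1]);
    std::swap(t.nb[0], t.nb[1]);
    return t;
}

// A tensor is treated as contiguous when its strides match dense packing.
// Dimensions of size 1 are skipped because their stride is never used.
inline bool is_contiguous(const TensorDesc & t) {
    size_t expected = t.elem_size;
    for (int i = 0; i < 4; ++i) {
        if (t.ne[i] != 1 && t.nb[i] != expected) {
            return false;
        }
        expected *= static_cast<size_t>(t.ne[i]);
    }
    return true;
}

// Hypothetical capability check: report mul_mat as unsupported when the
// second operand (tensor b) is not contiguous, so another backend handles it.
inline bool backend_supports_mul_mat(const TensorDesc & b) {
    return is_contiguous(b);
}
```

Under these assumptions, a freshly allocated `b` passes the check, while a permuted view of the same data fails it, so the op would be reported as unsupported for that view.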

ggml-ci

Alcpz

This may cause a performance regression for certain models, since the op will most likely fall back to the CPU. I'll try to find some time to look into this.

qnixsynapse · 2025-05-05T08:08:42Z

> This may cause a performance regression for certain models, since the op will most likely fall back to the CPU. I'll try to find some time to look into this.

Thank you. I'll leave mul_mat to you guys.

* origin/master: (27 commits)
  llama : fix build_ffn without gate (ggml-org#13336)
  CUDA: fix bad asserts for partial offload (ggml-org#13337)
  convert : qwen2/3moe : set yarn metadata if present (ggml-org#13331)
  CUDA: fix --split-mode row for MMQ (ggml-org#13323)
  gguf-py : avoid requiring pyside6 for other scripts (ggml-org#13036)
  CUDA: fix logic for clearing padding with -ngl 0 (ggml-org#13320)
  sampling : Integrate Top-nσ into main sampling chain (and add it to the server) (ggml-org#13264)
  server : Webui - change setText command from parent window to also send the message. (ggml-org#13309)
  mtmd : rename llava directory to mtmd (ggml-org#13311)
  clip : fix confused naming ffn_up and ffn_down (ggml-org#13290)
  convert : bailingmoe : set yarn metadata if present (ggml-org#13312)
  SYCL: Disable mul_mat kernels for noncontiguous tensor b (ggml-org#13308)
  mtmd : add C public API (ggml-org#13184)
  rpc : use backend registry, support dl backends (ggml-org#13304)
  ggml : activate s390x simd for Q3_K (ggml-org#13301)
  llava/mtmd : fixes to fully support dl backends (ggml-org#13303)
  llama : build windows releases with dl backends (ggml-org#13220)
  CUDA: fix race condition in MMQ stream-k fixup (ggml-org#13299)
  CUDA: fix race condition in MMQ ids_dst (ggml-org#13294)
  vulkan: Additional type support for unary, binary, and copy (ggml-org#13266)
  ...

SYCL: Disable mul_mat kernels for noncontiguous tensor b

ca51f13

ggml-ci

github-actions bot added the ggml (changes relating to the ggml tensor library for machine learning) and SYCL (https://en.wikipedia.org/wiki/SYCL - GPU programming language) labels May 5, 2025

qnixsynapse requested review from NeoZhangJianyu and Alcpz May 5, 2025 02:50

Alcpz approved these changes May 5, 2025

qnixsynapse merged commit 66645a5 into master May 5, 2025
53 checks passed

qnixsynapse deleted the sycl/fix_ci branch May 5, 2025 08:09

Alcpz mentioned this pull request May 6, 2025

sycl: addressing non-contiguous src1 mul_mats (nc and batched) #13343

Merged
