ggml-quants: Provide ggml_vqtbl1q_u8 for 64bit compatibility #5711

rgryta · 2024-02-25T12:40:48Z

Additionally unblocks Android example and workflow build for Android build with armeabi-v7a target.

Function vqtbl1q_u8 is not available under neon ARM-V7.

rgryta · 2024-02-25T13:00:13Z

android-build workflow will fail as it's fetching the current llama.cpp dependency from master (which of course doesn't have the neon v7a fix for now)

rgryta · 2024-02-25T17:01:56Z

@ggerganov, let me know if you'd rather I split this PR into two separate ones instead (one that adds the ARM-V check and then another one that updates the build workflow).
I've tested the example workflow locally by changing the CMakeLists.txt dependency to my fork instead and everything seems to be correct.

ggerganov · 2024-02-25T17:13:11Z

ggml-quants.c

@@ -9452,7 +9452,7 @@ void ggml_vec_dot_iq3_s_q8_K (int n, float * GGML_RESTRICT s, size_t bs, const v

    const int nb = n / QK_K;

-#if defined(__ARM_NEON)
+#if defined(__ARM_NEON) && (__ARM_ARCH >= 8)


Instead of this, we should add ggml_vqtbl1q_u8 similar to how we have ggml_vqtbl1q_s8 and replace all usages of vqtbl1q_u8 with ggml_vqtbl1q_u8

Amended and compiled locally in and Android project

vqtbl1q_u8 is not part of arm v7 neon library

…rg#5711) * [ggml-quants] Provide ggml_vqtbl1q_u8 for 64bit compatibility vqtbl1q_u8 is not part of arm v7 neon library * [android-example] Remove abi filter after arm v7a fix * [github-workflows] Do not skip Android armeabi-v7a build

rgryta changed the title ~~[ggml-quants] Add preprocessor check for __ARM_ARCH 8 sepcific neon optimizations~~ ggml-quants: Add preprocessor check for __ARM_ARCH 8 sepcific neon optimizations Feb 25, 2024

rgryta changed the title ~~ggml-quants: Add preprocessor check for __ARM_ARCH 8 sepcific neon optimizations~~ ggml-quants: Add preprocessor check for __ARM_ARCH 8 specific neon optimizations Feb 25, 2024

ggerganov reviewed Feb 25, 2024

View reviewed changes

rgryta added 3 commits February 25, 2024 19:26

[ggml-quants] Provide ggml_vqtbl1q_u8 for 64bit compatibility

a3ee7a1

vqtbl1q_u8 is not part of arm v7 neon library

[android-example] Remove abi filter after arm v7a fix

f5006c1

[github-workflows] Do not skip Android armeabi-v7a build

cc9288b

rgryta force-pushed the master branch from 781ed60 to cc9288b Compare February 25, 2024 18:26

rgryta changed the title ~~ggml-quants: Add preprocessor check for __ARM_ARCH 8 specific neon optimizations~~ ggml-quants: Provide ggml_vqtbl1q_u8 for 64bit compatibility Feb 25, 2024

ggerganov approved these changes Feb 25, 2024

View reviewed changes

ggerganov merged commit abbabc5 into ggml-org:master Feb 25, 2024

imciner2 mentioned this pull request Feb 26, 2024

[llama_cpp] Update to v0.0.16 (b2256, 2024-02-25) JuliaPackaging/Yggdrasil#8165

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ggml-quants: Provide ggml_vqtbl1q_u8 for 64bit compatibility #5711

ggml-quants: Provide ggml_vqtbl1q_u8 for 64bit compatibility #5711

Uh oh!

rgryta commented Feb 25, 2024

Uh oh!

rgryta commented Feb 25, 2024 •

edited

Loading

Uh oh!

rgryta commented Feb 25, 2024

Uh oh!

ggerganov Feb 25, 2024

Uh oh!

rgryta Feb 25, 2024 •

edited

Loading

Uh oh!

Uh oh!

ggml-quants: Provide ggml_vqtbl1q_u8 for 64bit compatibility #5711

ggml-quants: Provide ggml_vqtbl1q_u8 for 64bit compatibility #5711

Uh oh!

Conversation

rgryta commented Feb 25, 2024

Uh oh!

rgryta commented Feb 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rgryta commented Feb 25, 2024

Uh oh!

ggerganov Feb 25, 2024

Choose a reason for hiding this comment

Uh oh!

rgryta Feb 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rgryta commented Feb 25, 2024 •

edited

Loading

rgryta Feb 25, 2024 •

edited

Loading