
ggml : reading the runtime sve config of the cpu #8382


Closed · wants to merge 1 commit

Conversation

jdomke (Contributor) commented Jul 9, 2024

To go from SVE512 to SVE256 (or other configurations), one can use the following:

$ cat wrapper.c
#include <sys/prctl.h>
#include <unistd.h>

int main(int argc, char *argv[], char *envp[]) {
    // Request a 32-byte (256-bit) SVE vector length and let child
    // processes inherit it.
    prctl(PR_SVE_SET_VL, 32 | PR_SVE_VL_INHERIT);
    // Replace this process with the target binary, forwarding its arguments.
    execve(argv[1], &argv[1], envp);
    return 0;
}
$ gcc wrapper.c -o wrapper
$ ./wrapper ./bin/llama-cli ...

However, when ggml reads the "current" SVE width via svcntb(), it still gets the wrong value; only prctl(PR_SVE_GET_VL) returns the correct one.
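
For reference, here is a minimal sketch (not part of this PR) that prints both readings side by side so the discrepancy can be observed when running under the wrapper; it assumes an SVE-capable toolchain and kernel headers that define PR_SVE_GET_VL:

$ cat check_vl.c
#include <stdio.h>
#include <sys/prctl.h>
#include <arm_sve.h>

int main(void) {
    // Width the compiler/runtime reports; may be constant-folded to the
    // build-time or boot-time vector length, as described above.
    printf("svcntb()             : %llu bytes\n", (unsigned long long) svcntb());
    // Width the kernel reports for this thread; the low bits hold the
    // vector length in bytes.
    printf("prctl(PR_SVE_GET_VL) : %d bytes\n", prctl(PR_SVE_GET_VL) & PR_SVE_VL_LEN_MASK);
    return 0;
}
$ gcc -march=armv8-a+sve check_vl.c -o check_vl
$ ./wrapper ./check_vl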

github-actions bot added the ggml label (changes relating to the ggml tensor library for machine learning) on Jul 9, 2024
jdomke (Contributor, Author) commented Jul 11, 2024

The issue might actually be the result of a bug in our clang runtime causing svcntb to return the wrong value; we are confirming right now.

mofosyne added the Review Complexity : Low label (trivial changes to code that most beginner devs, or those who want a break, can tackle; e.g. a UI fix) on Jul 13, 2024
jdomke (Contributor, Author) commented Jul 26, 2024

> The issue might actually be the result of a bug in our clang runtime causing svcntb to return the wrong value; we are confirming right now.

According to Fujitsu's compiler developers, the compiler assumes that the vector length does not change during program execution; svcntb() is therefore treated as a constant and the call may be optimized away. Consequently, the only workaround that lets users (without root rights) lower the vector length via the wrapper is to fall back to PR_SVE_VL_LEN_MASK & prctl(PR_SVE_GET_VL) inside llama.cpp.
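
A minimal sketch of such a fallback (the function name and guards are illustrative, not the exact symbols used in this PR):

#include <sys/prctl.h>

// Ask the kernel for this thread's SVE vector length in bytes instead of
// trusting the (possibly constant-folded) svcntb() intrinsic.
static int sve_vl_bytes(void) {
#if defined(__ARM_FEATURE_SVE) && defined(PR_SVE_GET_VL)
    int ret = prctl(PR_SVE_GET_VL);
    if (ret >= 0) {
        return ret & PR_SVE_VL_LEN_MASK; // low 16 bits hold the VL in bytes
    }
#endif
    return 0; // SVE unavailable or prctl failed
}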

ggerganov (Member) left a comment


Also need to update the svcntb() calls in ggml-quants.c
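
For illustration only (the actual call sites in ggml-quants.c may look different), the idea is to make kernel selection depend on the prctl-derived width rather than on svcntb():

#include <stdio.h>
#include <sys/prctl.h>

int main(void) {
    // Hypothetical dispatch: pick an SVE kernel variant from the
    // kernel-reported vector length instead of svcntb().
    int ret = prctl(PR_SVE_GET_VL);
    int vl  = ret >= 0 ? (ret & PR_SVE_VL_LEN_MASK) : 0;
    if (vl == 64) {
        printf("use the 512-bit SVE kernels\n");
    } else if (vl == 32) {
        printf("use the 256-bit SVE kernels\n");
    } else {
        printf("fall back to the generic path (VL = %d bytes)\n", vl);
    }
    return 0;
}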
