Upgrade init_tensor API to return a ggml_status #11854


Merged: 2 commits into ggml-org:master on Feb 28, 2025

Conversation

WilliamTambellini (Contributor)

To prepare for an 'abort-free' ggml, as agreed with Diego in the ggml repo, upgrade the backend init_tensor APIs to return a ggml_status.
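Concretely, the buffer interface's init_tensor callback goes from returning void to returning an enum ggml_status, so callers can propagate failures instead of aborting. A minimal sketch of the change (treat the exact signatures as illustrative; the interface field lives in ggml-backend-impl.h):

    // before: the backend had no way to report a failure to the caller
    // void             (*init_tensor)(ggml_backend_buffer_t buffer, struct ggml_tensor * tensor);

    // after: tensor initialization reports a ggml_status
    enum ggml_status (*init_tensor)(ggml_backend_buffer_t buffer, struct ggml_tensor * tensor);

    // callers can then check and propagate instead of aborting:
    enum ggml_status status = ggml_backend_buffer_init_tensor(buffer, tensor);
    if (status != GGML_STATUS_SUCCESS) {
        GGML_LOG_WARN("%s: init_tensor failed: %s\n", __func__, ggml_status_to_string(status));
        return status; // assumes the enclosing function also returns a ggml_status
    }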


github-actions bot added labels: testing, Nvidia GPU, Vulkan, ggml, SYCL (Feb 14, 2025)
WilliamTambellini (Contributor Author)

@slaren review please. Thanks.

WilliamTambellini (Contributor Author)

Thanks @slaren.
Re-ready for review.

WilliamTambellini force-pushed the init_tensor branch 4 times, most recently from e2486eb to 51a0f6c on February 18, 2025.
graehl left a comment (marked as outdated)

OK, so ggml_backend_*_buffer_init_tensor can only return success for most backends, but since it's called through the interface init_tensor pointer they still need to return a status. Was the plan to eventually make cuda_init_tensor sometimes return an error?

WilliamTambellini (Contributor Author)

Thanks @graehl.

> so ggml_backend_*_buffer_init_tensor can only return success for most backends, but since it's called through the interface init_tensor pointer they still need to return a status. Was the plan to eventually make cuda_init_tensor sometimes return an error?
Yes, but that is another PR in the ggml repo.
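For most backends the implementation is then a one-liner that returns success, while a fallible backend such as CUDA could later report a real error. A hedged sketch of both cases (the function and helper names here are hypothetical, not this PR's code):

    // a backend whose tensor initialization cannot fail simply returns success
    static enum ggml_status my_backend_buffer_init_tensor(ggml_backend_buffer_t buffer,
                                                          struct ggml_tensor * tensor) {
        (void) buffer; (void) tensor;
        return GGML_STATUS_SUCCESS;
    }

    // a future fallible path, e.g. for CUDA (illustrative only, not part of this PR)
    static enum ggml_status my_cuda_buffer_init_tensor(ggml_backend_buffer_t buffer,
                                                       struct ggml_tensor * tensor) {
        if (!my_cuda_prepare_tensor(buffer, tensor)) { // hypothetical helper
            return GGML_STATUS_ALLOC_FAILED;           // e.g. out of device memory
        }
        return GGML_STATUS_SUCCESS;
    }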

WilliamTambellini (Contributor Author)

@slaren re-ready for review, please. Best.

matiaslin (Contributor) left a comment

Good step forward towards the goal of returning an error instead of crashing.

WilliamTambellini (Contributor Author)

@ggerganov review please.

Member commented on lines 959 to 983:

            enum ggml_status status = ggml_backend_view_init(t);
            if (status != GGML_STATUS_SUCCESS) {
                GGML_LOG_WARN("%s: failed to ggml_backend_view_init: %s\n", __func__, ggml_status_to_string(status));
                return false;
            }
        }
    } else {
        if (t->view_src != NULL && t->buffer == NULL) {
            // view of a pre-allocated tensor
            enum ggml_status status = ggml_backend_view_init(t);
            if (status != GGML_STATUS_SUCCESS) {
                GGML_LOG_WARN("%s: failed to ggml_backend_view_init: %s\n", __func__, ggml_status_to_string(status));
                return false;
            }

This will leak memory if it fails.
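That is: by this point a backend buffer has already been allocated for the graph, so bailing out with a bare return false abandons it. A hedged sketch of the kind of cleanup the reviewer is pointing at (the variable name 'buffer' is illustrative, standing in for whatever was allocated earlier in the function):

    enum ggml_status status = ggml_backend_view_init(t);
    if (status != GGML_STATUS_SUCCESS) {
        GGML_LOG_WARN("%s: failed to ggml_backend_view_init: %s\n", __func__, ggml_status_to_string(status));
        // release the already-allocated buffer before bailing out,
        // otherwise it is leaked on the error path
        ggml_backend_buffer_free(buffer);
        return false;
    }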

To prepare for an 'abort-free' ggml
(ggml not to abort on OOMs but to return an OOM status),
as agreed with Diego in the ggml repo,
upgrade the init_tensor() and view_init() APIs
to return a ggml_status.
slaren merged commit 70680c4 into ggml-org:master on Feb 28, 2025.
47 checks passed
ag2s20150909 mentioned this pull request on Mar 3, 2025.
WilliamTambellini (Contributor Author)

@ggerganov I now have to retouch my PR in ggml. Could you please trigger a sync of ggml from llama.cpp to the ggml repo?

ggerganov (Member)

@WilliamTambellini Should be good now.

mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request on Mar 8, 2025:

* Upgrade init_tensor API to return a ggml_status

To prepare for an 'abort-free' ggml
(ggml not to abort on OOMs but to return an OOM status),
as agreed with Diego in the ggml repo,
upgrade the init_tensor() and view_init() APIs
to return a ggml_status.

* misc fixes

---------

Co-authored-by: slaren <[email protected]>
arthw pushed a commit to arthw/llama.cpp that referenced this pull request on Mar 19, 2025 (same commit message as above).
mostlyuseful pushed a commit to mostlyuseful/llama.cpp that referenced this pull request on May 12, 2025 (same commit message as above).