
llama : use smart pointers for ggml resources #10117


Merged: 3 commits, Nov 1, 2024

Conversation

@slaren (Member) commented Nov 1, 2024

This PR introduces the header ggml-cpp.h to ggml, which contains ready-to-use smart pointer types for the ggml resources, and uses them in llama.cpp.

The motivation is to avoid leaks and simplify the code, particularly in the model loader, where exceptions are frequently used.
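The wrappers follow the standard unique_ptr-with-custom-deleter pattern. Below is a minimal self-contained sketch of that pattern, using an invented stand-in resource (`my_res` and its create/free functions are not real ggml APIs; the actual wrapper types live in ggml-cpp.h):

```cpp
#include <memory>

// Invented stand-in for a C resource with create/free functions,
// analogous to ggml_init()/ggml_free().
struct my_res { int val; };
my_res * my_res_new(int v)       { return new my_res{v}; }
void     my_res_free(my_res * r) { delete r; }

// A stateless deleter functor plus a unique_ptr alias: this is the
// general shape of the wrapper types that ggml-cpp.h provides for
// the real ggml resources (e.g. a ggml_context_ptr for ggml_context).
struct my_res_deleter {
    void operator()(my_res * r) const { my_res_free(r); }
};
typedef std::unique_ptr<my_res, my_res_deleter> my_res_ptr;
```

The resource is then freed automatically when the pointer goes out of scope, including during stack unwinding from a thrown exception, which is exactly the model-loader leak scenario the PR description mentions.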

@github-actions github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Nov 1, 2024
@slaren slaren force-pushed the sl/ggml-cpp-wrappers branch from 9e54531 to 9ac95b1 on November 1, 2024 02:32
@slaren slaren force-pushed the sl/ggml-cpp-wrappers branch from 9ac95b1 to 48e6e4c on November 1, 2024 02:36
@danbev (Collaborator) commented Nov 1, 2024

I needed to add a symbolic link to ggml-cpp.h from the spm-headers directory to get this to build using xcodebuild:

$ cd spm-headers
$ ln -s ../ggml/include/ggml-cpp.h ggml-cpp.h

Review thread on these lines of the diff:

    ggml_free(ctx0);
    ctx0 = nullptr;
A Collaborator commented:
Could ctx0 be changed into a ggml_context_ptr perhaps?

@slaren (Member, Author) replied Nov 1, 2024:

This one is used thousands of times and it would be impractical to change all of the uses to ctx0.get(). So we would need to keep a raw pointer for ease of use, as well as the smart pointer, and keep them synchronized. At that point, there would be little benefit from using a smart pointer.

Another reason is that this struct has init and free functions instead of using the constructor and destructor, and there are comments that explicitly ask to avoid doing initialization in the constructor. However, from what I can tell, init is called exactly once right after the constructor in every instance where this struct is used, followed by a single call to free, so I don't think there is a good reason for this. I don't know what the motivation was to do it this way, though.
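For context, the friction described above can be sketched as follows. This is a minimal illustration with invented stand-in names (`ctx_t`, `op_add`, etc. are not real ggml APIs), not actual llama.cpp code:

```cpp
#include <memory>

// Invented stand-ins for a C context API such as ggml_init()/ggml_free().
struct ctx_t { int n_ops = 0; };
ctx_t * ctx_new()          { return new ctx_t{}; }
void    ctx_del(ctx_t * c) { delete c; }
int     op_add(ctx_t * c)  { return ++c->n_ops; }  // stand-in for a graph-building call

struct ctx_deleter { void operator()(ctx_t * c) const { ctx_del(c); } };
using ctx_ptr = std::unique_ptr<ctx_t, ctx_deleter>;

// With a raw pointer, every call site reads naturally:
int build_raw(ctx_t * ctx0) {
    op_add(ctx0);
    op_add(ctx0);
    return op_add(ctx0);
}

// With a smart pointer, each of the (in llama.cpp, thousands of) call
// sites needs .get(), unless a synchronized raw copy is kept around,
// which gives up most of the safety the smart pointer was meant to add.
int build_smart(ctx_ptr & ctx0) {
    op_add(ctx0.get());
    op_add(ctx0.get());
    return op_add(ctx0.get());
}
```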

The Collaborator replied:

Got it, thanks for the detailed explanation!

A Member replied:

> Another reason is that this struct has init and free functions instead of using the constructor and destructor, and there are comments that explicitly ask to avoid doing initialization in the constructor. However, from what I can tell, init is called exactly once right after the constructor in every instance where this struct is used, followed by a single call to free, so I don't think there is a good reason for this, but I don't know what the motivation was to do it this way.

I try to avoid adding too much initialization logic in constructors, just as a rule of thumb. In this case it is pointless, but in the future, if we want to start handling error states from ggml_init, or if we start doing extra things that could potentially fail, it might not be a good idea to put them in the constructor.

@slaren (Member, Author) replied Nov 1, 2024:

Normally this would be handled by throwing an exception in the constructor. I imagine that the goal here was to avoid using exceptions, but they are already used everywhere in the llama.cpp code, and they are a fundamental part of RAII, which in turn is fundamental to C++ resource management. Without throwing an exception when the construction of an object fails, you end up with an object in an inconsistent or invalid state. IMO we should start using C++ properly in llama.cpp instead of as a somewhat augmented C; there are a lot of inconsistencies in the llama.cpp code because of this, and we need to standardize on one style at some point.
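The pattern described here can be illustrated with a minimal sketch. The fallible init/free pair below is invented for the example (it stands in for something like ggml_init()/ggml_free(); this is not actual llama.cpp code): a constructor that throws on failure guarantees that no half-initialized object is ever observable, and RAII then cleans up during unwinding.

```cpp
#include <stdexcept>

// Invented stand-in for a C-style fallible init/free pair.
struct raw_ctx { int size; };
raw_ctx * raw_init(int size)      { return size > 0 ? new raw_ctx{size} : nullptr; }
void      raw_free(raw_ctx * ctx) { delete ctx; }

// RAII wrapper: construction either fully succeeds or throws, so no
// code can ever hold an object in an inconsistent or invalid state.
class context {
  public:
    explicit context(int size) : ctx(raw_init(size)) {
        if (ctx == nullptr) {
            throw std::runtime_error("context init failed");
        }
    }
    ~context() { raw_free(ctx); }
    context(const context &) = delete;
    context & operator=(const context &) = delete;
    raw_ctx * get() const { return ctx; }
  private:
    raw_ctx * ctx;
};
```

If construction throws, the caller never receives the object and the destructor is never run on a half-built instance, which is the guarantee a separate init function cannot provide.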

The Member replied:

I'm OK with adopting a bit more idiomatic C++ in the codebase. The current style is, I think, mostly influenced by my own preference and the goal of writing more C-style code. Working on ggml, I started to appreciate it because the code ends up much more linear and explicit. But I have experience with and can easily adapt to modern C++ (not too modern, though 😄). The main goal should be for the codebase to be consistent, and any way to improve this is welcome.

@slaren slaren merged commit e991e31 into master Nov 1, 2024
53 checks passed
@slaren slaren deleted the sl/ggml-cpp-wrappers branch November 1, 2024 22:48
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024