Draft: Metal max buffer workaround #1825

kiltyj · 2023-06-12T18:47:56Z

This was my initial attempt at working around MTLBuffer.maxBufferLength last week. This seems to work for 7B models, but not for larger models (e.g. guanaco 65B).

I'll keep poking at it after hours, but creating this so others can take a look if/as they find time.

kiltyj · 2023-06-12T18:49:06Z

ggml-metal.m

-        ctx->buffers[ctx->n_buffers].name = name;
-        ctx->buffers[ctx->n_buffers].data = data;
-        ctx->buffers[ctx->n_buffers].size = size;
+        size_t sys_max_buffer_size = 2ul * 1024ul * 1024ul * 1024ul; // ctx->device.maxBufferLength;


Note: this is an artificial 2GB limit that I had in place to test this out, given I don't actually bump into maxBufferLength on my M1 Max.

Should be switched back to ctx->device.maxBufferLength once issues are worked out.

ggerganov · 2023-06-18T06:12:54Z

@kiltyj Thanks for the help. I merged #1826 for now. It's not the best outcome, as we seem to not be able to utilize all unified memory, but at least is should handle problematic situations better with an error instead of generating garbage

kiltyj added 2 commits June 12, 2023 02:00

Workaround Metal maxBufferLength

b2c0973

Merge branch 'ggerganov:master' into metal-max-buffer-workaround

350784d

kiltyj commented Jun 12, 2023

View reviewed changes

kiltyj mentioned this pull request Jun 12, 2023

[METAL] GPU Inference fails due to buffer error (buffer "data" size is larger than buffer maximum) #1815

Closed

4 tasks

kmcgowan mentioned this pull request Jun 12, 2023

metal : handle buffers larger than device's maxBufferLength #1826

Merged

ggerganov closed this Jun 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Draft: Metal max buffer workaround #1825

Draft: Metal max buffer workaround #1825

Uh oh!

kiltyj commented Jun 12, 2023

Uh oh!

kiltyj Jun 12, 2023

Uh oh!

ggerganov commented Jun 18, 2023

Uh oh!

Uh oh!

Draft: Metal max buffer workaround #1825

Draft: Metal max buffer workaround #1825

Uh oh!

Conversation

kiltyj commented Jun 12, 2023

Uh oh!

kiltyj Jun 12, 2023

Choose a reason for hiding this comment

Uh oh!

ggerganov commented Jun 18, 2023

Uh oh!

Uh oh!