metal: Cache compiled library at device level #12265

BB-fat · 2025-03-08T05:32:12Z

Currently, Metal shaders are recompiled on every llama context initialization, which is redundant and slows down creating multiple contexts.
This change caches the compiled Metal library at the device context level (g_ggml_ctx_dev_main) and reuses it for subsequent context initializations.
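
For illustration only, the caching idea can be sketched in plain C. This is not the code in the PR: `fake_library`, `device_context`, `compile_library`, `device_get_library` and `context_init` are hypothetical names, and the real backend works with Objective-C Metal objects rather than plain structs. The sketch assumes single-threaded initialization; a real implementation would likely need locking and careful object lifetime handling.

```c
#include <stddef.h>

// Hypothetical stand-in for a compiled shader library
// (the real backend holds an Objective-C Metal object instead).
typedef struct {
    int id;
} fake_library;

// Hypothetical device-level context shared by every backend context,
// loosely modeled on the role of g_ggml_ctx_dev_main.
typedef struct {
    fake_library * library;     // cached compiled library, NULL until first use
    int            n_compiles;  // how many times the expensive compile step ran
} device_context;

static device_context g_dev_ctx = { NULL, 0 };

// Stand-in for the expensive shader compilation step.
static fake_library * compile_library(device_context * dev) {
    static fake_library lib;
    lib.id = 1;
    dev->n_compiles++;
    return &lib;
}

// Return the cached library, compiling only when the cache is empty.
// Not thread-safe: assumes contexts are initialized from a single thread.
static fake_library * device_get_library(device_context * dev) {
    if (dev->library == NULL) {
        dev->library = compile_library(dev);
    }
    return dev->library;
}

// Hypothetical per-context state: it borrows the device-level library
// instead of compiling its own copy.
typedef struct {
    fake_library * library;
} backend_context;

static backend_context context_init(device_context * dev) {
    backend_context ctx = { device_get_library(dev) };
    return ctx;
}
```

Under these assumptions, calling `context_init(&g_dev_ctx)` twice yields two contexts that share one library pointer, and the compile step runs only once.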

Fixes #12199

BB-fat · 2025-03-10T03:41:03Z

During testing, I found an Objective-C double-release issue; I am working on a fix.

BB-fat · 2025-03-10T06:24:58Z

@ggerganov Please review when convenient.

github-actions bot added the ggml and Apple Metal labels Mar 8, 2025

BB-fat force-pushed the metal-library-cache branch 2 times, most recently from 6b3a511 to 0569909 Compare March 9, 2025 12:27

BB-fat marked this pull request as ready for review March 9, 2025 12:29

metal : Cache the Metal library at the device context level

70432c7

BB-fat force-pushed the metal-library-cache branch from 0569909 to 70432c7 Compare March 10, 2025 05:33

ggerganov approved these changes Mar 11, 2025

ggerganov merged commit 6ab2e47 into ggml-org:master Mar 11, 2025
47 checks passed

BB-fat deleted the metal-library-cache branch March 12, 2025 02:19

ishaangandhi pushed a commit to ishaangandhi/llama.cpp that referenced this pull request Mar 12, 2025

metal : Cache the Metal library at the device context level (ggml-org…

8084934

…#12265)

jpohhhh pushed a commit to Telosnex/llama.cpp that referenced this pull request Mar 14, 2025

metal : Cache the Metal library at the device context level (ggml-org…

e9ebbb4

…#12265)

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Mar 19, 2025

metal : Cache the Metal library at the device context level (ggml-org…

715a993

…#12265)
