[SYCL] Add environment variable to disable in-memory program caching #11751

jzc · 2023-11-01T21:07:43Z

This PR adds an environment variable SYCL_CACHE_IN_MEM to control the in-memory caching of programs. Currently, every program/kernel is saved in the global KernelProgramCache, which means that every program/kernel will not be released until end of the program when the destructor of KernelProgramCache is ran. By enabling this environment variables, caching is not performed and the resources program and kernels use are freed after use.

cperkinsintel · 2023-11-07T01:28:12Z

If a new environment variable is added, the EnvironmentVariables.md file should be updated to document it.

But, also, when I look at llvm/sycl/doc/EnvironmentVariables.md I see there are already several environment variables for controlling cacheing there.
SYCL_CACHE_DIR, SYCL_CACHE_PERSISTENT, SYCL_CACHE_EVICTION_DISABLE, SYCL_CACHE_MAX_SIZE , SYCL_CACHE_THRESHOLD, SYCL_CACHE_MIN_DEVICE_IMAGE_SIZE, SYCL_CACHE_MAX_DEVICE_IMAGE_SIZE

Are we sure we need another? How is this new environment variable different than neutering the SYCL_CACHE_DIR or disabling via SYCL_CACHE_PERSISTENT ?

jzc · 2023-11-07T15:38:27Z

@cperkinsintel Those environment variables control the persistent caching i.e. the storing of compiled code on disk, while this new environment variable intends to give an option to disable the in-memory cache. Even if persistent caching is disabled, programs still go through the in-memory cache.

sycl/source/detail/program_manager/program_manager.cpp

sycl/unittests/kernel-and-program/MultipleDevsCache.cpp

sycl/unittests/SYCL2020/GetNativeOpenCL.cpp

sycl/source/detail/scheduler/commands.cpp

maarquitos14 · 2023-11-14T14:44:24Z

@cperkinsintel Those environment variables control the persistent caching i.e. the storing of compiled code on disk, while this new environment variable intends to give an option to disable the in-memory cache. Even if persistent caching is disabled, programs still go through the in-memory cache.

Should we make it explicit in the name that this refers to in-memory, non-persistent, caching? E.g. SYCL_IN_MEM_CACHE_DISABLE?

sycl/source/detail/config.hpp

dm-vodopyanov

Overall, LGTM

sycl/doc/EnvironmentVariables.md

sycl/source/detail/program_manager/program_manager.cpp

steffenlarsen · 2023-11-15T12:22:03Z

sycl/source/detail/program_manager/program_manager.cpp

+    // nullptr for the mutex.
+    auto [Kernel, ArgMask] = BuildF();
+    return make_tuple(Kernel, nullptr, ArgMask, Program);
+  }


Could this logic be moved to getOrBuild to avoid having to do it before each call to it?

Yes, but I think it'd probably be better suited for another PR. There are really two types of uses of getOrBuild, one with kernels and one with programs. The returns types differ between the two case, so it'd be a little awkward to insert this special logic for the program case in there. Alternatively, one could make a getOrBuildProgram and getOrBuildKernel program and then put that special logic in the getOrBuildKernel function, but there would still be a oddity with the return type: when caching, we wrap the value in a BuildResult and return a pointer to that (owned by the cache), but when are not caching, we bypass the BuildResult object, and would only want to return the value. This can still be resolved by further modifying these getOrBuild functions to return only the values we extract from the BuildResult anyways, but I believe this'll most likely create a lot of changes unrelated to the original goal of PR.

From what I remember of the code, it is due for a bit of an overhaul anyway. 👍

maarquitos14

LGTM.

steffenlarsen

LGTM!

In #11751, ref counting of kernels objects was changed to be more accurate in order to allow for in-memory caching to be disabled. When getting a kernel form the cache, the ref count the kernel handle is now incremented (when caching is enabled). Thus, a method like `ProgramManager::getOrCreateKernel` will increment the ref count of the kernel it gets. However, in `enqueueImpKernel`, when enqueuing a kernel with a kernel bundle,`ProgramManager::getOrCreateKernel` is called twice, first indirectly by: https://github.com/intel/llvm/blob/c43a90f28eebfcdf1bc1d55430485e2834790a60/sycl/source/detail/scheduler/commands.cpp#L2527-L2528 and second directly by: https://github.com/intel/llvm/blob/c43a90f28eebfcdf1bc1d55430485e2834790a60/sycl/source/detail/scheduler/commands.cpp#L2538-L2548 This means that the ref count of the acquired kernel is incremented twice, yet the rest of the function will only free once, which leads to a leak of the kernel. As the second comment and asserts say, the only need for the second call to `getOrCreateKernel` is to fetch the mutex associated to the cached kernel retrieved from the first call, so this PR adjusts `get_kernel` to save this mutex and forgo this extra `getOrCreateKernel` call and unintentional additional ref count.

jzc added 2 commits November 1, 2023 13:31

[SYCL] Add environment variable to disable in-memory program caching

ac621c3

Add test

2021871

jzc requested a review from a team as a code owner November 1, 2023 21:07

jzc requested a review from cperkinsintel November 1, 2023 21:07

clang-format

88439ea

jzc had a problem deploying to WindowsCILock November 1, 2023 21:34 — with GitHub Actions Failure

jzc added 2 commits November 3, 2023 08:01

Add release in graph append and don't use optional with lock guard

b1afb68

clang-format

157db1b

jzc temporarily deployed to WindowsCILock November 3, 2023 15:15 — with GitHub Actions Inactive

jzc had a problem deploying to WindowsCILock November 3, 2023 15:47 — with GitHub Actions Failure

Fix kernel bundle graph append

0d2f79f

jzc temporarily deployed to WindowsCILock November 3, 2023 18:17 — with GitHub Actions Inactive

jzc had a problem deploying to WindowsCILock November 3, 2023 19:31 — with GitHub Actions Failure

cperkinsintel requested a review from sergey-semenov November 7, 2023 01:28

Set EliminatedArgMask

8462232

jzc temporarily deployed to WindowsCILock November 7, 2023 20:50 — with GitHub Actions Inactive

jzc temporarily deployed to WindowsCILock November 7, 2023 21:32 — with GitHub Actions Inactive

dm-vodopyanov reviewed Nov 14, 2023

View reviewed changes

jzc added 2 commits November 14, 2023 08:05

Merge remote-tracking branch 'intel/sycl' into disable-caching

a1c759d

Address review comments

d07e6fa

jzc requested a review from a team as a code owner November 14, 2023 18:56

clang-format

d2c7123

jzc temporarily deployed to WindowsCILock November 14, 2023 19:02 — with GitHub Actions Inactive

0x12CC reviewed Nov 14, 2023

View reviewed changes

sycl/source/detail/config.hpp Outdated Show resolved Hide resolved

jzc had a problem deploying to WindowsCILock November 14, 2023 20:16 — with GitHub Actions Failure

Change environment variable and update test

1eb9b5d

jzc temporarily deployed to WindowsCILock November 14, 2023 20:25 — with GitHub Actions Inactive

jzc temporarily deployed to WindowsCILock November 14, 2023 21:17 — with GitHub Actions Inactive

Merge remote-tracking branch 'intel/sycl' into disable-caching

d0b8610

jzc had a problem deploying to WindowsCILock November 14, 2023 22:49 — with GitHub Actions Failure

jzc temporarily deployed to WindowsCILock November 15, 2023 01:07 — with GitHub Actions Inactive

dm-vodopyanov approved these changes Nov 15, 2023

View reviewed changes

sycl/doc/EnvironmentVariables.md Outdated Show resolved Hide resolved

sycl/source/detail/program_manager/program_manager.cpp Outdated Show resolved Hide resolved

steffenlarsen reviewed Nov 15, 2023

View reviewed changes

Address review comments

ac48ff1

jzc temporarily deployed to WindowsCILock November 15, 2023 20:18 — with GitHub Actions Inactive

jzc had a problem deploying to WindowsCILock November 15, 2023 21:01 — with GitHub Actions Failure

Merge remote-tracking branch 'intel/sycl' into disable-caching

523ec14

maarquitos14 approved these changes Nov 16, 2023

View reviewed changes

jzc temporarily deployed to WindowsCILock November 16, 2023 15:49 — with GitHub Actions Inactive

jzc had a problem deploying to WindowsCILock November 16, 2023 16:27 — with GitHub Actions Failure

jzc temporarily deployed to WindowsCILock November 16, 2023 22:27 — with GitHub Actions Inactive

jzc temporarily deployed to WindowsCILock November 16, 2023 23:31 — with GitHub Actions Inactive

steffenlarsen approved these changes Nov 17, 2023

View reviewed changes

steffenlarsen merged commit 9322d14 into intel:sycl Nov 17, 2023

jzc mentioned this pull request Nov 20, 2023

[SYCL] Fix issue of acquring kernel twice #11953

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL] Add environment variable to disable in-memory program caching #11751

[SYCL] Add environment variable to disable in-memory program caching #11751

Uh oh!

jzc commented Nov 1, 2023 •

edited

Loading

Uh oh!

cperkinsintel commented Nov 7, 2023

Uh oh!

jzc commented Nov 7, 2023

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

maarquitos14 commented Nov 14, 2023

Uh oh!

Uh oh!

dm-vodopyanov left a comment

Uh oh!

Uh oh!

Uh oh!

steffenlarsen Nov 15, 2023

Uh oh!

jzc Nov 15, 2023

Uh oh!

steffenlarsen Nov 17, 2023

Uh oh!

maarquitos14 left a comment

Uh oh!

steffenlarsen left a comment

Uh oh!

Uh oh!

[SYCL] Add environment variable to disable in-memory program caching #11751

[SYCL] Add environment variable to disable in-memory program caching #11751

Uh oh!

Conversation

jzc commented Nov 1, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cperkinsintel commented Nov 7, 2023

Uh oh!

jzc commented Nov 7, 2023

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

maarquitos14 commented Nov 14, 2023

Uh oh!

Uh oh!

dm-vodopyanov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

steffenlarsen Nov 15, 2023

Choose a reason for hiding this comment

Uh oh!

jzc Nov 15, 2023

Choose a reason for hiding this comment

Uh oh!

steffenlarsen Nov 17, 2023

Choose a reason for hiding this comment

Uh oh!

maarquitos14 left a comment

Choose a reason for hiding this comment

Uh oh!

steffenlarsen left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jzc commented Nov 1, 2023 •

edited

Loading