[SYCL][L0] Fix memory leak when tracking indirect accesses #7105

aelovikov-intel · 2022-10-18T20:36:00Z

On the plugin API boundaries extra piKernelRetain/piKernelRelease can be called in arbitrary moments (from other threads) include before the kernel was submitted to the actual GPU HW. As such, piKernelRelease has nothing to do with submissions and indirect memory tracking updates should be initiated from the points where the kernel actually finishes - CleanupCompletedEvent.

On the plugin API boundaries extra piKernelRetain/piKernelRelease can be called in arbitrary moments (from other threads) include *before* the kernel was submitted to the actual GPU HW. As such, piKernelRelease has nothing to do with submissions and indirect memory tracking updates should be initiated from the points where the kernel actually finishes - CleanupCompletedEvent.

againull · 2022-10-20T15:58:14Z

sycl/plugins/level_zero/pi_level_zero.cpp

+  if (AssociatedKernel) {
+    if (IndirectAccessTrackingEnabled) {
+      auto Kernel = AssociatedKernel;
+      // piKernelRelease is called by CleanupCompletedEvent(Event) as soon as
+      // kernel execution has finished. This is the place where we need to
+      // release memory allocations. If kernel is not in use (not submitted by
+      // some other thread) then release referenced memory allocations. As a
+      // result, memory can be deallocated and context can be removed from
+      // container in the platform. That's why we need to lock a mutex here.
+      pi_platform Plt = Kernel->Program->Context->getPlatform();
+      std::scoped_lock<pi_shared_mutex> ContextsLock(Plt->ContextsMutex);
+
+      if (--Kernel->SubmissionsCount == 0) {
+        // Kernel is not submitted for execution, release referenced memory
+        // allocations.
+        for (auto &MemAlloc : Kernel->MemAllocs) {
+          // std::pair<void *const, MemAllocRecord> *, Hash
+          USMFreeHelper(MemAlloc->second.Context, MemAlloc->first,
+                        MemAlloc->second.OwnZeMemHandle);
+        }
+        Kernel->MemAllocs.clear();
+      }
+    }
    PI_CALL(piKernelRelease(AssociatedKernel));


We call piKernelRelease for dependent events below in this function - https://github.com/intel/llvm/pull/7105/files#diff-15dd1eb076d2164bd9e87d9057f05f652a716498e8cdf5975e564c65309a0985L5895-L5896

It looks like we need to do the same for them as well.

aelovikov-intel requested a review from a team as a code owner October 18, 2022 20:36

smaslov-intel requested a review from againull October 18, 2022 20:44

againull reviewed Oct 20, 2022

View reviewed changes

aelovikov-intel added 2 commits October 21, 2022 15:08

Merge remote-tracking branch 'origin/sycl' into l0-ind-mem-leak

01124c3

Address review comment

b3cf265

aelovikov-intel requested a review from againull October 24, 2022 20:27

againull approved these changes Oct 24, 2022

View reviewed changes

againull merged commit 1b79491 into intel:sycl Oct 24, 2022

aelovikov-intel deleted the l0-ind-mem-leak branch November 8, 2022 20:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL][L0] Fix memory leak when tracking indirect accesses #7105

[SYCL][L0] Fix memory leak when tracking indirect accesses #7105

Uh oh!

aelovikov-intel commented Oct 18, 2022

Uh oh!

againull Oct 20, 2022

Uh oh!

Uh oh!

[SYCL][L0] Fix memory leak when tracking indirect accesses #7105

[SYCL][L0] Fix memory leak when tracking indirect accesses #7105

Uh oh!

Conversation

aelovikov-intel commented Oct 18, 2022

Uh oh!

againull Oct 20, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!