[SYCL][Graph] Optimize graph enqueue for in-order queues #18792

fabiomestre · 2025-06-03T16:17:01Z

Optimizes the enqueue() function of sycl graphs to bypass the scheduler whenever possible and avoid creating events when not needed.

Refactors the executable graph enqueue() to have different paths depending on workload:
- The direct path will be used when there are no host-tasks or accessor requirements in the graph and the execution dependencies are considered safe to bypass the scheduler.
- The scheduler path will be used when there are requirements in the graph but no host-tasks or, if the execution dependencies require using the scheduler.
- The multiple partitions path will be used when the graph contains host-tasks which requires scheduling multiple graph partitions. The implementation was also changed to avoid adding unnecessary event dependencies to partition executions and avoiding copying CGData when possible.
Extends the changes in [SYCL] Do not store last event for in-order queues #18277 to sycl graphs. This means that no implicit events will be created when using in-order queues and graphs without host-tasks. Also updates the handler to only request events from the graph enqueue() when they are needed.

…command lists are disabled

unified-runtime/source/adapters/hip/command_buffer.hpp

…_enqueue

fabiomestre · 2025-06-17T12:56:45Z

@intel/llvm-gatekeepers Can this PR be merged? The existing failures seem to be CI issues. This PR was green for those jobs before:

Intel-Arc jobs:

Cuda UR Job:

https://github.com/intel/llvm/actions/runs/15681443929/job/44174565853?pr=18792

There were only unrelated changes to the HIP UR adapter made since then.

Edit: Nevermind, recently merged commits broke this PR, needs to be rebased.

…_enqueue

fabiomestre · 2025-06-18T18:43:28Z

@intel/llvm-gatekeepers This PR is ready to merge. The CI failure on PVC is unrelated (I have seen it in other PR's).

uditagarwal97 · 2025-06-18T19:26:08Z

@intel/llvm-gatekeepers This PR is ready to merge. The CI failure on PVC is unrelated (I have seen it in other PR's).

Failing tests are unrelated and tracked by #18932

fabiomestre had a problem deploying to WindowsCILock June 3, 2025 16:17 — with GitHub Actions Error

fabiomestre force-pushed the fabio/eventless_graph_enqueue branch 2 times, most recently from 4c130dc to ad6ed0e Compare June 3, 2025 16:23

fabiomestre had a problem deploying to WindowsCILock June 3, 2025 16:25 — with GitHub Actions Error

fabiomestre force-pushed the fabio/eventless_graph_enqueue branch from ad6ed0e to e8ee455 Compare June 3, 2025 16:45

fabiomestre had a problem deploying to WindowsCILock June 3, 2025 16:45 — with GitHub Actions Error

fabiomestre force-pushed the fabio/eventless_graph_enqueue branch from e8ee455 to 8393ea8 Compare June 3, 2025 16:54

fabiomestre had a problem deploying to WindowsCILock June 3, 2025 16:54 — with GitHub Actions Failure

fabiomestre had a problem deploying to WindowsCILock June 3, 2025 17:22 — with GitHub Actions Failure

fabiomestre had a problem deploying to WindowsCILock June 4, 2025 11:22 — with GitHub Actions Failure

fabiomestre had a problem deploying to WindowsCILock June 4, 2025 11:37 — with GitHub Actions Failure

fabiomestre force-pushed the fabio/eventless_graph_enqueue branch 2 times, most recently from b00774f to 130688f Compare June 5, 2025 13:39

fabiomestre temporarily deployed to WindowsCILock June 5, 2025 13:39 — with GitHub Actions Inactive

fabiomestre had a problem deploying to WindowsCILock June 5, 2025 19:33 — with GitHub Actions Failure

fabiomestre temporarily deployed to WindowsCILock June 5, 2025 19:33 — with GitHub Actions Inactive

fabiomestre added 4 commits June 6, 2025 12:46

Optimization enqueue work in progress

7a5c5fd

Fix Unit test failure

0241111

Fix command-buffer dependencies on the legacy adapter when immediate …

d3445ef

…command lists are disabled

Fix data race in multiple_exec_graphs test

dd8f6d1

fabiomestre force-pushed the fabio/eventless_graph_enqueue branch from 30e862e to dd8f6d1 Compare June 6, 2025 11:47

fabiomestre temporarily deployed to WindowsCILock June 6, 2025 11:47 — with GitHub Actions Inactive

fabiomestre temporarily deployed to WindowsCILock June 6, 2025 12:12 — with GitHub Actions Inactive

fabiomestre had a problem deploying to WindowsCILock June 6, 2025 12:12 — with GitHub Actions Failure

Let L0 event implementation handler dependencies for in-order queue

240c952

fabiomestre had a problem deploying to WindowsCILock June 6, 2025 12:31 — with GitHub Actions Error

Wait for command-buffer execution before destroying

d70fc37

fabiomestre temporarily deployed to WindowsCILock June 6, 2025 12:54 — with GitHub Actions Inactive

fabiomestre temporarily deployed to WindowsCILock June 16, 2025 13:23 — with GitHub Actions Inactive

Workaround HIP limitations

71cc56f

fabiomestre temporarily deployed to WindowsCILock June 16, 2025 18:21 — with GitHub Actions Inactive

fabiomestre temporarily deployed to WindowsCILock June 16, 2025 18:58 — with GitHub Actions Inactive

EwanC reviewed Jun 17, 2025

View reviewed changes

unified-runtime/source/adapters/hip/command_buffer.hpp Outdated Show resolved Hide resolved

fabiomestre added 2 commits June 17, 2025 11:19

Update comment for new hip variable

03fe4b6

Merge remote-tracking branch 'origin/sycl' into fabio/eventless_graph…

92cf34f

…_enqueue

fabiomestre had a problem deploying to WindowsCILock June 17, 2025 10:19 — with GitHub Actions Failure

fabiomestre temporarily deployed to WindowsCILock June 17, 2025 10:48 — with GitHub Actions Inactive

fabiomestre temporarily deployed to WindowsCILock June 17, 2025 11:46 — with GitHub Actions Inactive

fabiomestre temporarily deployed to WindowsCILock June 17, 2025 12:14 — with GitHub Actions Inactive

[HIP] Enqueue event wait instead of waiting on the host

758cecf

fabiomestre had a problem deploying to WindowsCILock June 17, 2025 15:27 — with GitHub Actions Failure

fabiomestre added 2 commits June 17, 2025 17:03

Merge remote-tracking branch 'origin/sycl' into fabio/eventless_graph…

fc2ad25

…_enqueue

Fix build failures after rebase

a034bb8

fabiomestre temporarily deployed to WindowsCILock June 17, 2025 16:05 — with GitHub Actions Inactive

fabiomestre temporarily deployed to WindowsCILock June 17, 2025 16:34 — with GitHub Actions Inactive

Merge remote-tracking branch 'origin/sycl' into fabio/eventless_graph…

d31495e

…_enqueue

fabiomestre had a problem deploying to WindowsCILock June 18, 2025 14:15 — with GitHub Actions Error

fabiomestre temporarily deployed to WindowsCILock June 18, 2025 17:48 — with GitHub Actions Inactive

fabiomestre temporarily deployed to WindowsCILock June 18, 2025 18:22 — with GitHub Actions Inactive

uditagarwal97 merged commit b643b8b into intel:sycl Jun 18, 2025
48 of 51 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL][Graph] Optimize graph enqueue for in-order queues #18792

[SYCL][Graph] Optimize graph enqueue for in-order queues #18792

Uh oh!

fabiomestre commented Jun 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

fabiomestre commented Jun 17, 2025 •

edited

Loading

Uh oh!

fabiomestre commented Jun 18, 2025

Uh oh!

uditagarwal97 commented Jun 18, 2025

Uh oh!

Uh oh!

Uh oh!

[SYCL][Graph] Optimize graph enqueue for in-order queues #18792

[SYCL][Graph] Optimize graph enqueue for in-order queues #18792

Uh oh!

Conversation

fabiomestre commented Jun 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

fabiomestre commented Jun 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fabiomestre commented Jun 18, 2025

Uh oh!

uditagarwal97 commented Jun 18, 2025

Uh oh!

Uh oh!

Uh oh!

fabiomestre commented Jun 3, 2025 •

edited

Loading

fabiomestre commented Jun 17, 2025 •

edited

Loading