[CI] Run build on Ubuntu 22, pre-commit if CUDA adapter changes. #17757

uditagarwal97 · 2025-03-31T20:38:13Z

Our Ubuntu 22.04 container has CUDA 12.1 installed while Ubuntu 24.04 image has CUDA 12.6.1 installed.
If a PR changes CUDA adapter, we should test the change with both CUDA versions.

bader · 2025-03-31T21:49:52Z

.github/workflows/sycl-linux-precommit.yml

@@ -52,6 +52,22 @@ jobs:
      changes: ${{ needs.detect_changes.outputs.filters }}
      e2e_binaries_artifact: sycl_e2e_bin_default

+  # If a PR changes CUDA adapter, run the build on Ubuntu 22.04 as well.
+  # Ubuntu 22.04 container has CUDA 12.1 installed while Ubuntu 24.0 image
+  # has CUDA 12.6.1 installed.


If I get it right, patches with changes in CUDA adaptor will be tested with both CUDA 12.6.1 and CUDA 12.1 (at least build + LIT?).
In addition, I saw CUDA 12.8 is installed on (at least) one of Windows runners.
Do intentionally use different CUDA versions in different workflows (or different machines)?

@npmiller, FYI.

If I get it right, patches with changes in CUDA adaptor will be tested with both CUDA 12.6.1 and CUDA 12.1 (at least build + LIT?).

Yes, for Linux.

In addition, I saw CUDA 12.8 is installed on (at least) one of Windows runners.
Do intentionally use different CUDA versions in different workflows (or different machines)?

@sarnex Since you installed CUDA on Windows runners, do we plan to have CUDA 12.8 uniformly on all Windows runners? CodePlay's UR CUDA adapter workflow uses CUDA 12.4, so I'm not sure if CUDA 12.8 is even extensively tested

yeah im installing cuda 12.8 on all windows runners, but we dont have any nvidia windows gpu machines, so its basically build only testing

aarongreig · 2025-04-04T12:34:12Z

I'm seeing a fail that I think might be related to this change, this error state https://github.com/intel/llvm/actions/runs/14263534349/job/39981394795?pr=17571#step:17:323 is reached based on the definition of CUDA_VERSION

llvm/unified-runtime/source/adapters/cuda/usm.cpp

Line 445 in 92da52d

if (Limits->maxPoolableSize > 0) {

which seems to differ from what's reported by the device at runtime https://github.com/intel/llvm/actions/runs/14263534349/job/39981394795?pr=17571#step:14:25

Seanst98 · 2025-04-04T16:36:32Z

I'm seeing a fail that I think might be related to this change, this error state https://github.com/intel/llvm/actions/runs/14263534349/job/39981394795?pr=17571#step:17:323 is reached based on the definition of CUDA_VERSION

llvm/unified-runtime/source/adapters/cuda/usm.cpp

Line 445 in 92da52d

if (Limits->maxPoolableSize > 0) {

which seems to differ from what's reported by the device at runtime https://github.com/intel/llvm/actions/runs/14263534349/job/39981394795?pr=17571#step:14:25

We decided that in this circumstance, it would be simplest to warn the user that the property will not be used instead of throwing an error. See this PR which changes to a warning: #17863

…7863) Warn users over setting the memory pool maximum size property instead of error. This should aid CI which is affected by the memory_pool.cpp e2e test which expects to be able to set this property despite DPC++ being built with a CUDA version that is too old. See this CI error: #17757 (comment) This PR relies on #17095 to be merged for the printing of the warning to be enabled.

Warn users over setting the memory pool maximum size property instead of error. This should aid CI which is affected by the memory_pool.cpp e2e test which expects to be able to set this property despite DPC++ being built with a CUDA version that is too old. See this CI error: intel/llvm#17757 (comment) This PR relies on intel/llvm#17095 to be merged for the printing of the warning to be enabled.

[CI] Run build on Ubuntu 22 if CUDA adapter changes.

25d01ec

uditagarwal97 self-assigned this Mar 31, 2025

uditagarwal97 requested a review from a team as a code owner March 31, 2025 20:38

uditagarwal97 temporarily deployed to WindowsCILock March 31, 2025 20:38 — with GitHub Actions Inactive

uditagarwal97 requested review from sarnex, aelovikov-intel and bader March 31, 2025 20:38

sarnex approved these changes Mar 31, 2025

View reviewed changes

uditagarwal97 temporarily deployed to WindowsCILock March 31, 2025 21:01 — with GitHub Actions Inactive

bader reviewed Mar 31, 2025

View reviewed changes

uditagarwal97 merged commit 5b35190 into sycl Apr 4, 2025
24 checks passed

Seanst98 mentioned this pull request Apr 4, 2025

[AsyncAlloc][CUDA] Change memory pool max size error to a warning #17863

Merged

bader deleted the cuda_adapter_ci branch April 4, 2025 22:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CI] Run build on Ubuntu 22, pre-commit if CUDA adapter changes. #17757

[CI] Run build on Ubuntu 22, pre-commit if CUDA adapter changes. #17757

Uh oh!

uditagarwal97 commented Mar 31, 2025 •

edited

Loading

Uh oh!

bader Mar 31, 2025

Uh oh!

uditagarwal97 Mar 31, 2025

Uh oh!

sarnex Mar 31, 2025 •

edited

Loading

Uh oh!

Uh oh!

aarongreig commented Apr 4, 2025

Uh oh!

Seanst98 commented Apr 4, 2025

Uh oh!

Uh oh!

[CI] Run build on Ubuntu 22, pre-commit if CUDA adapter changes. #17757

[CI] Run build on Ubuntu 22, pre-commit if CUDA adapter changes. #17757

Uh oh!

Conversation

uditagarwal97 commented Mar 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bader Mar 31, 2025

Choose a reason for hiding this comment

Uh oh!

uditagarwal97 Mar 31, 2025

Choose a reason for hiding this comment

Uh oh!

sarnex Mar 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

aarongreig commented Apr 4, 2025

Uh oh!

Seanst98 commented Apr 4, 2025

Uh oh!

Uh oh!

uditagarwal97 commented Mar 31, 2025 •

edited

Loading

sarnex Mar 31, 2025 •

edited

Loading