[UR][CUDA] Use different throttle reasons API based on CUDA version #17719

againull · 2025-03-28T19:33:39Z

Our pre-commit CI uses CUDA 12.6 but nightly uses CUDA 12.1, it turns out nvml which is part of CUDA 12.6 has nvmlDeviceGetCurrentClocksEventReasons API, but nvml which is part of CUDA 12.1 doesn't have it, but only supports older deprecated nvmlDeviceGetCurrentClocksThrottleReasons.

NVML doesn't provide a version macro to check the support for that API, so use new API nvmlDeviceGetCurrentClocksEventReasons based on cuda version.

Our pre-commit CI uses CUDA 12.6 but nightly uses CUDA 12.1, it turns out nvml which is part of CUDA 12.6 has nvmlDeviceGetCurrentClocksEventReasons API, but nvml which is part of CUDA 12.1 doesn't have it, but only supports older deprecated nvmlDeviceGetCurrentClocksThrottleReasons. NVML doesn't provide a version macro to check the support for that API, so use new API nvmlDeviceGetCurrentClocksEventReasons based on cuda version.

againull · 2025-03-31T09:03:33Z

Jenkins/Precommit failure is unrelated and visible in other PRs.

…17719) Our pre-commit CI uses CUDA 12.6 but nightly uses CUDA 12.1, it turns out nvml which is part of CUDA 12.6 has nvmlDeviceGetCurrentClocksEventReasons API, but nvml which is part of CUDA 12.1 doesn't have it, but only supports older deprecated nvmlDeviceGetCurrentClocksThrottleReasons. NVML doesn't provide a version macro to check the support for that API, so use new API nvmlDeviceGetCurrentClocksEventReasons based on cuda version.

Cherry-pick commits that reached the internal branch between intel/llvm cutoff and release branch pulldown. Patches included: --- [SYCL][Doc] Add new device descriptors to sycl_ext_intel_device_info extension (#17386) Patch-by: Artur Gainullin <[email protected]> --- [SYCL][CUDA] Add implementation of new device descriptors (#17590) Tests were not added because there are existing conformance tests which cover this functionality. Patch-by: Artur Gainullin <[email protected]> --- [UR][CUDA] Use different throttle reasons API based on CUDA version (#17719) Our pre-commit CI uses CUDA 12.6 but nightly uses CUDA 12.1, it turns out nvml which is part of CUDA 12.6 has nvmlDeviceGetCurrentClocksEventReasons API, but nvml which is part of CUDA 12.1 doesn't have it, but only supports older deprecated nvmlDeviceGetCurrentClocksThrottleReasons. NVML doesn't provide a version macro to check the support for that API, so use new API nvmlDeviceGetCurrentClocksEventReasons based on cuda version. Patch-by: Artur Gainullin <[email protected]>

againull requested a review from a team as a code owner March 28, 2025 19:33

againull requested a review from omarahmed1111 March 28, 2025 19:33

againull temporarily deployed to WindowsCILock March 28, 2025 19:33 — with GitHub Actions Inactive

againull temporarily deployed to WindowsCILock March 28, 2025 19:45 — with GitHub Actions Inactive

npmiller approved these changes Mar 31, 2025

View reviewed changes

againull merged commit cf1a3d3 into intel:sycl Mar 31, 2025
31 of 32 checks passed

npmiller mentioned this pull request Jun 10, 2025

[sycl-rel] Cherry-pick patches for sycl_ext_intel_device_info #18886

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[UR][CUDA] Use different throttle reasons API based on CUDA version #17719

[UR][CUDA] Use different throttle reasons API based on CUDA version #17719

Uh oh!

againull commented Mar 28, 2025

Uh oh!

againull commented Mar 31, 2025

Uh oh!

Uh oh!

Uh oh!

[UR][CUDA] Use different throttle reasons API based on CUDA version #17719

[UR][CUDA] Use different throttle reasons API based on CUDA version #17719

Uh oh!

Conversation

againull commented Mar 28, 2025

Uh oh!

againull commented Mar 31, 2025

Uh oh!

Uh oh!

Uh oh!