[SYCL][CUDA] Update program manager and queue to resolve multi-targeting issues #4921

AidanBeltonS · 2021-11-09T13:09:52Z

This PR makes two changes, the first is it moves the macro which prevents __devicelib_assert_read being used for nvptx64 devices. This is done to resolve an issue where the binary images of spirv64 and nvptx64 are neither identical nor disjoint (have no kernels in common). The program manager needs binary images to be identical or disjoint, it fails otherwise. This creates a kernel of the same name as when building for spirv64 but it does not use __devicelib_assert_read.
The second it prevents errors being thrown in the program manager when the binaries compatibility check returns false. This is to allow for multi-targeting to be used with module splitting.
A cuda and hip only regression test is added to check for successful compilation with multi-targeting and module splitting options.

Proposed solution to: #3631

s-kanaev · 2021-11-09T13:33:59Z

sycl/source/detail/program_manager/program_manager.cpp

      PIDeviceHandle, &DevBin,
      /*num bin images = */ (cl_uint)1, &SuitableImageID);
+  if (Error != PI_SUCCESS && Error != PI_INVALID_BINARY)


What are effects of allowing invalid binary for the caller?

PI_INVALID_BINARY is returned when piextDeviceSelectBinary cannot find any suitable image within the passed list. This is a valid response, in this case, as it is checking if the binary is suitable for the plugin.

s-kanaev · 2021-11-09T13:34:47Z

sycl/include/CL/sycl/queue.hpp

@@ -67,7 +67,7 @@

 // Helper macro to identify if fallback assert is needed
 // FIXME remove __NVPTX__ condition once devicelib supports CUDA
-#if !defined(SYCL_DISABLE_FALLBACK_ASSERT) && !defined(__NVPTX__)


This condition shouldn't be modified at the moment as it's due CUDA native support of assertions.

This was something I was struggling with for a while. When building for both cuda and opencl it can fail, based on the target order (#3631), because there is a kernel in spirv64 which is not in nvptx64 _ZTSN2cl4sycl6detail23__sycl_service_kernel__16AssertInfoCopierE. The program_manager requires binary images to either be identical or disjoint. So I am proposing removing this to create a version of this kernel which does not perform any action; similar to amdgcn which will support native asserts but generates this kernel. I am not seeing any failing tests as a result of this change. Please let me know if there is a better way of achieving this. Many thanks!

vladimirlaz · 2021-11-19T08:05:35Z

@s-kanaev, are you ok with the responses to your comments?

bader · 2021-11-30T10:10:13Z

@s-kanaev, ping.

s-kanaev

I'm fine with the changes.

AidanBeltonS · 2021-12-01T10:31:24Z

PR to add device testing to llvm-test-suite: intel/llvm-test-suite#593

update program manager and queue for multi-targeting

d1591b1

AidanBeltonS requested a review from a team as a code owner November 9, 2021 13:09

AidanBeltonS requested review from vladimirlaz and s-kanaev November 9, 2021 13:09

AidanBeltonS mentioned this pull request Nov 9, 2021

[SYCL] Target ordering breaks compilation #3631

Closed

s-kanaev reviewed Nov 9, 2021

View reviewed changes

vladimirlaz requested a review from s-kanaev November 17, 2021 08:38

vladimirlaz approved these changes Nov 25, 2021

View reviewed changes

s-kanaev approved these changes Nov 30, 2021

View reviewed changes

bader merged commit a346c08 into intel:sycl Nov 30, 2021

AidanBeltonS mentioned this pull request Dec 1, 2021

[SYCL][CUDA][HIP] Regression test with multiple compiler targets intel/llvm-test-suite#593

Merged

AidanBeltonS deleted the multi-targeting branch December 1, 2021 10:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL][CUDA] Update program manager and queue to resolve multi-targeting issues #4921

[SYCL][CUDA] Update program manager and queue to resolve multi-targeting issues #4921

Uh oh!

AidanBeltonS commented Nov 9, 2021

Uh oh!

s-kanaev Nov 9, 2021

Uh oh!

AidanBeltonS Nov 9, 2021

Uh oh!

s-kanaev Nov 9, 2021

Uh oh!

AidanBeltonS Nov 9, 2021 •

edited

Loading

Uh oh!

vladimirlaz commented Nov 19, 2021

Uh oh!

bader commented Nov 30, 2021

Uh oh!

s-kanaev left a comment

Uh oh!

AidanBeltonS commented Dec 1, 2021

Uh oh!

Uh oh!

[SYCL][CUDA] Update program manager and queue to resolve multi-targeting issues #4921

[SYCL][CUDA] Update program manager and queue to resolve multi-targeting issues #4921

Uh oh!

Conversation

AidanBeltonS commented Nov 9, 2021

Uh oh!

s-kanaev Nov 9, 2021

Choose a reason for hiding this comment

Uh oh!

AidanBeltonS Nov 9, 2021

Choose a reason for hiding this comment

Uh oh!

s-kanaev Nov 9, 2021

Choose a reason for hiding this comment

Uh oh!

AidanBeltonS Nov 9, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

vladimirlaz commented Nov 19, 2021

Uh oh!

bader commented Nov 30, 2021

Uh oh!

s-kanaev left a comment

Choose a reason for hiding this comment

Uh oh!

AidanBeltonS commented Dec 1, 2021

Uh oh!

Uh oh!

AidanBeltonS Nov 9, 2021 •

edited

Loading