[SYCL] Add test for reductions using the resource pool #873

steffenlarsen · 2022-02-25T16:46:42Z

The current implementation of SYCL reductions may require some additional device-memory resources. To reduce the overhead of allocating memory every time a reduction runs, a resource pool is employed. These changes add a test to ensure that the resource pool prevents repeated allocations for reductions needing these resources.
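To illustrate the behaviour the test is meant to guard, here is a minimal host-only sketch of the pooling idea in plain C++. It is not the intel/llvm implementation: `ResourcePool`, `acquire`, `release`, and `allocations_after` are hypothetical names, and a `std::vector<char>` stands in for device memory. It only shows that once a released buffer is cached, repeated requests of the same size stop triggering new allocations.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical pool: hands out byte buffers and caches released ones by size.
class ResourcePool {
public:
  using Buffer = std::unique_ptr<std::vector<char>>;

  // Reuse a cached buffer of exactly `bytes` if one is free, else allocate.
  Buffer acquire(std::size_t bytes) {
    auto it = free_.find(bytes);
    if (it != free_.end()) {
      Buffer buf = std::move(it->second);
      free_.erase(it);
      return buf;
    }
    ++allocations_;
    return std::make_unique<std::vector<char>>(bytes);
  }

  // Return a buffer so a later acquire() of the same size can reuse it.
  void release(Buffer buf) {
    assert(buf && "cannot release a null buffer");
    const std::size_t bytes = buf->size();
    free_.emplace(bytes, std::move(buf));
  }

  // Number of real allocations performed so far.
  std::size_t allocations() const { return allocations_; }

private:
  std::multimap<std::size_t, Buffer> free_;
  std::size_t allocations_ = 0;
};

// Stand-in for running a reduction that needs scratch memory `runs` times.
std::size_t allocations_after(int runs, std::size_t scratch_bytes) {
  ResourcePool pool;
  for (int i = 0; i < runs; ++i) {
    auto scratch = pool.acquire(scratch_bytes);
    pool.release(std::move(scratch));
  }
  return pool.allocations();
}
```

In this sketch, calling `allocations_after(5, 64)` performs a single allocation, because the four later iterations reuse the buffer released by the first. A real test would presumably check the analogous property against actual device allocations.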

steffenlarsen · 2022-02-25T16:47:12Z

This requires:

Signed-off-by: Steffen Larsen <[email protected]>

vladimirlaz · 2022-02-28T08:41:52Z

/verify with intel/llvm#5662

vladimirlaz · 2022-02-28T08:44:56Z

SYCL/Reduction/reduction_aux_resources.cpp

@@ -0,0 +1,35 @@
+// RUN: %clangxx -fsycl -fsycl-targets=%sycl_triple %s -o %t.out -Xsycl-target-backend=nvptx64-nvidia-cuda --cuda-gpu-arch=sm_60


The test is built for the CUDA backend.
Is that expected?

Suggested change

-// RUN: %clangxx -fsycl -fsycl-targets=%sycl_triple %s -o %t.out -Xsycl-target-backend=nvptx64-nvidia-cuda --cuda-gpu-arch=sm_60
+// RUN: %clangxx -fsycl -fsycl-targets=%sycl_triple %s -o %t.out

These options only take effect when the SYCL target is the CUDA backend; they are ignored otherwise. All they do is instruct NVPTX code generation to target SM60, which I believe is required here to enable the extended set of atomic operations used by reductions.

steffenlarsen requested a review from a team as a code owner February 25, 2022 16:46

steffenlarsen requested a review from vladimirlaz February 25, 2022 16:46

steffenlarsen mentioned this pull request Feb 25, 2022

Draft: [SYCL] Implement resource pool for implementation allocations intel/llvm#5662

Closed

Fix formatting

025e13e

Signed-off-by: Steffen Larsen <[email protected]>

vladimirlaz approved these changes Feb 28, 2022

vladimirlaz suggested changes Feb 28, 2022

steffenlarsen closed this May 27, 2022

myler pushed a commit to myler/llvm-test-suite that referenced this pull request Jun 17, 2022

Merge pull request intel#873 from jiezzhang/CMPLRTST-16222

cb020ab

Fix CMPLRTST-16222: disable hier_par_wgscope_O0 on discrete GPUs because of a hang issue
