[SYCL] Add tests for SYCL2020 sub_groups features #283
const size_t MaxNumSubgroups =
    Kernel.get_info<sycl::info::kernel_device_specific::max_num_sub_groups>(
        Q.get_device());
const size_t SubgroupSize = 32 / MaxNumSubgroups;
Any reason not to use info::kernel_device_specific::max_sub_group_size
instead?
That's actually a nice piece of advice. kernel_device_specific::max_num_sub_groups
returns the maximum number of sub-groups, not the number that was actually run. I updated the implementation to use the correct values.
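To illustrate the point above, here is a minimal sketch in plain C++ (no SYCL runtime needed) of why dividing the work-group size by the maximum sub-group count does not give the sub-group size. The helper names and the numbers in the comments are hypothetical and are not taken from the patch; the only assumption is that a work-group is split into sub-groups of at most `sub_group_size` work-items, with the last one possibly partially filled.

```cpp
#include <cstddef>

// Hypothetical helper: how many sub-groups a work-group of `local_size`
// work-items is split into when each sub-group holds at most
// `sub_group_size` work-items (ceiling division).
constexpr std::size_t num_sub_groups(std::size_t local_size,
                                     std::size_t sub_group_size) {
  return (local_size + sub_group_size - 1) / sub_group_size;
}

// The derivation questioned in the review: divide the work-group size by
// the *maximum* sub-group count reported for the kernel on the device.
constexpr std::size_t size_from_max_count(std::size_t local_size,
                                          std::size_t max_num_sub_groups) {
  return local_size / max_num_sub_groups;
}

// Illustrative values only: a work-group of 32 items that actually runs
// with sub-groups of 16 has 2 sub-groups. If the reported maximum count
// were 64, the questioned formula would yield 0, not 16.
static_assert(num_sub_groups(32, 16) == 2, "two full sub-groups");
static_assert(size_from_max_count(32, 64) == 0, "upper bound gives wrong size");
```

The two quantities agree only when the reported maximum happens to equal the count that was actually launched, which is why querying the sub-group size directly is the safer choice in a test.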
@alexbatashev, the newly added test is failing on CUDA. Could you please fix it?
I disabled the test on CUDA because the SYCL tools seemingly do not generate enough information for the runtime to function correctly. I'll create a tracker for that. As for the Windows regression, it does not seem to be related to my patch.
Could you please create tickets for the exposed issues?
I added you to the watchers for the created issues.
Could you split the test (move the common part to a header)? You have just doubled the test execution time, which may cause timeouts on some configurations.
I switched to use of
Link to feature: intel/llvm#3765