-
Notifications
You must be signed in to change notification settings - Fork 787
[SYCL] Add spec for sycl_ext_intel_cslice #7513
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Add a proposed extension specification that allows partitioning a device by "cslice" (aka CCS-es).
It turns out that ATS-M does not support this type of partitioning, so reword the overview to note that PVC is the only supported device.
Co-authored-by: Ben Ashbaugh <[email protected]>
With this change, on PVC sub-sub-devices now require info::partition_property::ext_intel_partition_by_cslice instead of info::partition_property::partition_by_affinity_domain that wasn't quite accurately describing the actual scheme. On other devices, CSlice-based partitioning is now disabled because that's not how the actual H/W works. If precise manual access to individual CCS is required than sycl_ext_intel_queue_index extension should be used instead. Extension specification is being added in intel#7513.
// Available only when | ||
// Prop == info::partition_property::ext_intel_partition_by_cslice | ||
template <info::partition_property Prop> | ||
std::vector<device> create_sub_devices() const; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Shouldn't it be ext_intel_create_sub_devices
since this is an extension?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think it's OK to use create_sub_devices
since the template parameter already has "ext_intel". Code calling this function will clearly be to an extension because it will look like:
d.create_sub_devices<info::partition_property::ext_intel_partition_by_cslice>();
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@romanovvlad: Does my answer address your concern?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm satisfied with Greg's answer.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1
Ping @intel/dpcpp-specification-reviewers, can someone other than me review this? |
`info::partition_property::ext_intel_partition_by_cslice`. | ||
|
||
The only Intel GPU device that currently supports this type of partitioning is | ||
PVC, and this support is only available when the device driver is configured in |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Replace PVC with "the Data Center GPU Max series, formerly known as Ponte Vecchio".
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done in 17ca6b6
Co-authored-by: Ben Ashbaugh <[email protected]>
With this change, on PVC sub-sub-devices now require `info::partition_property::ext_intel_partition_by_cslice` instead of `info::partition_property::partition_by_affinity_domain` that wasn't quite accurately describing the actual scheme. The old behavior could be temporarily restored via `SYCL_PI_LEVEL_ZERO_EXPOSE_CSLICE_IN_AFFINITY_PARTITIONING` environment variable but it is immediately deprecated, and customers are encouraged to switch to the new partitioning scheme as soon as possible. However, even in this scenario, `sub_sub_device.get_info<info::device::partition_type_property>()` would return `info::partition_property::ext_intel_partition_by_cslice`. That is due to the fact that the whole device hierarchy is pre-populated in the plugin, and we don't know in advance what partitioning would be used in `get_sub_devices` call from SYCL RT. On other devices, CSlice-based partitioning is now disabled because that's not how the actual H/W works. If precise manual access to individual CCS is required than `sycl_ext_intel_queue_index` extension should be used instead. Extension specification is being added in #7513.
Recently, an extension was added here: #7513 This extension allows partitioning a device by "cslice" (aka CCS-es). With this change, on PVC sub-sub-devices now require info::partition_property::ext_intel_partition_by_cslice instead of info::partition_property::partition_by_affinity_domain that wasn't quite accurately describing the actual scheme. Level Zero backend support was added here: #7626 In this change, we are adding support in OpenCL plugin. This change makes use of the cl_intel_command_queue_families recently added. Signed-off-by: Arvind Sudarsanam <[email protected]>
Add a proposed extension specification that allows partitioning a device by "cslice" (aka CCS-es).