Skip to content

[SYCL] Add spec for sycl_ext_intel_cslice #7513

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 8 commits into from
Dec 9, 2022

Conversation

gmlueck
Copy link
Contributor

@gmlueck gmlueck commented Nov 23, 2022

Add a proposed extension specification that allows partitioning a device by "cslice" (aka CCS-es).

Add a proposed extension specification that allows partitioning a
device by "cslice" (aka CCS-es).
@gmlueck gmlueck requested a review from a team as a code owner November 23, 2022 16:11
It turns out that ATS-M does not support this type of partitioning, so
reword the overview to note that PVC is the only supported device.
aelovikov-intel added a commit to aelovikov-intel/llvm that referenced this pull request Dec 2, 2022
With this change, on PVC sub-sub-devices now require
info::partition_property::ext_intel_partition_by_cslice instead of
info::partition_property::partition_by_affinity_domain that wasn't quite
accurately describing the actual scheme.

On other devices, CSlice-based partitioning is now disabled because
that's not how the actual H/W works. If precise manual access to
individual CCS is required than sycl_ext_intel_queue_index extension
should be used instead.

Extension specification is being added in intel#7513.
// Available only when
// Prop == info::partition_property::ext_intel_partition_by_cslice
template <info::partition_property Prop>
std::vector<device> create_sub_devices() const;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Shouldn't it be ext_intel_create_sub_devices since this is an extension?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it's OK to use create_sub_devices since the template parameter already has "ext_intel". Code calling this function will clearly be to an extension because it will look like:

d.create_sub_devices<info::partition_property::ext_intel_partition_by_cslice>();

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@romanovvlad: Does my answer address your concern?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm satisfied with Greg's answer.

Copy link
Contributor

@jandres742 jandres742 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

@gmlueck
Copy link
Contributor Author

gmlueck commented Dec 8, 2022

Ping @intel/dpcpp-specification-reviewers, can someone other than me review this?

`info::partition_property::ext_intel_partition_by_cslice`.

The only Intel GPU device that currently supports this type of partitioning is
PVC, and this support is only available when the device driver is configured in
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Replace PVC with "the Data Center GPU Max series, formerly known as Ponte Vecchio".

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done in 17ca6b6

@jbrodman jbrodman self-requested a review December 8, 2022 22:15
@pvchupin pvchupin merged commit 5777e1f into intel:sycl Dec 9, 2022
steffenlarsen pushed a commit that referenced this pull request Dec 12, 2022
With this change, on PVC sub-sub-devices now require
`info::partition_property::ext_intel_partition_by_cslice` instead of
`info::partition_property::partition_by_affinity_domain` that wasn't
quite accurately describing the actual scheme.

The old behavior could be temporarily restored via
`SYCL_PI_LEVEL_ZERO_EXPOSE_CSLICE_IN_AFFINITY_PARTITIONING` environment
variable but it is immediately deprecated, and customers are encouraged
to switch to the new partitioning scheme as soon as possible.

However, even in this scenario,
`sub_sub_device.get_info<info::device::partition_type_property>()` would
return `info::partition_property::ext_intel_partition_by_cslice`. That
is due to the fact that the whole device hierarchy is pre-populated in
the plugin, and we don't know in advance what partitioning would be used
in `get_sub_devices` call from SYCL RT.

On other devices, CSlice-based partitioning is now disabled because
that's not how the actual H/W works. If precise manual access to
individual CCS is required than `sycl_ext_intel_queue_index` extension
should be used instead.

Extension specification is being added in
#7513.
@gmlueck gmlueck deleted the gmlueck/cslice branch December 19, 2022 15:42
againull pushed a commit that referenced this pull request Feb 7, 2023
Recently, an extension was added here:
#7513
This extension allows partitioning a device by "cslice" (aka CCS-es).
With this change, on PVC sub-sub-devices now require
info::partition_property::ext_intel_partition_by_cslice instead of
info::partition_property::partition_by_affinity_domain that wasn't quite
accurately describing the actual scheme.
Level Zero backend support was added here:
#7626
In this change, we are adding support in OpenCL plugin.

This change makes use of the cl_intel_command_queue_families recently
added.

Signed-off-by: Arvind Sudarsanam <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants