Skip to content

Commit 17ca6b6

Browse files
committed
Use correct marketing name for PVC
1 parent b93f2fe commit 17ca6b6

File tree

1 file changed

+22
-22
lines changed

1 file changed

+22
-22
lines changed

sycl/doc/extensions/proposed/sycl_ext_intel_cslice.asciidoc

Lines changed: 22 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -68,14 +68,14 @@ not expose this type of partitioning through
6868
new partitioning type
6969
`info::partition_property::ext_intel_partition_by_cslice`.
7070

71-
The only Intel GPU device that currently supports this type of partitioning is
72-
PVC, and this support is only available when the device driver is configured in
73-
{multi-CCS}[multi-CCS] mode. See that documentation for instructions on how to
74-
enable this mode and for other important information. Currently, it is only
75-
possible to partition a device by cslice if the driver is in "2 CCS Mode" or
76-
"4 CCS Mode". When in 2 CCS Mode, a tile can be partitioned into two cslice
77-
sub-devices. When in 4 CCS Mode, a tile can be partitioned into four cslice
78-
sub-devices.
71+
The only Intel GPU devices that currently support this type of partitioning
72+
are the Data Center GPU Max series (aka PVC), and this support is only
73+
available when the device driver is configured in {multi-CCS}[multi-CCS] mode.
74+
See that documentation for instructions on how to enable this mode and for
75+
other important information. Currently, it is only possible to partition a
76+
device by cslice if the driver is in "2 CCS Mode" or "4 CCS Mode". When in
77+
2 CCS Mode, a tile can be partitioned into two cslice sub-devices. When in
78+
4 CCS Mode, a tile can be partitioned into four cslice sub-devices.
7979

8080
This type of partitioning is currently supported only at the "tile" level.
8181
Therefore, a device must first be partitioned into per-tile sub-devices via
@@ -84,20 +84,20 @@ be further partitioned by `ext_intel_partition_by_cslice`.
8484

8585
It is important to understand that the device driver virtualizes work
8686
submission to the cslice sub-devices. (More specifically, the device driver
87-
virtualizes work submission to different CCS-es, and this means that on PVC
88-
the work submission to a cslice is virtualized.) This virtualization happens
89-
only between processes, and not within a single process. For example, consider
90-
a single process that constructs two SYCL queues on cslice sub-device #0.
91-
Kernels submitted to these two queues are guaranteed to conflict, both using
92-
the same set of execution units. Therefore, if a single process wants to
93-
explicitly submit kernels to cslice sub-devices and it wants to avoid conflict,
94-
it should create queues on different sub-devices. By contrast, consider an
95-
example where two separate processes create a SYCL queue on cslice sub-device
96-
#0. In this case, the device driver virtualizes access to this cslice, and
97-
kernels submitted from the first process may run on different execution units
98-
than kernels submitted from the second process. In this second case, the
99-
device driver binds the process's requested cslice to a physical cslice
100-
according to the overall system load.
87+
virtualizes work submission to different CCS-es, and this means that on Data
88+
Center GPU Max series devices the work submission to a cslice is virtualized.)
89+
This virtualization happens only between processes, and not within a single
90+
process. For example, consider a single process that constructs two SYCL
91+
queues on cslice sub-device #0. Kernels submitted to these two queues are
92+
guaranteed to conflict, both using the same set of execution units. Therefore,
93+
if a single process wants to explicitly submit kernels to cslice sub-devices
94+
and it wants to avoid conflict, it should create queues on different
95+
sub-devices. By contrast, consider an example where two separate processes
96+
create a SYCL queue on cslice sub-device #0. In this case, the device driver
97+
virtualizes access to this cslice, and kernels submitted from the first process
98+
may run on different execution units than kernels submitted from the second
99+
process. In this second case, the device driver binds the process's requested
100+
cslice to a physical cslice according to the overall system load.
101101

102102
Note that this extension can be supported by any implementation. If an
103103
implementation supports a backend or device without the concept of cslice

0 commit comments

Comments
 (0)