@@ -68,14 +68,14 @@ not expose this type of partitioning through
68
68
new partitioning type
69
69
`info::partition_property::ext_intel_partition_by_cslice`.
70
70
71
- The only Intel GPU device that currently supports this type of partitioning is
72
- PVC, and this support is only available when the device driver is configured in
73
- {multi-CCS}[multi-CCS] mode. See that documentation for instructions on how to
74
- enable this mode and for other important information. Currently, it is only
75
- possible to partition a device by cslice if the driver is in "2 CCS Mode" or
76
- "4 CCS Mode". When in 2 CCS Mode, a tile can be partitioned into two cslice
77
- sub-devices. When in 4 CCS Mode, a tile can be partitioned into four cslice
78
- sub-devices.
71
+ The only Intel GPU devices that currently support this type of partitioning
72
+ are the Data Center GPU Max series (aka PVC), and this support is only
73
+ available when the device driver is configured in {multi-CCS}[multi-CCS] mode.
74
+ See that documentation for instructions on how to enable this mode and for
75
+ other important information. Currently, it is only possible to partition a
76
+ device by cslice if the driver is in " 2 CCS Mode" or "4 CCS Mode". When in
77
+ 2 CCS Mode, a tile can be partitioned into two cslice sub-devices. When in
78
+ 4 CCS Mode, a tile can be partitioned into four cslice sub-devices.
79
79
80
80
This type of partitioning is currently supported only at the "tile" level.
81
81
Therefore, a device must first be partitioned into per-tile sub-devices via
@@ -84,20 +84,20 @@ be further partitioned by `ext_intel_partition_by_cslice`.
84
84
85
85
It is important to understand that the device driver virtualizes work
86
86
submission to the cslice sub-devices. (More specifically, the device driver
87
- virtualizes work submission to different CCS-es, and this means that on PVC
88
- the work submission to a cslice is virtualized.) This virtualization happens
89
- only between processes, and not within a single process. For example, consider
90
- a single process that constructs two SYCL queues on cslice sub-device #0.
91
- Kernels submitted to these two queues are guaranteed to conflict, both using
92
- the same set of execution units. Therefore, if a single process wants to
93
- explicitly submit kernels to cslice sub-devices and it wants to avoid conflict,
94
- it should create queues on different sub-devices. By contrast, consider an
95
- example where two separate processes create a SYCL queue on cslice sub-device
96
- #0. In this case, the device driver virtualizes access to this cslice, and
97
- kernels submitted from the first process may run on different execution units
98
- than kernels submitted from the second process. In this second case, the
99
- device driver binds the process's requested cslice to a physical cslice
100
- according to the overall system load.
87
+ virtualizes work submission to different CCS-es, and this means that on Data
88
+ Center GPU Max series devices the work submission to a cslice is virtualized.)
89
+ This virtualization happens only between processes, and not within a single
90
+ process. For example, consider a single process that constructs two SYCL
91
+ queues on cslice sub-device #0. Kernels submitted to these two queues are
92
+ guaranteed to conflict, both using the same set of execution units. Therefore,
93
+ if a single process wants to explicitly submit kernels to cslice sub-devices
94
+ and it wants to avoid conflict, it should create queues on different
95
+ sub-devices. By contrast, consider an example where two separate processes
96
+ create a SYCL queue on cslice sub-device #0. In this case, the device driver
97
+ virtualizes access to this cslice, and kernels submitted from the first process
98
+ may run on different execution units than kernels submitted from the second
99
+ process. In this second case, the device driver binds the process's requested
100
+ cslice to a physical cslice according to the overall system load.
101
101
102
102
Note that this extension can be supported by any implementation. If an
103
103
implementation supports a backend or device without the concept of cslice
0 commit comments