@@ -11,19 +11,19 @@ Release notes for commit range 23ca0c2..27f59d8
11
11
which can be used specify how many processes the compiler can use for
12
12
linking the device code [ c2221f0]
13
13
### SYCL Library
14
- - Added support for [ default context extension] ( doc/extensions/supported/SYCL_EXT_ONEAPI_DEFAULT_CONTEXT .asciidoc )
14
+ - Added support for [ default context extension] ( doc/extensions/supported/sycl_ext_oneapi_default_context .asciidoc )
15
15
on Linux [ 315593d]
16
- - Added experimental support for [ group sorting algorithm] ( doc/extensions/experimental/SYCL_EXT_ONEAPI_GROUP_SORT .asciidoc )
16
+ - Added experimental support for [ group sorting algorithm] ( doc/extensions/experimental/sycl_ext_oneapi_group_sort .asciidoc )
17
17
[ 932ae56]
18
- - Added support for [ sub-group mask extension] ( doc/extensions/supported/SYCL_EXT_ONEAPI_SUB_GROUP_MASK .asciidoc )
18
+ - Added support for [ sub-group mask extension] ( doc/extensions/supported/sycl_ext_oneapi_sub_group_mask .asciidoc )
19
19
[ 78a3e77]
20
20
- Added ` sycl::ext::intel::experimental::esimd::simd_mask ` as a replaced for
21
21
` sycl::ext::intel::experimental::esimd::mask_type_t ` to represent Gen
22
22
predicates [ 01351f1]
23
23
- Added stripped PDBs for SYCL libraries [ 6e5dd483]
24
24
- Added support ESIMD emulator backend [ f4ad3c11]
25
25
- Added support for [ FPGA DSP control] ( https://github.com/KhronosGroup/SPIRV-Registry/blob/main/extensions/INTEL/SPV_INTEL_fpga_dsp_control.asciidoc ) extension [ 790aa8ba]
26
- - Implemented [ discard_events extension] ( doc/extensions/supported/SYCL_EXT_ONEAPI_DISCARD_QUEUE_EVENTS .asciidoc )
26
+ - Implemented [ discard_events extension] ( doc/extensions/supported/sycl_ext_oneapi_discard_queue_events .asciidoc )
27
27
[ 9542e288]
28
28
- Extended XPTI notifications with information about SYCL memory management
29
29
[ a068b154] [ 8f9d0d2b]
@@ -43,17 +43,17 @@ Release notes for commit range 23ca0c2..27f59d8
43
43
Level Zero, ESIMD emulator, HIP [ 2b0ebab376dc]
44
44
- Added support for ` sycl::ext::intel::experimental::esimd_ballot ` function
45
45
[ 0bbb091c1baa]
46
- - Added initial support for [ Tensorcore matrix extension] ( doc/extensions/experimental/SYCL_EXT_ONEAPI_MATRIX .asciidoc )
46
+ - Added initial support for [ Tensorcore matrix extension] ( doc/extensions/experimental/sycl_ext_oneapi_matrix .asciidoc )
47
47
[ 711ba58c30a8]
48
48
49
49
### Documentation
50
- - Added [ device global extension specification] ( doc/extensions/proposed/SYCL_EXT_ONEAPI_DEVICE_GLOBAL .asciidoc )
50
+ - Added [ device global extension specification] ( doc/extensions/proposed/sycl_ext_oneapi_device_global .asciidoc )
51
51
[ d3e70d4]
52
- - Added [ property list extension specification] ( doc/extensions/proposed/SYCL_EXT_ONEAPI_PROPERTIES .asciidoc )
52
+ - Added [ property list extension specification] ( doc/extensions/proposed/sycl_ext_oneapi_properties .asciidoc )
53
53
[ a7da8b4]
54
- - Added extension specification for [ discard queue events] ( doc/extensions/supported/SYCL_EXT_ONEAPI_DISCARD_QUEUE_EVENTS .asciidoc )
54
+ - Added extension specification for [ discard queue events] ( doc/extensions/supported/sycl_ext_oneapi_discard_queue_events .asciidoc )
55
55
[ 23ca24bb]
56
- - Added [ KernelProperties] ( doc/extensions/proposed/SYCL_EXT_ONEAPI_KERNEL_PROPERTIES .asciidoc )
56
+ - Added [ KernelProperties] ( doc/extensions/proposed/sycl_ext_oneapi_kernel_properties .asciidoc )
57
57
extension [ 64f5e707daed]
58
58
59
59
## Improvements
@@ -185,30 +185,30 @@ Release notes for commit range 23ca0c2..27f59d8
185
185
### Tools
186
186
- Improved ` sycl-ls ` device indexing to handle new backends [ 0cee18e]
187
187
### Documentation
188
- - Updated [ Level-Zero backend extension] ( doc/extensions/LevelZeroBackend/LevelZeroBackend .md )
188
+ - Updated [ Level-Zero backend extension] ( doc/extensions/supported/sycl_ext_oneapi_backend_level_zero .md )
189
189
to align with the SYCL 2020 specification [ 8bbebf5]
190
- - Updated [ ITT instrumentation documentation] ( doc/ITTAnnotations.md ) [ 9c0508b]
191
- - Extended [ FPGA lsu] ( ./sycl/ doc/extensions/proposed/SYCL_EXT_INTEL_FPGA_LSU .md)
192
- and [ SYCL_INTEL_data_flow_pipes] ( ./sycl/ doc/extensions/proposed/SYCL_EXT_INTEL_DATAFLOW_PIPES .asciidoc)
190
+ - Updated [ ITT instrumentation documentation] ( doc/design/ ITTAnnotations.md ) [ 9c0508b]
191
+ - Extended [ FPGA lsu] ( doc/extensions/proposed/sycl_ext_intel_fpga_lsu .md )
192
+ and [ SYCL_INTEL_data_flow_pipes] ( doc/extensions/proposed/sycl_ext_intel_dataflow_pipes .asciidoc )
193
193
extensions to with latency control feature [ 5ab3cd3a]
194
194
- Updated OpenCL CPU and FPGA emulator runtimes configuration on Windows to
195
195
use OpenCL ICD registry records instead of ` OCL_ICD_FILENAMES ` . That is
196
196
done to use the latest OpenCL ICD loader which ignores ` OCL_ICD_FILENAMES `
197
197
configuration in the administrative console [ 92bcb419]
198
- - Improved [ group sorting spec] ( ./sycl/ doc/extensions/experimental/SYCL_EXT_ONEAPI_GROUP_SORT .asciidoc) [ f1c3506b]
199
- - Clarified restrictions on [ device global variables] ( doc/extensions/DeviceGlobal/SYCL_INTEL_device_global .asciidoc )
198
+ - Improved [ group sorting spec] ( doc/extensions/experimental/sycl_ext_oneapi_group_sort .asciidoc ) [ f1c3506b]
199
+ - Clarified restrictions on [ device global variables] ( doc/extensions/DeviceGlobal/sycl_intel_device_global .asciidoc )
200
200
[ 589d48844edb]
201
- - Deprecated [ group algorithms] ( doc/extensions/SYCL_INTEL_group_algorithms .asciidoc )
202
- and [ subgroup extensions] ( doc/extensions/SubGroup/SYCL_INTEL_sub_group .asciidoc )
201
+ - Deprecated [ group algorithms] ( doc/extensions/deprecated/sycl_ext_oneapi_group_algorithms .asciidoc )
202
+ and [ subgroup extensions] ( doc/extensions/deprecated/sycl_ext_oneapi_sub_group .asciidoc )
203
203
as it's superseded by SYCL2020 [ 7dc82742b5ae] [ d2a4f502b558]
204
204
- Removed ` SubGroupAlgorithms ` extension [ cd5417b13a27]
205
- - Added proposal to add [ query] ( doc/extensions/IntelGPU/IntelGPUDeviceInfo .md )
205
+ - Added proposal to add [ query] ( doc/extensions/supported/sycl_ext_intel_device_info .md )
206
206
for number of hardware threads per execution unit [ 5b771a9c4eca]
207
207
- Updated the experimental matrix interface to match new AMX/DPAS JIT
208
208
implementation [ 6495575258a8]
209
209
- Updated test section of the contribution guide [ 145b9e782258]
210
210
- Updated CUDA documentation with Windows support details [ 1cf024ac32a3]
211
- - Added FPGA properties to [ device global] ( doc/extensions/DeviceGlobal/SYCL_INTEL_device_global .asciidoc )
211
+ - Added FPGA properties to [ device global] ( doc/extensions/proposed/sycl_ext_oneapi_device_global .asciidoc )
212
212
specification [ fd2bd6e1eac7]
213
213
- Added clarifications in environment variables documentation [ d5ba0dbb4912]
214
214
@@ -363,13 +363,27 @@ Release notes for commit range 23ca0c2..27f59d8
363
363
- Removed ` half ` class from global namespace [ c9128e68ef39]
364
364
365
365
## Known issues
366
+ - [ new] Having MESA OpenCL implementation which provides no devices on a
367
+ system may cause incorrect device discovery. As a workaround such an OpenCL
368
+ implementation can be disabled by removing ` /etc/OpenCL/vendor/mesa.icd ` .
369
+ - [ new] Compilation may fail on Windows in debug mode if a kernel uses
370
+ ` std::array ` . This happens because debug version of ` std::array ` in
371
+ Microsoft STL C++ headers calls functions that are illegal for the device
372
+ code. As a workaround the following can be done:
373
+ 1 . Dump compiler pipeline execution strings by passing ` -### ` option to the
374
+ compiler. The compiler will print the internal execution strings of
375
+ compilation tools. The actual compilation will not happen.
376
+ 2 . Modify the (usually) first execution string (it should have
377
+ ` -fsycl-is-device ` option) by adding
378
+ ` -D_CONTAINER_DEBUG_LEVEL=0 -D_ITERATOR_DEBUG_LEVEL=0 ` options to the
379
+ end of the string. Execute all string one by one.
366
380
- [ new] ` -fsycl-dead-args-optimization ` can't help eliminate offset of
367
381
accessor even though it's created with no offset specified
368
382
- [ new] ` cuMemPrefetchAsync ` has issues on Windows. Hence, using
369
- ` sycl::queue::prefetch ` API` on Windows might lead to failure [ 0c33048e4926]
383
+ ` sycl::queue::prefetch ` API on Windows might lead to failure [ 0c33048e4926]
370
384
- SYCL 2020 barriers show worse performance than SYCL 1.2.1 do [ 18c80fa]
371
385
- [ new] When using fallback assert in separate compilation flow it requires
372
- explicit, linking against ` lib/libsycl-fallback-cassert.o ` or
386
+ explicit linking against ` lib/libsycl-fallback-cassert.o ` or
373
387
` lib/libsycl-fallback-cassert.spv `
374
388
- [ new] Performance may be impacted by JIT-ing an extra 'copier' kernel and due
375
389
running the 'copier' kernel and host-task after each kernel which uses
0 commit comments