1
- = sycl_oneapi_bf16_conversion
1
+ = sycl_ext_oneapi_bfloat16
2
2
3
3
:source-highlighter: coderay
4
4
:coderay-linenums-mode: table
@@ -73,7 +73,7 @@ command (e.g. from `parallel_for`).
73
73
This extension provides a feature-test macro as described in the core SYCL
74
74
specification section 6.3.3 "Feature test macros". Therefore, an implementation
75
75
supporting this extension must predefine the macro
76
- `SYCL_EXT_ONEAPI_BF16_CONVERSION ` to one of the values defined in the table
76
+ `SYCL_EXT_ONEAPI_BFLOAT16 ` to one of the values defined in the table
77
77
below. Applications can test for the existence of this macro to determine if
78
78
the implementation supports this feature, or applications can test the macro’s
79
79
value to determine which of the extension’s APIs the implementation supports.
@@ -91,18 +91,18 @@ the implementation supports this feature, or applications can test the macro’s
91
91
namespace sycl {
92
92
enum class aspect {
93
93
...
94
- ext_oneapi_bf16_conversion
94
+ ext_oneapi_bfloat16
95
95
}
96
96
}
97
97
----
98
98
99
- If a SYCL device has the `ext_oneapi_bf16_conversion ` aspect, then it natively
99
+ If a SYCL device has the `ext_oneapi_bfloat16 ` aspect, then it natively
100
100
supports conversion of values of `float` type to `bfloat16` and back.
101
101
102
102
If the device doesn't have the aspect, objects of `bfloat16` class must not be
103
103
used in the device code.
104
104
105
- **NOTE**: The `ext_oneapi_bf16_conversion ` aspect is not yet supported. The
105
+ **NOTE**: The `ext_oneapi_bfloat16 ` aspect is not yet supported. The
106
106
`bfloat16` class is currently supported only on Xe HP GPU and Nvidia A100 GPU.
107
107
108
108
== New `bfloat16` class
@@ -304,7 +304,7 @@ int main (int argc, char *argv[]) {
304
304
sycl::queue deviceQueue{dev};
305
305
sycl::buffer<float, 1> buf {data, sycl::range<1> {3}};
306
306
307
- if (dev.has(sycl::aspect::ext_oneapi_bf16_conversion )) {
307
+ if (dev.has(sycl::aspect::ext_oneapi_bfloat16 )) {
308
308
deviceQueue.submit ([&] (sycl::handler& cgh) {
309
309
auto numbers = buf.get_access<sycl::access::mode::read_write> (cgh);
310
310
cgh.single_task<class simple_kernel> ([=] () {
@@ -332,5 +332,5 @@ None.
332
332
Add operator overloadings +
333
333
Apply code review suggestions
334
334
|3|2021-08-18|Alexey Sotkin |Remove `uint16_t` constructor
335
- |4|2022-03-07|Jack Kirk |Switch from Intel vendor specific to oneapi
335
+ |4|2022-03-07|Aidan Belton and Jack Kirk |Switch from Intel vendor specific to oneapi
336
336
|========================================
0 commit comments