
Commit b03e661

Author: Hugh Delaney
Zeroing out bottom bits in sware impl
1 parent 1499836 commit b03e661

1 file changed

+1
-0
lines changed

sycl/include/sycl/ext/oneapi/matrix/matrix-tensorcore.hpp

Lines changed: 1 addition & 0 deletions
@@ -611,6 +611,7 @@ float float_to_tf32(float a) {
 #else
   uint32_t tmp_uint = reinterpret_cast<uint32_t &>(a);
   tmp_uint += 0x1000u;
+  tmp_uint &= 0xFFFFE000u;
   float ret = reinterpret_cast<float &>(tmp_uint);
   return ret;
 #endif // defined(__SYCL_DEVICE_ONLY__) && defined(__NVPTX__)
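
For illustration, the host fallback path in this hunk can be sketched as a standalone function. This is a minimal sketch, not the header's actual code: the names `float_to_tf32_sw`, `float_bits`, and `float_from_bits` are hypothetical, and `std::memcpy` is swapped in for the `reinterpret_cast` used in the diff to sidestep aliasing concerns. TF32 keeps 10 of the 23 float mantissa bits, so adding `0x1000u` (half of the lowest kept bit) rounds the magnitude, and the mask added by this commit then clears the 13 low bits that remain.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical helper: read the raw IEEE-754 bit pattern of a float.
inline std::uint32_t float_bits(float a) {
  std::uint32_t bits;
  std::memcpy(&bits, &a, sizeof(bits));
  return bits;
}

// Hypothetical helper: build a float from a raw bit pattern.
inline float float_from_bits(std::uint32_t bits) {
  float a;
  std::memcpy(&a, &bits, sizeof(a));
  return a;
}

// Hypothetical standalone version of the software path shown in the diff.
inline float float_to_tf32_sw(float a) {
  std::uint32_t tmp_uint = float_bits(a);
  tmp_uint += 0x1000u;      // add half of bit 13, the lowest bit TF32 keeps
  tmp_uint &= 0xFFFFE000u;  // zero the 13 low mantissa bits (the added line)
  return float_from_bits(tmp_uint);
}
```

Without the mask, the addition alone would leave stale low mantissa bits in the result, which appears to be what the commit title refers to.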
