[ESIMD] Fix atomic_update() implementation for N=16 and N=32 on Gen12 #12722

v-klochkov · 2024-02-15T03:49:54Z

atomic_update() for USM and ACC N=16,32 were lowered to SVM/DWORD atomic
intrinsics even though the HW instructions on Gen12 supported only
N up to 8 for USM and up to 16 for ACC.

GPU had legalization pass for N that split longer vectors to smaller and available in HW.
That GPU optimization/legalization workes incorrectly for USM as it
splits longer vectors assuming instruction is available for N=16 in case
of USM, which is not correct.

The patch here implements splitting of N=16 and N=32 cases for
atomic_update(usm, ...) to N=8 vectors until GPU fixes the legalization
for USM atomic_update.

atomic_update() for USM and ACC were lowered to SVM/DWORD atomic intrinsics even though the HW instructions on Gen12 supported only N up to 8 for USM and up to 16 for ACC. GPU had legalization pass for N that split longer vectors to smaller and available in HW. That GPU optimization/legalization workes incorrectly for USM as it splits longer vectors assuming instruction is available for N=16 in case of USM, which is not correct. The patch here implements splitting of N=16 and N=32 cases for atomic_update(usm, ...) to N=8 vectors until GPU fixes the legalization for USM atomic_update. Signed-off-by: Klochkov, Vyacheslav N <[email protected]>

v-klochkov force-pushed the esimd_fix_atomic_update_gen12 branch from a626b8e to 9c3515a Compare February 15, 2024 03:52

v-klochkov temporarily deployed to WindowsCILock February 15, 2024 03:53 — with GitHub Actions Inactive

v-klochkov temporarily deployed to WindowsCILock February 15, 2024 04:14 — with GitHub Actions Inactive

v-klochkov force-pushed the esimd_fix_atomic_update_gen12 branch from 9c3515a to 732d63e Compare February 15, 2024 15:58

v-klochkov temporarily deployed to WindowsCILock February 15, 2024 16:06 — with GitHub Actions Inactive

v-klochkov temporarily deployed to WindowsCILock February 15, 2024 16:43 — with GitHub Actions Inactive

v-klochkov marked this pull request as ready for review February 15, 2024 18:34

v-klochkov requested a review from a team as a code owner February 15, 2024 18:34

v-klochkov requested a review from fineg74 February 15, 2024 18:35

fineg74 approved these changes Feb 15, 2024

View reviewed changes

v-klochkov merged commit 44a74d0 into intel:sycl Feb 15, 2024

v-klochkov deleted the esimd_fix_atomic_update_gen12 branch February 16, 2024 16:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ESIMD] Fix atomic_update() implementation for N=16 and N=32 on Gen12 #12722

[ESIMD] Fix atomic_update() implementation for N=16 and N=32 on Gen12 #12722

Uh oh!

v-klochkov commented Feb 15, 2024 •

edited

Loading

Uh oh!

Uh oh!

[ESIMD] Fix atomic_update() implementation for N=16 and N=32 on Gen12 #12722

[ESIMD] Fix atomic_update() implementation for N=16 and N=32 on Gen12 #12722

Uh oh!

Conversation

v-klochkov commented Feb 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

v-klochkov commented Feb 15, 2024 •

edited

Loading