You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[SYCL][CUDA] Improve group scan/reduce algorithms performance (#2902)
Proposed changes should improve general performance of group inclusive/exclusive scans and reduces for NVidia backend.
Main idea is to replace local memory usage/barriers with some more computations.
0 commit comments