You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[ROCM] Navi21 Enablement 9: Range and Multinomial Kernels (pytorch#73550)
Summary:
This PR is a follow up to the following prs.
pytorch#69942pytorch#72682pytorch#72809pytorch#73543pytorch#73545pytorch#73546pytorch#73548pytorch#73549
We are adding support to Navi21 GPUs which have a warpsize of 32. We cannot rely on a constant so we have to dynamically look up the warpsize when launching the kernel on the host side. Inside device functions this is not needed and the compiler can correctly detect the correct warpsize to replace the C10_WARP_SIZE constant.
Pull Request resolved: pytorch#73550
Reviewed By: malfet
Differential Revision: D35444958
Pulled By: ngimel
fbshipit-source-id: c65f06d3227c23bb097a71fc6c86e3f884114e04
(cherry picked from commit 7f3ba52)
0 commit comments