Skip to content

Commit c3db575

Browse files
committed
Fix doc failure
1 parent f38ba43 commit c3db575

File tree

2 files changed

+3
-3
lines changed

2 files changed

+3
-3
lines changed

llvm/docs/NVPTXUsage.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -150,7 +150,7 @@ Example: 64-bit PTX for CUDA Driver API: ``nvptx64-nvidia-cuda``
150150
.. _nvptx_arch_hierarchy:
151151

152152
NVPTX Architecture Hierarchy and Ordering
153-
========================================
153+
=========================================
154154

155155
GPU architectures: sm_2Y/sm_3Y/sm_5Y/sm_6Y/sm_7Y/sm_8Y/sm_9Y/sm_10Y/sm_12Y
156156
('Y' represents version within the architecture)

llvm/lib/Target/NVPTX/NVPTXSubtarget.h

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -120,9 +120,9 @@ class NVPTXSubtarget : public NVPTXGenSubtargetInfo {
120120
// TMA G2S copy with cta_group::1/2 support
121121
bool hasCpAsyncBulkTensorCTAGroupSupport() const {
122122
// TODO: Update/tidy-up after the family-conditional support arrives
123-
return ((FullSmVersion == 1001 || FullSmVersion == 1011) &&
123+
return ((FullSmVersion == 10011 || FullSmVersion == 10111) &&
124124
PTXVersion >= 86) ||
125-
(FullSmVersion == 1031 && PTXVersion >= 88);
125+
(FullSmVersion == 10311 && PTXVersion >= 88);
126126
}
127127

128128
// Prior to CUDA 12.3 ptxas did not recognize that the trap instruction

0 commit comments

Comments
 (0)