File tree Expand file tree Collapse file tree 2 files changed +3
-3
lines changed Expand file tree Collapse file tree 2 files changed +3
-3
lines changed Original file line number Diff line number Diff line change @@ -150,7 +150,7 @@ Example: 64-bit PTX for CUDA Driver API: ``nvptx64-nvidia-cuda``
150
150
.. _nvptx_arch_hierarchy :
151
151
152
152
NVPTX Architecture Hierarchy and Ordering
153
- ========================================
153
+ =========================================
154
154
155
155
GPU architectures: sm_2Y/sm_3Y/sm_5Y/sm_6Y/sm_7Y/sm_8Y/sm_9Y/sm_10Y/sm_12Y
156
156
('Y' represents version within the architecture)
Original file line number Diff line number Diff line change @@ -120,9 +120,9 @@ class NVPTXSubtarget : public NVPTXGenSubtargetInfo {
120
120
// TMA G2S copy with cta_group::1/2 support
121
121
bool hasCpAsyncBulkTensorCTAGroupSupport () const {
122
122
// TODO: Update/tidy-up after the family-conditional support arrives
123
- return ((FullSmVersion == 1001 || FullSmVersion == 1011 ) &&
123
+ return ((FullSmVersion == 10011 || FullSmVersion == 10111 ) &&
124
124
PTXVersion >= 86 ) ||
125
- (FullSmVersion == 1031 && PTXVersion >= 88 );
125
+ (FullSmVersion == 10311 && PTXVersion >= 88 );
126
126
}
127
127
128
128
// Prior to CUDA 12.3 ptxas did not recognize that the trap instruction
You can’t perform that action at this time.
0 commit comments