Skip to content

Commit 7724540

Browse files
jhuber6GeorgeARM
authored andcommitted
[Clang] Fix GPU match any truncating 64-bit lane mask
Summary: This is a lane mask, needs all 64 bits for those wave64 targets. At some point we should introduce __lanemask_t for this.
1 parent 2be7897 commit 7724540

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

clang/lib/Headers/gpuintrin.h

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -261,7 +261,7 @@ __DO_LANE_SUM(double, f64); // double __gpu_lane_sum_f64(m, x)
261261
// Returns a bitmask marking all lanes that have the same value of __x.
262262
_DEFAULT_FN_ATTRS static __inline__ uint64_t
263263
__gpu_match_any_u32_impl(uint64_t __lane_mask, uint32_t __x) {
264-
uint32_t __match_mask = 0;
264+
uint64_t __match_mask = 0;
265265

266266
bool __done = 0;
267267
while (__gpu_ballot(__lane_mask, !__done)) {

0 commit comments

Comments
 (0)