release/19.x: [MachineLICM] Don't allow hoisting invariant loads across mem barrier. (#116987) #117154

llvmbot · 2024-11-21T12:59:06Z

Backport a9b3ec1 ef102b4

Requested by: @fhahn

llvmbot · 2024-11-21T12:59:15Z

@david-arm What do you think about merging this PR to the release branch?

llvmbot · 2024-11-21T12:59:42Z

@llvm/pr-subscribers-backend-aarch64

Author: None (llvmbot)

Changes

Backport a9b3ec1 ef102b4

Requested by: @fhahn

Full diff: https://github.com/llvm/llvm-project/pull/117154.diff

3 Files Affected:

(modified) llvm/lib/CodeGen/MachineLICM.cpp (+1-1)
(modified) llvm/test/CodeGen/AArch64/machine-licm-hoist-load.ll (+29)
(modified) llvm/test/CodeGen/Mips/lcb5.ll (+2-2)

diff --git a/llvm/lib/CodeGen/MachineLICM.cpp b/llvm/lib/CodeGen/MachineLICM.cpp
index f24ab187ef4005..21a02a6f094784 100644
--- a/llvm/lib/CodeGen/MachineLICM.cpp
+++ b/llvm/lib/CodeGen/MachineLICM.cpp
@@ -1474,7 +1474,7 @@ void MachineLICMBase::InitializeLoadsHoistableLoops() {
       if (!AllowedToHoistLoads[Loop])
         continue;
       for (auto &MI : *MBB) {
-        if (!MI.mayStore() && !MI.isCall() &&
+        if (!MI.isLoadFoldBarrier() && !MI.mayStore() && !MI.isCall() &&
             !(MI.mayLoad() && MI.hasOrderedMemoryRef()))
           continue;
         for (MachineLoop *L = Loop; L != nullptr; L = L->getParentLoop())
diff --git a/llvm/test/CodeGen/AArch64/machine-licm-hoist-load.ll b/llvm/test/CodeGen/AArch64/machine-licm-hoist-load.ll
index e8dafd5e8fbabe..17f8263560430d 100644
--- a/llvm/test/CodeGen/AArch64/machine-licm-hoist-load.ll
+++ b/llvm/test/CodeGen/AArch64/machine-licm-hoist-load.ll
@@ -497,6 +497,35 @@ for.exit:                                 ; preds = %for.body
   ret i64 %spec.select
 }
 
+@a = external local_unnamed_addr global i32, align 4
+
+; Make sure the load is not hoisted out of the loop across memory barriers.
+define i32 @load_between_memory_barriers() {
+; CHECK-LABEL: load_between_memory_barriers:
+; CHECK:       // %bb.0:
+; CHECK-NEXT:    adrp x8, :got:a
+; CHECK-NEXT:    ldr x8, [x8, :got_lo12:a]
+; CHECK-NEXT:  .LBB8_1: // %loop
+; CHECK-NEXT:    // =>This Inner Loop Header: Depth=1
+; CHECK-NEXT:    //MEMBARRIER
+; CHECK-NEXT:    ldr w0, [x8]
+; CHECK-NEXT:    //MEMBARRIER
+; CHECK-NEXT:    cbz w0, .LBB8_1
+; CHECK-NEXT:  // %bb.2: // %exit
+; CHECK-NEXT:    ret
+  br label %loop
+
+loop:
+  fence syncscope("singlethread") acq_rel
+  %l = load i32, ptr @a, align 4
+  fence syncscope("singlethread") acq_rel
+  %c = icmp eq i32 %l, 0
+  br i1 %c, label %loop, label %exit
+
+exit:
+  ret i32 %l
+}
+
 declare i32 @bcmp(ptr, ptr, i64)
 declare i32 @memcmp(ptr, ptr, i64)
 declare void @func()
diff --git a/llvm/test/CodeGen/Mips/lcb5.ll b/llvm/test/CodeGen/Mips/lcb5.ll
index f320f6fc5660ce..bb059f1ee8453e 100644
--- a/llvm/test/CodeGen/Mips/lcb5.ll
+++ b/llvm/test/CodeGen/Mips/lcb5.ll
@@ -186,7 +186,7 @@ if.end:                                           ; preds = %if.then, %entry
 }
 
 ; ci:	.ent	z3
-; ci:	bteqz	$BB6_3
+; ci:	bteqz	$BB6_2
 ; ci:	.end	z3
 
 ; Function Attrs: nounwind optsize
@@ -210,7 +210,7 @@ if.end:                                           ; preds = %if.then, %entry
 
 ; ci:	.ent	z4
 ; ci:	btnez	$BB7_1  # 16 bit inst
-; ci:	jal	$BB7_3	# branch
+; ci:	jal	$BB7_2	# branch
 ; ci:	nop
 ; ci: $BB7_1:
 ; ci:	.p2align	2

tru · 2024-11-25T08:28:10Z

@david-arm Should this be merged?

david-arm · 2024-11-25T08:57:36Z

@david-arm Should this be merged?

Hi yes I think it should be merged. It's a fairly serious bug fix.

(cherry picked from commit a9b3ec1)

llvm#116987) The improvements in 63917e1 / llvm#70796 do not check for memory barriers/unmodelled sideeffects, which means we may incorrectly hoist loads across memory barriers. Fix this by checking any machine instruction in the loop is a load-fold barrier. PR: llvm#116987 (cherry picked from commit ef102b4)

github-actions · 2024-11-25T10:17:19Z

@fhahn (or anyone else). If you would like to add a note about this fix in the release notes (completely optional). Please reply to this comment with a one or two sentence description of the fix. When you are done, please add the release:note label to this PR.

llvmbot added this to the LLVM 19.X Release milestone Nov 21, 2024

llvmbot requested a review from david-arm November 21, 2024 12:59

llvmbot mentioned this pull request Nov 21, 2024

[MachineLICM] Don't allow hoisting invariant loads across mem barrier. #116987

Merged

llvmbot added the backend:AArch64 label Nov 21, 2024

tru force-pushed the issue116987 branch from 4ed7f75 to 086d8e6 Compare November 25, 2024 10:16

fhahn added 2 commits November 25, 2024 11:16

[MachineLICM] Add test case showing load hoisted across memory barrier.

32cbe24

(cherry picked from commit a9b3ec1)

tru merged commit 086d8e6 into llvm:release/19.x Nov 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

release/19.x: [MachineLICM] Don't allow hoisting invariant loads across mem barrier. (#116987) #117154

release/19.x: [MachineLICM] Don't allow hoisting invariant loads across mem barrier. (#116987) #117154

Uh oh!

llvmbot commented Nov 21, 2024

Uh oh!

llvmbot commented Nov 21, 2024

Uh oh!

llvmbot commented Nov 21, 2024

Uh oh!

tru commented Nov 25, 2024

Uh oh!

david-arm commented Nov 25, 2024

Uh oh!

github-actions bot commented Nov 25, 2024

Uh oh!

Uh oh!

release/19.x: [MachineLICM] Don't allow hoisting invariant loads across mem barrier. (#116987) #117154

release/19.x: [MachineLICM] Don't allow hoisting invariant loads across mem barrier. (#116987) #117154

Uh oh!

Conversation

llvmbot commented Nov 21, 2024

Uh oh!

llvmbot commented Nov 21, 2024

Uh oh!

llvmbot commented Nov 21, 2024

Uh oh!

tru commented Nov 25, 2024

Uh oh!

david-arm commented Nov 25, 2024

Uh oh!

github-actions bot commented Nov 25, 2024

Uh oh!

Uh oh!