[flang] Enable optimizeEmptyElementals for BufferizeHLFIR pass. #124982

vzakhari · 2025-01-29T20:37:34Z

Enable the option under opt-for-speed. Elementals with shapes like `(0, HUGE)` should run faster.

llvmbot · 2025-01-29T20:38:09Z

@llvm/pr-subscribers-flang-fir-hlfir

Author: Slava Zakharin (vzakhari)

Changes

Enable the option under opt-for-speed. Elementals with shapes
like (0, HUGE) should run faster.

Full diff: https://github.com/llvm/llvm-project/pull/124982.diff

1 Files Affected:

(modified) flang/lib/Optimizer/Passes/Pipelines.cpp (+9-1)

diff --git a/flang/lib/Optimizer/Passes/Pipelines.cpp b/flang/lib/Optimizer/Passes/Pipelines.cpp
index 1cc3f0b81c20ad..d55ad9e603ffaf 100644
--- a/flang/lib/Optimizer/Passes/Pipelines.cpp
+++ b/flang/lib/Optimizer/Passes/Pipelines.cpp
@@ -245,7 +245,15 @@ void createHLFIRToFIRPassPipeline(mlir::PassManager &pm, bool enableOpenMP,
   }
   pm.addPass(hlfir::createLowerHLFIROrderedAssignments());
   pm.addPass(hlfir::createLowerHLFIRIntrinsics());
-  pm.addPass(hlfir::createBufferizeHLFIR());
+
+  hlfir::BufferizeHLFIROptions bufferizeOptions;
+  // For opt-for-speed, avoid running any of the loops resulting
+  // from hlfir.elemental lowering, if the result is an empty array.
+  // This helps to avoid long running loops for elementals with
+  // shapes like (0, HUGE).
+  if (optLevel.isOptimizingForSpeed())
+    bufferizeOptions.optimizeEmptyElementals = true;
+  pm.addPass(hlfir::createBufferizeHLFIR(bufferizeOptions));
   // Run hlfir.assign inlining again after BufferizeHLFIR,
   // because the latter may introduce new hlfir.assign operations,
   // e.g. for copying an array into a temporary due to
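
To illustrate why a shape like `(0, HUGE)` can be slow, here is a minimal standalone sketch in plain C++. It is not flang's implementation: `Shape2D`, `outerIterations`, and `clampExtentsIfEmpty` are hypothetical names made up for this example. It assumes the loop nest iterates the first extent innermost (Fortran column-major order) and that the optimization amounts to selecting zero for every extent once any extent is zero, which is consistent with the "selects" mentioned in the discussion but is an assumption here.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical 2-D shape in Fortran order: extents[0] varies fastest.
using Shape2D = std::array<std::int64_t, 2>;

// Counts how many times the outer loop body is entered for a loop nest
// whose inner loop runs over extents[0] and outer loop over extents[1].
// For a shape like (0, HUGE) the outer loop still spins HUGE times even
// though no element is ever computed.
std::int64_t outerIterations(const Shape2D &extents) {
  std::int64_t outer = 0;
  for (std::int64_t j = 0; j < extents[1]; ++j) {
    ++outer;
    for (std::int64_t i = 0; i < extents[0]; ++i) {
      // An element computation would go here.
    }
  }
  return outer;
}

// Assumed form of the guard: if any extent is zero the array is empty,
// so every extent is replaced by zero (one select per extent).
Shape2D clampExtentsIfEmpty(const Shape2D &extents) {
  assert(extents[0] >= 0 && extents[1] >= 0);
  const bool isEmpty = extents[0] == 0 || extents[1] == 0;
  return {isEmpty ? 0 : extents[0], isEmpty ? 0 : extents[1]};
}
```

With the clamped extents, the outer loop of an empty result exits immediately, while non-empty shapes pass through unchanged at the cost of a compare and a select per extent.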

vzakhari · 2025-01-29T20:40:25Z

An x86 performance run showed some fluctuations on fatigue2 and cactusADM, but they look like noise to me. The patch is not triggered in cactusADM at all. In fatigue2, the selects are inserted in one spot, which shifts the instruction addresses, but otherwise the code looks the same.

jeanPerier

Thanks

tblah

No change to spec2017 on aarch64

[flang] Enable optimizeEmptyElementals for BufferizeHLFIR pass.

825a4c2

Enable the option under opt-for-speed. Elementals with shapes like `(0, HUGE)` should run faster.

vzakhari requested review from tblah and jeanPerier January 29, 2025 20:37

llvmbot added the flang and flang:fir-hlfir labels Jan 29, 2025

jeanPerier approved these changes Jan 30, 2025


tblah approved these changes Jan 30, 2025


vzakhari merged commit 81f5098 into llvm:main Jan 30, 2025
11 checks passed
