mlir/lib/Dialect/GPU/Transforms: improve context management in SerializeToCubin #65779

rohany · 2023-09-08T17:03:37Z

This commit adjusts the CUDA context management in the SerializeToCubin pass. In particular, it uses the device 0 primary context instead of creating a new CUDA context on each invocation of SerializeToCubin. This yields very large improvements in compile time, especially if an application (like a JIT compiler) is calling SerializeToCubin repeatedly.

Differential Revision: https://reviews.llvm.org/D159487

fabianmcg

The patch itself LGTM, however, we are on the process of deprecating SerializeToCubin in favor of Target Attributes. I'm introducing today deprecation notices. So I'm -1 on improving the existing passes, in my opinion all new efforts should focus on the new mechanism. However, don't know if @joker-eph or someone else has a different opinion.

joker-eph · 2023-09-08T23:22:56Z

The patch is small enough that it seems worthwhile to take in, I would just want to make sure we don't diverge from the lowering done through the new flow: do we need to replicate this somewhere as well @fabianmcg ?

fabianmcg · 2023-09-08T23:35:53Z

Currently no, as we don't invoke the driver. However, I was thinking on adding a compilation path to stop at PTX and let the driver JIT the code at runtime, I only need to do some small updates, so maybe then.

joker-eph

@rohany LG but please acknowledge that this pass is on it's way of deprecation.

rohany · 2023-09-09T17:34:22Z

LG but please acknowledge that this pass is on it's way of deprecation.

That's fine -- i'm currently using it (as part of the reference pipeline), so I am incentivized to make it faster.

I don't understand the CI failure, can I get some help with that?

joker-eph · 2023-09-09T21:00:10Z

It's an infra failure, feel free to ignore

joker-eph · 2023-09-09T21:01:34Z

Actually your PR is not rebased, seems like you're based on a commit from May!

…izeToCubin This commit adjusts the CUDA context management in the SerializeToCubin pass. In particular, it uses the device 0 primary context instead of creating a new CUDA context on each invocation of SerializeToCubin. This yields very large improvements in compile time, especially if an application (like a JIT compiler) is calling SerializeToCubin repeatedly. Differential Revision: https://reviews.llvm.org/D159487

rohany · 2023-09-09T21:29:46Z

Thanks, fixed it.

xgupta · 2023-10-20T17:20:28Z

@rohany Do you need help to commit this change?

rohany · 2023-10-20T17:23:11Z

Yes, i don't know how to get it to land, given that tests pass + accepted review.

Local branch amd-gfx 319c66a Merged main:080fb3e5b73b into amd-gfx:7c4daea7af99 Remote branch main 71bdd2c mlir/lib/Dialect/GPU/Transforms: improve context management in SerializeToCubin (llvm#65779)

rohany requested a review from a team as a code owner September 8, 2023 17:03

joker-eph requested a review from fabianmcg September 8, 2023 18:18

fabianmcg reviewed Sep 8, 2023

View reviewed changes

joker-eph approved these changes Sep 9, 2023

View reviewed changes

rohany force-pushed the serialize-cubin-context-management branch from 91bcd17 to fb0a003 Compare September 9, 2023 21:04

llvmbot added mlir:core MLIR Core Infrastructure mlir:gpu mlir labels Sep 9, 2023

rohany force-pushed the serialize-cubin-context-management branch from fb0a003 to 5e1a41b Compare September 9, 2023 21:07

xgupta merged commit 71bdd2c into llvm:main Oct 20, 2023

spupyrev mentioned this pull request Oct 23, 2023

[BOLT] Rename cds to cdsort #69966

Merged

banach-space mentioned this pull request Oct 24, 2023

[mlir][vector] Add scalable vectors to tests for vector.contract #70039

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

mlir/lib/Dialect/GPU/Transforms: improve context management in SerializeToCubin #65779

mlir/lib/Dialect/GPU/Transforms: improve context management in SerializeToCubin #65779

Uh oh!

rohany commented Sep 8, 2023

Uh oh!

fabianmcg left a comment

Uh oh!

joker-eph commented Sep 8, 2023

Uh oh!

fabianmcg commented Sep 8, 2023

Uh oh!

joker-eph left a comment

Uh oh!

rohany commented Sep 9, 2023

Uh oh!

joker-eph commented Sep 9, 2023

Uh oh!

joker-eph commented Sep 9, 2023

Uh oh!

rohany commented Sep 9, 2023

Uh oh!

xgupta commented Oct 20, 2023

Uh oh!

rohany commented Oct 20, 2023

Uh oh!

Uh oh!

mlir/lib/Dialect/GPU/Transforms: improve context management in SerializeToCubin #65779

mlir/lib/Dialect/GPU/Transforms: improve context management in SerializeToCubin #65779

Uh oh!

Conversation

rohany commented Sep 8, 2023

Uh oh!

fabianmcg left a comment

Choose a reason for hiding this comment

Uh oh!

joker-eph commented Sep 8, 2023

Uh oh!

fabianmcg commented Sep 8, 2023

Uh oh!

joker-eph left a comment

Choose a reason for hiding this comment

Uh oh!

rohany commented Sep 9, 2023

Uh oh!

joker-eph commented Sep 9, 2023

Uh oh!

joker-eph commented Sep 9, 2023

Uh oh!

rohany commented Sep 9, 2023

Uh oh!

xgupta commented Oct 20, 2023

Uh oh!

rohany commented Oct 20, 2023

Uh oh!

Uh oh!