[LLVM][Parser] Check invalid overload suffix for intrinsics #108315

jurahul · 2024-09-12T00:59:54Z

No description provided.

nikic

Won't this break the remangling upgrade?

jurahul · 2024-09-12T11:20:00Z

Maybe. I didn't know what that was until now. I understood auto-upgrade in the bitcode reader context, but did not know the asm parser supported it as well. So currently it looks like we allow any suffix when parsing (and essentially ignore it), so that old versions of intrinsics with, say, different types or a different mangling scheme still get parsed and then auto-upgraded (since the correct mangling can always be constructed from the types in the IR). And there is no check that the ignored suffix was a valid suffix in one of the older versions of the intrinsic.

It seems to me this check could still be useful in some cases. For example, I see ~113 test failures in LLVM due to this, and it looks like a lot of them are just typos that get glossed over by the parser. Maybe we can add a strict-intrinsic-overload-mangle mode to llvm-as (and, by extension, to LLParser and LLVM's assembly parsing API)? The API would disable it by default, so existing users of the API (upstream or downstream) are unaffected. llvm-as would enable it by default, so LLVM lit tests can be strict. Or the LIT testing infra could enable the strict mode for llvm-as and other tools, with a way to disable it for specific tests. That would ensure that we don't unintentionally let bad IR sneak in and stay unflagged.
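To make the lenient-versus-strict distinction concrete, here is a minimal toy sketch in C++. It does not use LLVM and is not the code in this PR: `expectedSuffix`, `resolveIntrinsicName`, and the `Strict` flag are hypothetical names, the mangling is simplified to joining pre-mangled type strings with dots, and accepting an omitted suffix in strict mode is an assumption of the sketch.

```cpp
#include <optional>
#include <string>
#include <vector>

// Toy model (not LLVM's API): build the overload suffix by joining the
// already-mangled type strings, e.g. {"p0", "i64"} -> ".p0.i64".
std::string expectedSuffix(const std::vector<std::string> &OverloadTys) {
  std::string Suffix;
  for (const std::string &Ty : OverloadTys) {
    Suffix += '.';
    Suffix += Ty;
  }
  return Suffix;
}

// Hypothetical parser hook. BaseName is the intrinsic name without any
// overload suffix; WrittenName is the name as spelled in the input.
// Returns the canonical (correctly mangled) name, or std::nullopt on error.
std::optional<std::string>
resolveIntrinsicName(const std::string &BaseName,
                     const std::string &WrittenName,
                     const std::vector<std::string> &OverloadTys,
                     bool Strict) {
  // The written name must start with the base name.
  if (WrittenName.compare(0, BaseName.size(), BaseName) != 0)
    return std::nullopt;

  // Whatever follows the base name must be empty or a dot-separated suffix.
  const std::string WrittenSuffix = WrittenName.substr(BaseName.size());
  if (!WrittenSuffix.empty() && WrittenSuffix[0] != '.')
    return std::nullopt;

  // The canonical name is rebuilt from the types alone.
  const std::string Canonical = BaseName + expectedSuffix(OverloadTys);

  // Lenient mode ignores the written suffix. Strict mode (an assumption of
  // this sketch) accepts only an omitted suffix or an exact match.
  if (Strict && !WrittenSuffix.empty() && WrittenName != Canonical)
    return std::nullopt;

  return Canonical;
}
```

In this sketch, a float overload spelled with `.f16` resolves silently to the `.f32` name in lenient mode and is rejected in strict mode, which is the kind of typo the proposed mode is meant to catch.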

WDYT? If this seems reasonable, I can also start a discourse thread if needed.

jurahul · 2024-09-12T11:25:51Z

Additionally, if mangling can always be reconstructed, maybe LLVM should always elide it? I know that's a bigger change, but it may be worth considering. That means LLVM assembly syntax won't require mangling (it effectively does not today either, since the suffix is ignored), but in strict mode LLVM assembly for overloaded intrinsics will have no mangling (we assert that Suffix = "" in the code above) and the ASM printer will also hide the mangling. In-memory IR (i.e., Function objects) will still have mangled names (that's required, I think, as two different GlobalValues cannot have the same name).

jurahul · 2024-09-12T11:37:18Z

In fact, changing the ASM writer to drop the mangling in a call instruction is easy enough, and that output will continue to get parsed. We would just need the parser to allow declaring overloaded intrinsics with the same name and different arg/ret types.

nikic · 2024-09-12T11:49:01Z

We no longer require mangling suffixes for intrinsic calls since #89172.

I don't think we should omit the mangling suffix when writing IR though, since that will obscure what actually happens without clear benefit. (For example, it would become very hard to actually figure out whether the correct mangling suffix was chosen or not.)

Can you please explain what the actual problem you are trying to solve is?

jurahul · 2024-09-12T12:01:00Z

The problem I ran into was literally this: I was writing a unit test using overloaded intrinsics and observed that no one complained if I messed up the mangling suffix (use .f16 for a float variant for instance), and that motivated this change.

I agree that having mangling in the IR is good. Maybe a --strict-intrinsic-overloads mode in various tools (llvm-as and opt might be good enough), disabled by default and auto-enabled by the LIT testing framework, is a good middle ground: it does not break existing code, and it does not allow a bad mangling suffix when one is not intended?

I am actually not sure what happens if someone in C++ calls Function::Create() to create an intrinsic declaration with a bad mangling suffix. I'd expect that to be strict, but I'm not sure if it is (maybe the IR verifier checks this?)

It does:

  const std::string ExpectedName =
      Intrinsic::getName(ID, ArgTys, IF->getParent(), IFTy);
  Check(ExpectedName == IF->getName(),
        "Intrinsic name not mangled correctly for type arguments! "
        "Should be: " +
            ExpectedName,
        IF);

jurahul · 2024-09-12T12:09:48Z

BTW, thanks for proactively looking at the PR!

[LLVM][Parser] Check invalid overload suffix for intrinsics

ab5c255

nikic reviewed Sep 12, 2024

jurahul mentioned this pull request Sep 20, 2024

[LLVM][TableGen] Add overloaded intrinsic name conflict checks #109314

Closed
