-
Notifications
You must be signed in to change notification settings - Fork 790
[SYCL] Do module cleanup in sycl-post-link even if there were no splits #10863
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
This allows for unused symbol removal that would happen if there were splits. Signed-off-by: Sarnie, Nick <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not really an expert in the modified code. Can we have ; CHECK: _ZTSZ4mainEUlT_E0_
to give me a hint, that the change won't affect kernels?
@sarnex Could you please give a clue why |
@maksimsab sure, it's because those globals are now getting optimized out by globalDCE. the point of those tests is to test something else, so i changed the linkage as to maintain the purpose of the test. note that if there were any splits (like if they used fp64), they would already be optimized out in the splits, so it's just because there happen to be no splits that they weren't optimized. @MrSidims sure, i will add this |
Signed-off-by: Sarnie, Nick <[email protected]>
@AlexeySachkov Hey, do you mind taking a look at this one to make sure it's not insane? Thanks |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks Nick. LGTM. Seems like something that was missed earlier.
One question: if we are now doing cleanup in ALL cases, can we just call the cleanup routine in one site (in sycl-post-link, right after calling getDeviceCodeSplitter)?
I will approve nevertheless. This is just something that you can address here if possible.
Thanks
@asudarsa We should be able to do that but I will do it in a follow up change if that's okay. |
@intel/llvm-gatekeepers can we merge please? the original cuda failure is not related and i tried to rerun it but it seems no runners picked it up
|
@intel/llvm-gatekeepers CUDA passed on rerun so all CI is passing now, can we merge? Thanks for rerunning the CI! |
@asudarsa It looks like even if we move the |
Ah. Ok.....Thanks for considering it Nick. It was again not a PR blocker. Thanks again for fixing this. |
This allows for unused symbol removal through GlobalDCE that would happen if there were splits. Right now we have a mismatch where split modules have unused symbols removed but copied modules (no splits) do not.