-
Notifications
You must be signed in to change notification settings - Fork 787
[SYCL][E2E] Conditionally use -fsycl-embed-ir
flags in KernelFusion
e2e tests
#14249
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
@intel/dpcpp-kernel-fusion-reviewers, this change makes me wonder: why do we report a warning in the first place? If I get it right, users have to add @mdtoguchi, FYI. |
Hey, too add on to this: I did some limited testing on a machine with a CUDA device, and the tests seem to pass with or without the compile flag. I was unsure whether I should have removed it, but I went through with making it optional instead because of this mention in the docs. |
I hope @intel/dpcpp-kernel-fusion-reviewers team can say which tests require this option and where we can drop it. |
Embedding the IR by default would have a notable impact on binary size, which is particularly relevant for libraries. As the kernel fusion feature is only an experimental feature, we therefore decided against embedding by default. Generally, I would assume that the tests that specify the flag also need it, but we will investigate the details. An alternative approach to this PR would be to make sure that the warning isn't printed in the clang frontend. This would not only result in the tests not printing it anymore, but would fix the warning for all users. |
I investigated this, on my machine, 25 tests failed as intended for the CUDA backend when the embed IR flag was not correctly defined. However, I used the opportunity to improve some of the tests and drop the flag from some tests that don't require it: #14366 |
-fsycl-embed-ir
flag is only used when building theKernelFusion
tests if there is a cuda or hip device. This removes the "argument unused during compilation" warning when running this test on other platforms.