Commit d3d7ef6

Author: Diptorup Deb
Optimize the kernel module to inline the actual kernel into wrapper.
1 parent f666abe commit d3d7ef6

2 files changed: +2 -0 lines changed

numba_dpex/core/codegen.py

Lines changed: 1 addition & 0 deletions
@@ -35,6 +35,7 @@ def _optimize_final_module(self):
 pmb.opt_level = config.OPT

 pmb.disable_unit_at_a_time = False
+pmb.inlining_threshold = 2
 pmb.disable_unroll_loops = True
 pmb.loop_vectorize = False
 pmb.slp_vectorize = False

numba_dpex/core/kernel_interface/spirv_kernel.py

Lines changed: 1 addition & 0 deletions
@@ -136,6 +136,7 @@ def compile(
 kernel = cres.target_context.prepare_ocl_kernel(
     func, cres.signature.args
 )
+cres.library._optimize_final_module()
 self._llvm_module = kernel.module.__str__()
 self._module_name = kernel.name
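
Note on ordering: the new call is placed before `kernel.module.__str__()`, which suggests the optimization must run before the module is serialized to text. The toy sketch below illustrates that idea only. It assumes the optimization pass mutates the module in place. `ToyModule` and `toy_inline_pass` are hypothetical stand-ins, not numba_dpex or llvmlite APIs, and the real LLVM inliner uses a cost model that is not reproduced here.

```python
class ToyModule:
    """Hypothetical stand-in for a mutable IR module (not an llvmlite type)."""

    def __init__(self, functions):
        # Map of function name -> list of instruction strings.
        self.functions = {name: list(body) for name, body in functions.items()}

    def __str__(self):
        # Serialize the module's current state to text.
        chunks = []
        for name, body in self.functions.items():
            chunks.append(f"define {name}:")
            chunks.extend(f"  {inst}" for inst in body)
        return "\n".join(chunks)


def toy_inline_pass(module, caller, callee):
    """Hypothetical in-place pass: splice the callee's body into the caller."""
    new_body = []
    for inst in module.functions[caller]:
        if inst == f"call {callee}":
            new_body.extend(module.functions[callee])
        else:
            new_body.append(inst)
    module.functions[caller] = new_body


module = ToyModule({
    "kernel": ["load", "add", "store"],
    "wrapper": ["call kernel", "ret"],
})

early_text = str(module)  # serialized before the pass: wrapper still calls kernel
toy_inline_pass(module, "wrapper", "kernel")
late_text = str(module)   # serialized after the pass: kernel body is in the wrapper
```

A string produced before the pass is a frozen snapshot, so only text serialized after the pass reflects the inlined wrapper. That is the presumed reason the call sits one line above the `__str__()` assignment.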
