Refactor/kernel interfaces #804

diptorupd · 2022-10-18T03:59:01Z

diptorupd · 2022-11-09T03:23:38Z

@mingjie-intel @chudur-budur All existing tests except caching work with the new API. I have updated the description to capture pending TODOs. Any early feedback will be very helpful.

numba_dpex/core/kernel_interface/spirv_kernel.py

chudur-budur · 2022-11-10T00:10:59Z

numba_dpex/core/kernel_interface/spirv_kernel.py

+            extra_compile_flags=extra_compile_flags,
+        )
+
+        self._target_context = cres.target_context


@diptorupd Why we are not doing the same as it has been done in _compile()?

numba_dpex/compiler.py

numba_dpex/core/_compile_helper.py

chudur-budur · 2022-12-07T18:08:16Z

numba_dpex/core/kernel_interface/arg_pack_unpacker.py

+                np.copyto(obj._orig_val, obj._packed_val)
+
+    def __init__(
+        self, kernel_name, arg_list, argty_list, access_specifiers_list, queue


I think we need to use consistent variable naming, like pyfunc_name instead of kernel_name

numba_dpex/core/kernel_interface/dispatcher.py

chudur-budur · 2022-12-07T18:13:45Z

numba_dpex/core/kernel_interface/dispatcher.py

+        compile_flags=None,
+        array_access_specifiers=None,
+    ):
+        self.typingctx = dpex_target.typing_context


dpex_target is a global object coming from an external module, why do we always keep it in some local variable? Is there any specific reason for this?

chudur-budur · 2022-12-07T18:19:25Z

numba_dpex/core/kernel_interface/func.py

+    )
+    func = cres.library.get_function(cres.fndesc.llvm_func_name)
+    cres.target_context.mark_ocl_device(func)
+    devfn = DpexFunction(cres)


Do we need to cache compiled func?

numba_dpex/core/kernel_interface/func.py

numba_dpex/decorators.py

github-actions · 2022-12-16T02:39:07Z

Documentation preview: show.

numba_dpex/tests/kernel_tests/test_atomic_op.py

- The compute follows data checking is now based on queue equality. - USMNdArray no longer requires usm_type and device during construction. It allows us to specialize an usm_ndarray only on ndims, layout and dtype. - No check for compute follows data for eager compilation. - Change caching to not require backend and device-type. - Fixes to test cases.

- The DEFAULT_LOCAL_SIZE is deprecated and users warned to provided a valid local range for nd_range kernels. - Removed the global_range and local_range kw args from JitKernel.__call__(). - Undeprecate the JitKernel.__getitem__ call. - Fix and improve how arguments to JitKernel.__call__() are parsed to extract the global_range and local_range.

github-actions · 2023-01-18T22:11:29Z

Documentation preview: show.

diptorupd · 2023-01-18T23:41:39Z

Merging as TeamCity CI is all green.

github-actions · 2023-01-18T23:42:25Z

Documentation preview removed.

Refactor/kernel interfaces 187782d

I don't have a minimal reproducer yet but I can say that this fixes the issues I've had after IntelPython#804

diptorupd requested a review from mingjie-intel as a code owner October 18, 2022 03:59

diptorupd marked this pull request as draft October 18, 2022 03:59

diptorupd force-pushed the refactor/kernel_interfaces branch from 736bff3 to 6b179af Compare October 18, 2022 04:44

diptorupd added this to the 0.19.0 milestone Oct 18, 2022

diptorupd force-pushed the refactor/kernel_interfaces branch from 6b179af to d426eb2 Compare October 18, 2022 19:45

diptorupd mentioned this pull request Oct 20, 2022

State of interoperability with cuda backend IntelPython/dpctl#947

Open

diptorupd force-pushed the refactor/kernel_interfaces branch 4 times, most recently from f489b60 to 921624e Compare October 23, 2022 18:39

diptorupd mentioned this pull request Oct 23, 2022

Refactor of the kernel dispatch API #810

Closed

12 tasks

diptorupd force-pushed the refactor/kernel_interfaces branch from 2d98ded to b2ed65d Compare October 26, 2022 04:38

diptorupd force-pushed the refactor/kernel_interfaces branch 3 times, most recently from f0e4ed0 to 211d5d8 Compare November 9, 2022 02:03

diptorupd requested a review from chudur-budur November 9, 2022 03:28

chudur-budur reviewed Nov 10, 2022

View reviewed changes

diptorupd force-pushed the refactor/kernel_interfaces branch 3 times, most recently from f99df29 to b43201c Compare November 22, 2022 15:45

diptorupd force-pushed the refactor/kernel_interfaces branch 4 times, most recently from 27490c8 to 17676d2 Compare December 7, 2022 17:10

chudur-budur reviewed Dec 7, 2022

View reviewed changes

diptorupd force-pushed the refactor/kernel_interfaces branch 2 times, most recently from 26c4334 to 4656193 Compare December 12, 2022 20:14

mingjie-intel reviewed Dec 16, 2022

View reviewed changes

numba_dpex/tests/kernel_tests/test_atomic_op.py Show resolved Hide resolved

Diptorup Deb and others added 16 commits January 18, 2023 16:04

Use new compiler to compiler parfors.

876d4d2

Fully remove numba_dpex.compiler module.

da1440f

Remove the temporary driver.py file.

1d3a529

Added ndarray setup check.

020da9e

Added tests for ndrange exceptions.

12fa8a3

Add an example for aot kernel specialization.

f341ab2

Fix deprecation warnings.

d275e1d

Add docsstrings.

8150a96

Switched to dpctl.tensor in test_ndrange_exceptions.py.

6ef8212

Rename AOT to eager compilation.

42958f4

Formatting changes to error message.

20a8f7d

Update tests after changes to kernel lauch params.

75f269b

Update kernel examples based on latest changes.

664d57e

Improve dispatcher checks for laych args and add unit test.

c74ea24

diptorupd force-pushed the refactor/kernel_interfaces branch from 637a04d to c74ea24 Compare January 18, 2023 22:04

diptorupd merged commit 187782d into main Jan 18, 2023

diptorupd deleted the refactor/kernel_interfaces branch January 18, 2023 23:41

github-actions bot added a commit that referenced this pull request Jan 18, 2023

Merge pull request #804 from IntelPython/refactor/kernel_interfaces

db50327

Refactor/kernel interfaces 187782d

diptorupd mentioned this pull request Jan 19, 2023

Pass dpnp arrays directly to kernel instead of using dpctl.asarray. #878

Closed

5 tasks

This was referenced Jan 29, 2023

Perormance regressions introduced by latest changes to main #886

Closed

Sum up of dpex.kernel JIT issues #891

Closed

fcharras added a commit to fcharras/numba-dpex that referenced this pull request Feb 3, 2023

Fixing JIT issues after IntelPython#804

ab013af

I don't have a minimal reproducer yet but I can say that this fixes the issues I've had after IntelPython#804

This was referenced Feb 3, 2023

Fixing performance issues after #804 #897

Closed

Minimal reproducer for regression in #804 #898

Closed

fcharras mentioned this pull request Feb 13, 2023

Solving perferomance regression issue by caching the kernel_bundle #896

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor/kernel interfaces #804

Refactor/kernel interfaces #804

diptorupd commented Oct 18, 2022 •

edited

Loading

diptorupd commented Nov 9, 2022

chudur-budur Nov 10, 2022

chudur-budur Dec 7, 2022

chudur-budur Dec 7, 2022

chudur-budur Dec 7, 2022

github-actions bot commented Dec 16, 2022

github-actions bot commented Jan 18, 2023

diptorupd commented Jan 18, 2023

github-actions bot commented Jan 18, 2023

Refactor/kernel interfaces #804

Refactor/kernel interfaces #804

Conversation

diptorupd commented Oct 18, 2022 • edited Loading

diptorupd commented Nov 9, 2022

chudur-budur Nov 10, 2022

Choose a reason for hiding this comment

chudur-budur Dec 7, 2022

Choose a reason for hiding this comment

chudur-budur Dec 7, 2022

Choose a reason for hiding this comment

chudur-budur Dec 7, 2022

Choose a reason for hiding this comment

github-actions bot commented Dec 16, 2022

github-actions bot commented Jan 18, 2023

diptorupd commented Jan 18, 2023

github-actions bot commented Jan 18, 2023

diptorupd commented Oct 18, 2022 •

edited

Loading