ENH: Array API dispatching #2096

samir-nasibli · 2024-10-08T22:45:04Z

Description

This PR enables the interface for scikit-learn-intelex estimators to dispatch Array API inputs to oneDAL. This removes the need to move data to host and back in the case of on-device data (see the need behind use_raw_input) in properly enabled estimators. This PR will enable array_api_dispatching for sklearn >=1.2 . As it is experimental in sklearn versions, the direct use of sklearn functions may fail (see Ensemble estimators' apply). Scikit-learn 1.2 does not have functional array checking with array_api_dispatch, so sklearnex/onedal array_api support with zero-copy will begin in earnest with sklearn 1.3. Enablement of the zero-copy will be for sklearn versions >=1.3 going forward.

To accomplish this, the following related changes had to be made

A new object oneDALEstimator is created in base.py which combines the capabilities of PatchableEstimator and ExtensionEstimator. All sklearnex estimators must inherit this before sklearn's BaseEstimator in the mro. This is enforced by testing in test_common.py. This is needed in order to provide a tag for signifying an Array_API supported sklearnex estimator without cluttering the inheritance of the estimators. Note: as a consequence the HTML docs support is limited to the range of sklearn versions which are also supported (>=1.4), and is the logic is rewritten to minimize maintenance. This required changes to TSNE as well.
A function/class get_tags is introduced to check for sklearn's tags and to minimize maintenance. This duck-types the capability of recent (>=1.6) changes to the tag system to older sklearn versions, simplifying the code. This is needed to check for array_api support and onedal array_api support via the tag system with minimal maintenance for older versions.
All sklearnex estimators are modified to inherit the oneDALEstimator
The code for sklearnex/_device_offload.py is refactored. This new logic changes the cumbersome passing of strings from _get_backend into a boolean flag. A second checkover of inputs was removed from that function.
The core function dispatch is extensively refactored in order to check for array_api support. The logic there is of particular note due to the increase in branching. It attempts at all times to minimize the checking or movement of data unless necessary, therefore requiring checking array_api support first (as this is data-movement free).
tests test_standard_estimator_patching and test_special_estimator_patching now reference a combined _check_estimator_patching function to reduce code duplication. test_standard_estimator_patching now includes array_api dispatching for the array-api-strict data type. Changes coming in dpnp 0.17 and dpctl 0.19 will make array_api_dispatch a necessity, requiring that for dataframe support for non-numpy, non-pandas inputs going forward. This is a significant task and must be done separately and also meaning further modification to the test suite.
Standard estimator testing with array_api_strict inputs are now done with config_context(array_api_dispatch=True) which verifies proper operation of sklearnex patched methods. This required changes to KMeans.transform, IncrementalEmpiricalCovariance.mahalanobis, IncrementalEmpiricalCovariance.score, and KMeans.fit_transform beyond the changes to the standard dispatcher.
a new test test_fallback_to_host is added due to the complexity of dispatch. This validates that the various scenarios with offloading + fallbacks place the data properly while still preserving the queue

The following bugs needed to be solved to get things working properly:

is_dpctl_device_available calls from test_memory_usage need to be a list input, not a string.
checks on transform_output used in _device_offload support_input_format and wrap_output_data need to check for default and not None, the array_api logic should only apply for non-numpy inputs, but should be able to convert numpy results. previously this logic was always skipped
support for allow_fallback_to_host is corrected for when used in conjunction with target_offload
_asarray does not work for older numpy versions, as arrays do not have the __array_namespace__ attribute. Following the guideline from the array_api standard, a check for the buffer protocol is added in line to match: https://github.com/data-apis/array-api-strict/blob/25cc3d7ff0069b222d228d380e6d95bbf9a5dbcf/array_api_strict/_creation_functions.py#L50
test_run_to_run_stability includes a branch for .dtype-having objects which will then flatten, as array_api_strict arrays are very picky with the use of __iter__. This will simplify the analysis of 2d-array types, which have otherwise split them unnecessarily into 1d arrays for individual analysis.
Add missing doclink support for ElasticNet and Lasso

Additional cleanup changes

use of the import of inspect for just inspect.signature in test_patching is removed
An unnecessary base class is removed from LogisticRegression, as cleanup to the inheritance is undertaken in this PR.
creation of get_tags in sklearnex.utils makes the function get_array_api_support_tag unnecessary in sklearnex/_device_offload and is removed. It also attempts to match sklearn public API calls for simplicity in development.
Inheritance in BaseSVM of BaseEstimator is removed, as this is inherited from the various sklearn SVM algorithms in inheritance
The logic for use_raw_input in dispatch is inverted and placed at the beginning for simplicity in analysis at no cost in computational time.

As none of the onedal/ estimators are array_api enabled yet, the code is not fully tested, and additional PRs will be necessary to properly test and verify array_api support. These are not simple PRs and are to be included later before any estimator becomes array_api enabled. The initial testing should be sufficient for simple array_api support verification, and prevent non-functional changes for array_api support from entering the codebase.

Follow up work must be made for support extracting queues from array_api inputs which comply with the new sycl_queue management system. As of this PR, it will not properly understand array_api devices. This will modify a separate file and must be a follow-on PR.

It is likely that this PR needs to be separated based on individual themes contained within that were necessary to enable the dispatching. Help from the reviewers is greatly desired (and can be based solely on the description).

NOTE: The expansion of testing for array_api to use config_context(array_api_dispatch=True) has uncovered some bugs which were heretofore unnoticed. 1st the array_api dispatching of IncrementalEmpiricalCovariance.score for sklearn <1.3 is not possible, and has been deselected. 2nd PCA.score_sample and PCA.score do not currently work as the return values from PCA.fit (._precision and ._mean) need to be the same type as the input X values. As of now, the return values and estimator attributes are not properly checked for type matching, and conversions to non-numpy types from oneDAL is inconsistently implemented. This is a significant undertaking and must be done as a follow-up with the individual array_api support in oneDAL while maintaining quality code. This will also require the completion of __dlpack__ support to oneDAL table types, such that from_table can return all array_api types.

Checklist to comply with before moving PR from draft:

PR completeness and readability

I have reviewed my changes thoroughly before submitting this pull request.
I have commented my code, particularly in hard-to-understand areas.
I have updated the documentation to reflect the changes or created a separate PR with update and provided its number in the description, if necessary.
Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details).
I have added a respective label(s) to PR if I have a permission for that.
I have resolved any merge conflicts that might occur with the base branch.

Testing

I have run it locally and tested the changes extensively.
All CI jobs are green or I have provided justification why they aren't.
I have extended testing suite if new functionality was introduced in this PR.

Performance

I have measured performance for affected algorithms using scikit-learn_bench and provided at least summary table with measured data, if performance change is expected.
I have provided justification why performance has changed or why changes are not expected.
I have provided justification why quality metrics have changed or why changes are not expected.
I have extended benchmarking suite and provided corresponding scikit-learn_bench PR if new measurable functionality was introduced in this PR.

added array-api-compat to test env

mergify · 2024-10-08T22:45:43Z

⚠️ The sha of the head commit of this PR conflicts with #2079. Mergify cannot evaluate rules on this PR. ⚠️

samir-nasibli · 2024-10-08T23:18:19Z

sklearnex/utils/_array_api.py

+        # Stricter NumPy-based Array API implementation. The
+        # array_api_strict.Array instances always have a dummy "device" attribute.
+        "array_api_strict",
+        "dpctl.tensor",


Should be reverted, since stock scikit-learn testing not well designed for having input data being located on single target device.

samir-nasibli · 2024-10-08T23:18:36Z

sklearnex/utils/_array_api.py

+        yield array_namespace
+
+
+def yield_namespace_device_dtype_combinations(include_numpy_namespaces=True):


just for testing were added. Will be removed due to https://github.com/intel/scikit-learn-intelex/pull/2096/files#r1792608548

added versioning for the get_nnamespace

…m/samir-nasibli/scikit-learn-intelex into enh/array_api_dispatching_testing

sklearnex/utils/_array_api.py

sklearnex/utils/tests/test_array_api.py

Co-authored-by: david-cortes-intel <david.cortes@intel.com>

icfaust · 2025-04-17T08:58:37Z

/intelci: run

sklearnex/base.py

Co-authored-by: david-cortes-intel <david.cortes@intel.com>

icfaust · 2025-04-17T13:17:28Z

/intelci: run

sklearnex/_utils.py

sklearnex/svm/_common.py

ahuber21

@icfaust Just a few comments. Please ping me if you need explicit approval.

onedal/_device_offload.py

ahuber21 · 2025-04-17T13:12:45Z

sklearnex/base.py

+        onedal_array_api: bool = False
+
+
+class oneDALEstimator:


Great idea!

ahuber21 · 2025-04-17T13:15:43Z

sklearnex/tests/test_memory_usage.py

@@ -296,7 +296,7 @@ def test_memory_leaks(estimator, dataframe, queue, order, data_shape):


 @pytest.mark.skipif(
-    os.getenv("ZES_ENABLE_SYSMAN") is None or not is_dpctl_device_available("gpu"),
+    os.getenv("ZES_ENABLE_SYSMAN") is None or not is_dpctl_device_available(["gpu"]),


Oh what a fail, glad you caught this.
-> #2438

sklearnex/_device_offload.py

Co-authored-by: Andreas Huber <9201869+ahuber21@users.noreply.github.com> Co-authored-by: david-cortes-intel <david.cortes@intel.com>

icfaust · 2025-04-17T13:37:47Z

/intelci: run

icfaust · 2025-04-17T13:53:35Z

http://intel-ci.intel.com/f01b91e9-a110-f123-b04f-a4bf010d0e2d

ethanglaser · 2025-04-17T15:21:25Z

/intelci: run

icfaust · 2025-04-17T15:36:32Z

NOTE TO REVIEWERS: I have patched in a CI change in order to get CIs to run, #2440 must be brought in first, and the changes to the CI .yml must be removed

.ci/pipeline/build-and-test-lnx.yml

ethanglaser · 2025-04-17T16:44:38Z

onedal/utils/_array_api.py

 def _asarray(data, xp, *args, **kwargs):
    """Converted input object to array format of xp namespace provided."""
-    if hasattr(data, "__array_namespace__"):
+    if hasattr(data, "__array_namespace__") or _supports_buffer_protocol(data):


should it be an or or an and?

Needs to be an or in this case (meaning it will work for numpy or other array api formats)

ethanglaser · 2025-04-17T17:32:24Z

CI is now green and no objections from my side

david-cortes-intel · 2025-04-18T10:44:35Z

/intelci: run

samir-nasibli added 11 commits October 2, 2024 03:20

ENH: array api dispatching

8bedde1

added array-api-compat to test env

Deselect some scikit-learn Array API tests

b11fcf3

Merge branch 'intel:main' into enh/array_api_dispatching

467634a

Merge branch 'intel:main' into enh/array_api_dispatching

31030f7

deselect more tests

943796e

deselect more tests

ef42daa

disabled tests for

3bc755d

fix the deselection comment

76f1876

disabled test for Ridge regression

ce0b8e1

Disabled tests and added comment

404e8c0

ENH: Array API dispatching

ced43bf

samir-nasibli mentioned this pull request Oct 8, 2024

ENH: DBSCAN via Array API #2014

Closed

10 tasks

samir-nasibli commented Oct 8, 2024

View reviewed changes

samir-nasibli added 9 commits October 9, 2024 09:19

Merge branch 'intel:main' into enh/array_api_dispatching_testing

968365f

Revert adding dpctl into Array PI conformance testing

c395d03

added versioning for the get_nnamespace

Merge branch 'enh/array_api_dispatching_testing' of https://github.co…

9271479

…m/samir-nasibli/scikit-learn-intelex into enh/array_api_dispatching_testing

minor refactoring onedal _array_api

5784c25

add tests

8d7f664

addressed memory usage tests

63d8f30

Address some array api test fails

6bd0280

linting

90411e7

addressed test_get_namespace

2b7bbc5

samir-nasibli commented Oct 9, 2024

View reviewed changes

sklearnex/utils/_array_api.py Outdated Show resolved Hide resolved

samir-nasibli commented Oct 9, 2024

View reviewed changes

sklearnex/utils/tests/test_array_api.py Outdated Show resolved Hide resolved

samir-nasibli added 4 commits October 9, 2024 14:45

adding test case for validate_data check with Array API inputs

b7b8f03

minor refactoring

169009d

addressed test_patch_map_match fail

9ca118c

Added docstrings for get_namespace

7ddcf40

icfaust and others added 5 commits April 16, 2025 15:19

Apply suggestions from code review

b655f13

Co-authored-by: david-cortes-intel <david.cortes@intel.com>

make suggested changes

fa2777b

fix import in test_patching

1ab6cdf

Update test_patching.py

9dd23a8

Update build-and-test-lnx.yml

cc7e100

icfaust requested review from napetrov and homksei as code owners April 17, 2025 05:41

icfaust added 2 commits April 17, 2025 08:22

Update build-and-test-lnx.yml

f97fdfe

Update build-and-test-lnx.yml

65e8823

david-cortes-intel reviewed Apr 17, 2025

View reviewed changes

sklearnex/base.py Outdated Show resolved Hide resolved

sklearnex/base.py Outdated Show resolved Hide resolved

Apply suggestions from code review

b387106

Co-authored-by: david-cortes-intel <david.cortes@intel.com>

david-cortes-intel reviewed Apr 17, 2025

View reviewed changes

sklearnex/_utils.py Outdated Show resolved Hide resolved

sklearnex/svm/_common.py Outdated Show resolved Hide resolved

ahuber21 reviewed Apr 17, 2025

View reviewed changes

icfaust reviewed Apr 17, 2025

View reviewed changes

sklearnex/_device_offload.py Outdated Show resolved Hide resolved

icfaust and others added 2 commits April 17, 2025 15:24

Apply suggestions from code review

42141eb

Co-authored-by: Andreas Huber <9201869+ahuber21@users.noreply.github.com> Co-authored-by: david-cortes-intel <david.cortes@intel.com>

formatting and ABC change

131625f

fix gpu test

aee46bc

ethanglaser reviewed Apr 17, 2025

View reviewed changes

david-cortes-intel added 2 commits April 18, 2025 12:43

Merge branch 'main' into enh/array_api_dispatching_testing

fbdd745

remove workaround

f364637

david-cortes-intel approved these changes Apr 18, 2025

View reviewed changes

icfaust merged commit 43596ee into uxlfoundation:main Apr 22, 2025
28 checks passed

		yield array_namespace


		def yield_namespace_device_dtype_combinations(include_numpy_namespaces=True):

ENH: Array API dispatching #2096

ENH: Array API dispatching #2096

Uh oh!

Conversation

samir-nasibli commented Oct 8, 2024 • edited by icfaust Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

mergify bot commented Oct 8, 2024

Uh oh!

samir-nasibli Oct 8, 2024

Choose a reason for hiding this comment

Uh oh!

samir-nasibli Oct 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

icfaust commented Apr 17, 2025

Uh oh!

Uh oh!

Uh oh!

icfaust commented Apr 17, 2025

Uh oh!

Uh oh!

Uh oh!

ahuber21 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ahuber21 Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

ahuber21 Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

icfaust commented Apr 17, 2025

Uh oh!

icfaust commented Apr 17, 2025

Uh oh!

ethanglaser commented Apr 17, 2025

Uh oh!

icfaust commented Apr 17, 2025

Uh oh!

Uh oh!

ethanglaser Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

icfaust Apr 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ethanglaser commented Apr 17, 2025

Uh oh!

david-cortes-intel commented Apr 18, 2025

Uh oh!

Uh oh!

Uh oh!

samir-nasibli commented Oct 8, 2024 •

edited by icfaust

Loading

samir-nasibli Oct 8, 2024 •

edited

Loading

ahuber21 left a comment •

edited

Loading

icfaust Apr 17, 2025 •

edited

Loading