[ENH] sklearn 1.6.dev0 adjustments. #335

goraj · 2024-11-11T18:25:01Z

Hi all,

New to the project but I've been loosely following @adam2392 and the project for a while now.
I setup a dev environment according to DEVELOPMENT.md and ran into a few issues due to sklearn 1.6.dev0 being installed. Namely the introduction of check_sample_weight_equivalence in scikit-learn/scikit-learn@364cafe leads to expected but not skipped test-case failures.

What does this implement/fix? Explain your changes.

Changes will skip check_sample_weight_equivalence testing for forest implementations. It also addresses some un-pickling issues encountered during testing due to joblib/loky in the structure of treeple.stats.utils.
Changes should be backward compatible.

for more information, see https://pre-commit.ci

codecov · 2024-11-12T19:18:50Z

Codecov Report

❌ Patch coverage is 80.00000% with 2 lines in your changes missing coverage. Please review.
✅ Project coverage is 80.33%. Comparing base (e1c38ad) to head (f0f2a9e).
⚠️ Report is 25 commits behind head on main.

Files with missing lines	Patch %	Lines
treeple/ensemble/_honest_forest.py	50.00%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #335      +/-   ##
==========================================
- Coverage   80.50%   80.33%   -0.18%     
==========================================
  Files          24       24              
  Lines        2334     2339       +5     
  Branches      339      339              
==========================================
  Hits         1879     1879              
- Misses        318      322       +4     
- Partials      137      138       +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

goraj · 2024-12-09T14:27:03Z

Nothing?

adam2392

Sorry for the delay @goraj and thanks for the PR!

I left a review. So I was thinking, should we just revamp how we're using the parametrize_with_checks function. Instead of labeling sample_weight_equivalence as an ignored test per area, perhaps let's introduce a test_common.py function under treeple/tests/, and then we can do the "sklearn compatible" check there. Then, we can consolidate what tests we want to ignore in the same file.

Kind of like here:

https://github.com/scikit-learn/scikit-learn/blob/66270e46b77d6202559bae4929ec83ab320beb1e/sklearn/utils/_test_common/instance_generator.py#L771

and how expected_failed_checks kwarg of parametrize_with_checks is used to ignore tests inside https://github.com/scikit-learn/scikit-learn/blob/66270e46b77d6202559bae4929ec83ab320beb1e/sklearn/tests/test_common.py#L118

WDYT?

adam2392 · 2024-12-09T14:32:52Z

treeple/stats/utils.py

+    with parallel_config("multiprocessing"):
+        out = Parallel(n_jobs=n_jobs)(
+            delayed(_parallel_build_null_forests)(
+                y_pred_ind_arr,
+                n_estimators,
+                all_y_pred,
+                y_test,
+                seed,
+                metric,
+                **metric_kwargs,
+            )
+            for i, seed in zip(range(n_repeats), ss.spawn(n_repeats))


Why was this change made?

If I remember correctly the default loky would segfault during unit testing the *Oblique trees.

adam2392 · 2024-12-09T14:33:01Z

treeple/stats/utils.py

+    with parallel_config("multiprocessing"):
+        out = Parallel(n_jobs=n_jobs)(


Same as above.

goraj · 2024-12-17T14:58:54Z

Sorry for the delay @goraj and thanks for the PR!

I left a review. So I was thinking, should we just revamp how we're using the parametrize_with_checks function. Instead of labeling sample_weight_equivalence as an ignored test per area, perhaps let's introduce a test_common.py function under treeple/tests/, and then we can do the "sklearn compatible" check there. Then, we can consolidate what tests we want to ignore in the same file.

Kind of like here:

https://github.com/scikit-learn/scikit-learn/blob/66270e46b77d6202559bae4929ec83ab320beb1e/sklearn/utils/_test_common/instance_generator.py#L771

and how expected_failed_checks kwarg of parametrize_with_checks is used to ignore tests inside https://github.com/scikit-learn/scikit-learn/blob/66270e46b77d6202559bae4929ec83ab320beb1e/sklearn/tests/test_common.py#L118

WDYT?

Thank you.
I agree that would help quite a bit. I will update the PR accordingly, just a bit busy right now.

Jacob Gora added 2 commits November 11, 2024 18:56

sklearn 1.6.dev0 adjustments.

01e55d4

Adds sklearn<1.6 compatibility.

9686597

goraj changed the title ~~[FIX] sklearn 1.6.dev0 adjustments.~~ [WIP] sklearn 1.6.dev0 adjustments. Nov 12, 2024

Jacob Gora and others added 5 commits November 12, 2024 09:25

Additional tests covered.

0fef8d7

[pre-commit.ci] auto fixes from pre-commit.com hooks

29a34f4

for more information, see https://pre-commit.ci

Fixes OSX unpickling issue with loky/joblib.

732ca4b

Merge remote-tracking branch 'goraj/sklearn_1.6dev' into sklearn_1.6dev

8ce748a

[pre-commit.ci] auto fixes from pre-commit.com hooks

f0f2a9e

for more information, see https://pre-commit.ci

goraj changed the title ~~[WIP] sklearn 1.6.dev0 adjustments.~~ [ENH] sklearn 1.6.dev0 adjustments. Nov 12, 2024

adam2392 requested changes Dec 9, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[ENH] sklearn 1.6.dev0 adjustments. #335

[ENH] sklearn 1.6.dev0 adjustments. #335

Uh oh!

goraj commented Nov 11, 2024 •

edited

Loading

Uh oh!

codecov bot commented Nov 12, 2024 •

edited

Loading

Uh oh!

goraj commented Dec 9, 2024

Uh oh!

adam2392 left a comment

Uh oh!

adam2392 Dec 9, 2024

Uh oh!

goraj Dec 17, 2024

Uh oh!

adam2392 Dec 9, 2024

Uh oh!

goraj Dec 17, 2024

Uh oh!

goraj commented Dec 17, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		with parallel_config("multiprocessing"):
		out = Parallel(n_jobs=n_jobs)(

Uh oh!

[ENH] sklearn 1.6.dev0 adjustments. #335

Are you sure you want to change the base?

[ENH] sklearn 1.6.dev0 adjustments. #335

Uh oh!

Conversation

goraj commented Nov 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this implement/fix? Explain your changes.

Uh oh!

codecov bot commented Nov 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

goraj commented Dec 9, 2024

Uh oh!

adam2392 left a comment

Choose a reason for hiding this comment

Uh oh!

adam2392 Dec 9, 2024

Choose a reason for hiding this comment

Uh oh!

goraj Dec 17, 2024

Choose a reason for hiding this comment

Uh oh!

adam2392 Dec 9, 2024

Choose a reason for hiding this comment

Uh oh!

goraj Dec 17, 2024

Choose a reason for hiding this comment

Uh oh!

goraj commented Dec 17, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

goraj commented Nov 11, 2024 •

edited

Loading

codecov bot commented Nov 12, 2024 •

edited

Loading

goraj commented Dec 17, 2024 •

edited

Loading