FEA SLEP006: Metadata routing for `SelfTrainingClassifier` #28494

adam2392 · 2024-02-21T15:18:38Z

Reference Issues/PRs

Towards: #22893

What does this implement/fix? Explain your changes.

Implements metadata routing for `SelfTrainingClassifier`. Note that the added code diff simply comes from replacing `base_estimator` with `estimator`.

- Converts `base_estimator` to `estimator` and adds a deprecation.
- Uncomments other functions in `NonConsumingClassifier` that were previously not tested.
- ~~Fixes some minor design choices in the unit-testing framework of `test_metaestimators_metadata_routing.py` (e.g. try/except -> if/else to be more transparent).~~

Any other comments?

cc: @adrinjalali

Some open questions:

~~1. I presume, we want to forward metadata within all the functions possibly?~~
2. As a result, I'm not sure the current unit-testing approach is the best one. Do you have any suggestions? Should I try refactoring the existing unit-testing code to allow testing more than just `fit`?

Signed-off-by: Adam Li <adam2392@gmail.com>

github-actions · 2024-02-21T15:19:58Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 2cacdc9.


adrinjalali

A few notes, thanks @adam2392

sklearn/semi_supervised/_self_training.py

Signed-off-by: Adam Li <adam2392@gmail.com>

adam2392

Thank you for the review and pointers! I went through and fixed the issues in the docstrings and the `Bunch`.

sklearn/semi_supervised/_self_training.py

Signed-off-by: Adam Li <adam2392@gmail.com>


adam2392 · 2024-03-14T13:42:52Z

Resolved conflicts. Feel free to ping me if additional changes are desired.

Signed-off-by: Adam Li <adam2392@gmail.com>

adam2392 · 2024-04-02T13:42:59Z

This PR should not be affected by #28734

sklearn/semi_supervised/tests/test_self_training.py

sklearn/tests/test_metaestimators_metadata_routing.py

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

OmarManzoor

Thanks for the updates. A few other comments.

OmarManzoor · 2024-06-24T11:43:38Z

sklearn/tests/test_metaestimators_metadata_routing.py

+                if method_name in ["fit", "partial_fit", "score"]:
+                    # `fit`, `partial_fit`, 'score' accept y, others don't.
                    method(X, y, **method_kwargs)
-                except TypeError:
+                else:


I think with try/except we don't need to bother maintaining this list. However, I am fine either way. I'll let @adrinjalali finalise.

doc/whats_new/v1.6.rst

sklearn/semi_supervised/_self_training.py

Signed-off-by: Adam Li <adam2392@gmail.com>

adam2392 · 2024-06-24T12:12:23Z

Thanks for the review @OmarManzoor!

I'll change the last comment if @adrinjalali has any issues (#28494 (comment))

OmarManzoor

LGTM. Thanks @adam2392

adrinjalali

Thanks @adam2392

sklearn/semi_supervised/_self_training.py

adrinjalali · 2024-07-05T04:17:18Z

sklearn/tests/test_metaestimators_metadata_routing.py

+                if method_name in ["fit", "partial_fit", "score"]:
+                    # `fit`, `partial_fit`, 'score' accept y, others don't.
                    method(X, y, **method_kwargs)
-                except TypeError:
+                else:


I'm not sure why this makes things hard to debug, really. The try/except is more foolproof: a library like imbalanced-learn adds some methods to the whole system by patching a few things in sklearn, and with try/except things would just work.
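To make the trade-off in this thread concrete, here is a minimal sketch, not the actual test code from `test_metaestimators_metadata_routing.py`. The classes `Transformer` and `Resampler` and the helpers `call_with_list` and `call_with_try` are hypothetical names invented for illustration; `fit_resample` stands in for a method that a third-party library might add.

```python
class Transformer:
    """Hypothetical estimator: `fit` takes (X, y), `transform` takes only X."""

    def fit(self, X, y):
        return "fit(X, y)"

    def transform(self, X):
        return "transform(X)"


class Resampler:
    """Hypothetical third-party object with a method the list does not know."""

    def fit_resample(self, X, y):
        return "fit_resample(X, y)"


def call_with_list(obj, method_name, X, y):
    # if/else variant: relies on a hand-maintained list of methods accepting y.
    method = getattr(obj, method_name)
    if method_name in ["fit", "partial_fit", "score"]:
        return method(X, y)
    return method(X)


def call_with_try(obj, method_name, X, y):
    # try/except variant: try passing y first, fall back to X only.
    method = getattr(obj, method_name)
    try:
        return method(X, y)
    except TypeError:
        return method(X)


X, y = [[0], [1]], [0, 1]
print(call_with_try(Transformer(), "transform", X, y))
print(call_with_try(Resampler(), "fit_resample", X, y))
```

With the list-based helper, `call_with_list(Resampler(), "fit_resample", X, y)` raises `TypeError`, because `fit_resample` is not in the list and is called without `y`. The price of the try/except variant is that a `TypeError` raised inside the method body also triggers the fallback call, which is one way the original failure can become harder to read.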

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

Signed-off-by: Adam Li <adam2392@gmail.com>


adam2392

Thanks for the review @adrinjalali ! I addressed your comments in 3b5a3f0

sklearn/semi_supervised/_self_training.py

adam2392 · 2024-07-05T12:17:32Z

sklearn/tests/test_metaestimators_metadata_routing.py

+                if method_name in ["fit", "partial_fit", "score"]:
+                    # `fit`, `partial_fit`, 'score' accept y, others don't.
                    method(X, y, **method_kwargs)
-                except TypeError:
+                else:


Fair. I think it was hard before because the error message was not very clear about which method of which class was failing, so I assume that's been fixed with #29226.

adam2392 · 2024-07-05T12:17:46Z

sklearn/tests/test_metaestimators_metadata_routing.py

+                if method_name in ["fit", "partial_fit", "score"]:
+                    # `fit`, `partial_fit`, 'score' accept y, others don't.
                    method(X, y, **method_kwargs)
-                except TypeError:
+                else:


I reverted it to the try/except

Signed-off-by: Adam Li <adam2392@gmail.com>

sklearn/semi_supervised/_self_training.py

adrinjalali · 2024-07-05T13:02:04Z

sklearn/semi_supervised/_self_training.py

+            )
+        else:
+            estimator_ = clone(self.estimator)
+        return estimator_


This is still missing the case where both `estimator` and `base_estimator` are passed, in which case we need to raise an error.
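The four combinations of the old and new parameter can be sketched as below. This is an illustration under stated assumptions, not the code merged in this PR: `resolve_estimator` is a hypothetical standalone function, the `"deprecated"` sentinel default and the message texts are assumptions, and the `clone` call shown in the diff above is left out so the sketch needs only the standard library.

```python
import warnings


def resolve_estimator(estimator=None, base_estimator="deprecated"):
    """Hypothetical helper: choose the estimator during the deprecation period."""
    # Assumption: the string "deprecated" marks `base_estimator` as not passed.
    base_given = not (
        isinstance(base_estimator, str) and base_estimator == "deprecated"
    )

    if estimator is not None and base_given:
        # Both parameters were passed: ambiguous, so raise.
        raise ValueError(
            "`estimator` and `base_estimator` cannot both be set; "
            "use only `estimator`."
        )
    if estimator is None and not base_given:
        # Neither parameter was passed: there is nothing to fit.
        raise ValueError("An estimator must be passed via `estimator`.")
    if base_given:
        # Only the deprecated parameter was passed: warn, then honor it.
        warnings.warn(
            "`base_estimator` is deprecated; use `estimator` instead.",
            FutureWarning,
        )
        return base_estimator
    # Only the new parameter was passed: the normal path.
    return estimator
```

A test for such a helper would cover all four branches: only `estimator` (no warning), only `base_estimator` (a `FutureWarning`), both (an error), and neither (an error).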

adrinjalali · 2024-07-05T13:03:43Z

sklearn/semi_supervised/tests/test_self_training.py

+
+
+# TODO(1.8): remove in 1.8
+def test_deprecation_warning_base_estimator():


This should also test all the other cases in `_get_estimator`.

Signed-off-by: Adam Li <adam2392@gmail.com>

adam2392 · 2024-07-06T15:30:36Z

Let me know if I missed anything else, @adrinjalali.

Thanks for the review and patience!

adrinjalali

Otherwise LGTM.

I'll let @OmarManzoor have another look, since this has changed a bit since he last reviewed.

sklearn/semi_supervised/_self_training.py

Signed-off-by: Adam Li <adam2392@gmail.com>

adam2392 · 2024-07-08T13:52:58Z

SG! Thanks for the reviews.

OmarManzoor

LGTM. Thanks @adam2392

adam2392 added 2 commits February 21, 2024 10:16

Mergeing

616c178

Signed-off-by: Adam Li <adam2392@gmail.com>

Mergeing

1e8c340

Signed-off-by: Adam Li <adam2392@gmail.com>

github-actions bot added the module:semi_supervised label Feb 21, 2024

adam2392 added 4 commits February 21, 2024 10:37

WIP

61cb4df

Signed-off-by: Adam Li <adam2392@gmail.com>

Merging main

b18fbbd

Signed-off-by: Adam Li <adam2392@gmail.com>

Merged

b96e9e9

Signed-off-by: Adam Li <adam2392@gmail.com>

WIP

2053545

Signed-off-by: Adam Li <adam2392@gmail.com>

adam2392 marked this pull request as ready for review February 23, 2024 20:38

adrinjalali reviewed Feb 27, 2024


Address adrin's comments

7da8015

Signed-off-by: Adam Li <adam2392@gmail.com>

adam2392 commented Feb 27, 2024


adam2392 added 4 commits February 27, 2024 09:32

Merge branch 'main' into self-learn-meta

c88e943

Merge branch 'main' into self-learn-meta

b46e26f

Fix unit tests

ed2437d

Signed-off-by: Adam Li <adam2392@gmail.com>

Merge branch 'self-learn-meta' of https://github.com/adam2392/scikit-…

8180ee0

…learn into self-learn-meta

adam2392 requested a review from adrinjalali February 27, 2024 17:17

adam2392 added 3 commits February 28, 2024 12:55

Merge branch 'main' into self-learn-meta

d863ce1

Merge branch 'main' into self-learn-meta

3314455

Merge branch 'main' into self-learn-meta

7103f04

adam2392 added 4 commits March 14, 2024 22:32

Merge branch 'main' into self-learn-meta

549380a

Reformat lint

b5cfed2

Signed-off-by: Adam Li <adam2392@gmail.com>

Merge branch 'main' into self-learn-meta

dca2712

Merge branch 'main' into self-learn-meta

7235ae1

adam2392 added 2 commits April 2, 2024 09:43

Merge branch 'main' into self-learn-meta

5ab88c1

Merge branch 'main' into self-learn-meta

972ddc1

adrinjalali reviewed Apr 10, 2024

sklearn/semi_supervised/tests/test_self_training.py (outdated, resolved)

sklearn/semi_supervised/tests/test_self_training.py (outdated, resolved)

sklearn/tests/test_metaestimators_metadata_routing.py (outdated, resolved)

Update sklearn/semi_supervised/tests/test_self_training.py

dbfdace

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

OmarManzoor reviewed Jun 24, 2024


adam2392 added 2 commits June 24, 2024 08:11

Address omar comments

9306266

Signed-off-by: Adam Li <adam2392@gmail.com>

Merge branch 'main' into self-learn-meta

f46395c

OmarManzoor approved these changes Jun 24, 2024


adam2392 added 2 commits June 27, 2024 10:00

Merge branch 'main' into self-learn-meta

c5c2e27

Merge branch 'main' into self-learn-meta

36c1b4a

adrinjalali reviewed Jul 5, 2024


adam2392 and others added 3 commits July 5, 2024 08:16

Apply suggestions from code review

8815328

Co-authored-by: Adrin Jalali <adrin.jalali@gmail.com>

Merging

3b5a3f0

Signed-off-by: Adam Li <adam2392@gmail.com>

Merge branch 'self-learn-meta' of https://github.com/adam2392/scikit-…

938e4f7

…learn into self-learn-meta

adam2392 commented Jul 5, 2024


adam2392 added 2 commits July 5, 2024 08:19

Merge branch 'main' into self-learn-meta

9672e5b

Rename the get estimator

b2b518b

Signed-off-by: Adam Li <adam2392@gmail.com>

adrinjalali reviewed Jul 5, 2024


adam2392 added 3 commits July 5, 2024 09:12

Add extra deprecation unit tests

42dade6

Signed-off-by: Adam Li <adam2392@gmail.com>

Fixed

de8b828

Signed-off-by: Adam Li <adam2392@gmail.com>

Fix unit tests

7012dbd

Signed-off-by: Adam Li <adam2392@gmail.com>

adam2392 requested a review from adrinjalali July 5, 2024 14:21

adam2392 added 2 commits July 5, 2024 10:28

Merge branch 'main' into self-learn-meta

31f012d

Merge branch 'main' into self-learn-meta

7a13581

adrinjalali approved these changes Jul 8, 2024

sklearn/semi_supervised/_self_training.py (outdated, resolved)

adam2392 added 2 commits July 8, 2024 09:42

Fix doc string

feff521

Signed-off-by: Adam Li <adam2392@gmail.com>

Merge branch 'main' into self-learn-meta

2cacdc9

adam2392 requested a review from OmarManzoor July 8, 2024 13:42

OmarManzoor approved these changes Jul 9, 2024


OmarManzoor merged commit cef803a into scikit-learn:main Jul 9, 2024
30 checks passed

adam2392 deleted the self-learn-meta branch July 9, 2024 11:06


