
FIX accept meta-estimator in SelfTrainingClassifier #19126


Merged (5 commits) on Jan 8, 2021

Conversation

glemaitre (Member):

closes #19119

SelfTrainingClassifier did not accept nested estimators that do not expose predict_proba before fitting.
One fix is to validate the fitted estimator instead, at which point we know whether predict_proba is available.

```diff
@@ -207,8 +207,11 @@ def fit(self, X, y):
         if self.n_iter_ == 1:
             # Only validate in the first iteration so that n_iter=0 is
-            # equivalent to the base_estimator itself.
-            _validate_estimator(self.base_estimator)
+            # equivalent to the base_estimator_ itself.
```
glemaitre (Member, Author):
@oliverrausch Do you recall the meaning of this line and the previous one?
It was not obvious to me.

@ogrisel (Member) commented on Jan 6, 2021:

May I suggest:

```python
# Only validate the fitted estimator of the first iteration.
```

Actually since the validation is really cheap, I would not mind simplifying this code and removing the if self.n_iter_ == 1: condition.
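A small demonstration of why validating the fitted estimator matters (a sketch, using StackingClassifier's default final_estimator for illustration): predict_proba on a meta-estimator is delegated to the fitted final estimator, so a hasattr check can fail before fit yet succeed after.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import StackingClassifier
from sklearn.linear_model import LogisticRegression

X, y = make_classification(random_state=0)

# With the default final_estimator (only created during fit), the
# delegated predict_proba cannot be resolved on the unfitted instance.
stack = StackingClassifier(estimators=[("lr", LogisticRegression())])
print(hasattr(stack, "predict_proba"))  # may be False before fit

stack.fit(X, y)
print(hasattr(stack, "predict_proba"))  # True once fitted
```

This is why moving the `_validate_estimator` call after fitting (or dropping the `if self.n_iter_ == 1:` condition entirely, since the check is cheap) makes meta-estimators acceptable.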

@glemaitre added this to the 0.24.1 milestone on Jan 6, 2021
@glemaitre added the "To backport" label (PR merged in master that needs a backport to a release branch defined based on the milestone) on Jan 6, 2021
@thomasjpfan (Member) left a comment:

LGTM

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>
@ogrisel (Member) left a comment:

LGTM (once the review comments are addressed).


@riyadhctg commented on Jan 6, 2021:

LGTM. For the sake of sharing, the solution I had in mind for #19119 was to swap the order of the following two lines in https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/ensemble/_stacking.py

```python
names, all_estimators = self._validate_estimators()
self._validate_final_estimator()
```

to

```python
self._validate_final_estimator()
names, all_estimators = self._validate_estimators()
```

@glemaitre (Member, Author):

@riyadhctg The change you propose is in StackingClassifier, and I think the code is fine there. The required change is in SelfTrainingClassifier.

glemaitre and others added 2 commits January 6, 2021 17:29
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
@ogrisel ogrisel merged commit 6d3d1b8 into scikit-learn:master Jan 8, 2021
glemaitre added a commit to glemaitre/scikit-learn that referenced this pull request Jan 18, 2021
Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
jeremiedbb pushed a commit that referenced this pull request Jan 19, 2021
Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
Labels
module:semi_supervised; To backport (PR merged in master that needs a backport to a release branch defined based on the milestone)
Development

Successfully merging this pull request may close these issues.

StackClassifier is SelfTrainingClassifier throws error
4 participants