[MRG] FIX run test for meta-estimator having estimators keyword #14305

Merged: 14 commits merged into scikit-learn:master on Jul 29, 2019

Conversation

@glemaitre (Member) commented Jul 11, 2019

We should start running tests for meta-estimators like VotingClassifier and VotingRegressor.

This is also useful for StackingClassifier and StackingRegressor, for which we should make sure these tests pass.
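
For context, a minimal sketch (not part of this PR) of running the common checks on the voting meta-estimators, assuming check_estimator accepts a constructed instance and using arbitrarily chosen sub-estimators:

```python
from sklearn.ensemble import VotingClassifier, VotingRegressor
from sklearn.linear_model import LogisticRegression, Ridge
from sklearn.utils.estimator_checks import check_estimator

# `estimators` is a required constructor parameter, so the instances have to
# be built explicitly before the common checks can run on them.
check_estimator(VotingClassifier(estimators=[("lr", LogisticRegression())]))
check_estimator(VotingRegressor(estimators=[("ridge", Ridge())]))
```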

@glemaitre glemaitre changed the title [WIP] FIX run test for meta-estimator having estimators keyword [MRG] FIX run test for meta-estimator having estimators keyword Jul 11, 2019
@glemaitre (Member, Author) commented Jul 11, 2019

I would be happy to get some feedback from @amueller @jnothman @thomasjpfan @NicolasHug.

@NicolasHug (Member) left a comment

Thanks for doing this, it was clearly lacking.

Only a few comments, looks good overall.

@@ -84,6 +84,13 @@ Changelog
preserve the class balance of the original training set. :pr:`14194`
by :user:`Johann Faouzi <johannfaouzi>`.

- |Fix| Enable to run :func:`utils.check_estimator` on both

Member:
since this is not in the utils module section, the link is broken ;)

@@ -2165,6 +2173,21 @@ def check_parameters_default_constructible(name, Estimator):
estimator = Estimator(Ridge())
else:
estimator = Estimator(LinearDiscriminantAnalysis())
elif "estimators" in required_parameters:

Member:
It seems that this whole estimator initialization is duplicated between check_parameters_default_constructible and _tested_estimators. Might be worth considering a unifying helper?

Member (Author):
I agree, but I would do that in another PR, or go for a better solution as proposed by @jnothman.
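
For illustration, a rough sketch of what such a unifying helper could look like; this is not code from the PR, and the helper name _construct_instance is hypothetical:

```python
from sklearn.base import RegressorMixin
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.linear_model import Ridge


def _construct_instance(Estimator):
    """Build a testable instance, filling in required meta-estimator params."""
    required = getattr(Estimator, "_required_parameters", [])
    # Mirror the branching in check_parameters_default_constructible: wrap a
    # simple regressor for regressors, a simple classifier otherwise.
    base = (Ridge() if issubclass(Estimator, RegressorMixin)
            else LinearDiscriminantAnalysis())
    if "estimator" in required or "base_estimator" in required:
        return Estimator(base)
    if "estimators" in required:
        return Estimator(estimators=[("est", base)])
    return Estimator()
```

Both check_parameters_default_constructible and _tested_estimators could then call such a helper instead of duplicating the branching.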

@jnothman (Member)

I find this too implicit/magical. I'd really just rather have a way for estimators to specify test parameters, which I was moving towards in #11324.

@glemaitre (Member, Author)

> I find this too implicit/magical. I'd really just rather have a way for estimators to specify test parameters, which I was moving towards in #11324.

I agree. Could we find a middle ground by introducing this test for the time being and later refactoring or replacing it with something like what you are proposing? In the meantime, we would at least run some tests on these estimators.

@NicolasHug (Member)

I agree that we can temporarily merge this. It is indeed implicit etc., but after all that's what we've been doing with all the other meta-estimators so far. I'll try to take a look at #11324.

@glemaitre I think I can approve once you address the comments?

@glemaitre (Member, Author)

@NicolasHug I believe I have addressed all the comments.

@NicolasHug (Member) left a comment

Thanks @glemaitre

@amueller (Member) left a comment

Looks mostly good; I'd really rather not add to the checking parameters, though.

@@ -396,6 +399,10 @@ def set_checking_parameters(estimator):
if name == 'OneHotEncoder':
estimator.set_params(handle_unknown='ignore')

# set voting='soft' to be able to use predict_proba
if name == 'VotingClassifier':
estimator.set_params(voting='soft')

Member:
Is this necessary to pass the test or do you just add it so we can test the version with predict_proba? I'd really rather not add anything here, and if we want to test this particular instantiation, we should call check_estimator on VotingClassifier with these parameters directly.

This shouldn't be necessary for making the tests pass if the ducktyping works correctly.

Member (Author):
This is necessary; otherwise predict_proba is not defined and raises an error (leading to some failures in the common tests).

Member:
#14287 might have fixed this?

Member (Author):
right
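
For context, a minimal sketch of the ducktyping behaviour discussed in this thread, assuming the post-#14287 semantics where predict_proba is only exposed for voting='soft', so hasattr-based common checks skip the probability tests for hard voting instead of failing:

```python
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression

hard = VotingClassifier(estimators=[("lr", LogisticRegression())], voting="hard")
soft = VotingClassifier(estimators=[("lr", LogisticRegression())], voting="soft")

# Accessing predict_proba on a hard-voting classifier raises AttributeError,
# so hasattr reports False and probability-based checks are skipped.
print(hasattr(hard, "predict_proba"))  # False
print(hasattr(soft, "predict_proba"))  # True
```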

@@ -107,6 +107,13 @@ Changelog
preserve the class balance of the original training set. :pr:`14194`
by :user:`Johann Faouzi <johannfaouzi>`.

- |Fix| Enable to run :func:`utils.estimator_checks.check_estimator` on both

Member:
I would say "run it by default" because you could already run it by giving it an instance.
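
A rough sketch of why "by default" is the right wording, assuming the public all_estimators helper and the _required_parameters class attribute: estimators whose constructor requires arguments cannot be instantiated automatically, so without special handling the common tests skip them unless an instance is passed explicitly.

```python
from sklearn.utils import all_estimators

# Estimators that cannot be default-constructed and therefore need the kind
# of special handling this PR adds to the common tests.
needs_params = [name for name, Est in all_estimators()
                if getattr(Est, "_required_parameters", [])]
print(needs_params)  # e.g. Pipeline, VotingClassifier, VotingRegressor, ...
```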

@jnothman (Member) commented Jul 27, 2019 via email

@glemaitre (Member, Author)

@jnothman I moved the tests into ensemble/tests/test_voting.py until we have something specific for all meta-estimators.

@NicolasHug (Member) left a comment

Still LGTM after the changes ;)

@amueller amueller merged commit 5925fb9 into scikit-learn:master Jul 29, 2019
@amueller (Member)

thanks! (grr I forgot to fix the merge message again, sorry!)
