Allow disassembled use of check_estimator #11622


Closed
azrdev opened this issue Jul 18, 2018 · 13 comments · Fixed by #14381

@azrdev

azrdev commented Jul 18, 2018

Description

For my downstream project, I'm testing my classifier with check_estimator, and would like to see which of its individual tests failed.
Doing so under nose works by using
for check in _yield_all_checks(name, estimator): yield check, name, my_estimator
(plus a separate call for the class-level checks), but pytest does not support yield-tests, and using _yield_all_checks to parametrize a test doesn't work because the function requires the estimator instance.
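
For reference, a rough sketch of the two approaches described above (MyClassifier stands in for the downstream estimator; _yield_all_checks is a private scikit-learn helper whose name and signature may change between releases):

from sklearn.utils.estimator_checks import _yield_all_checks

from mypackage import MyClassifier  # placeholder for the downstream estimator

name = MyClassifier.__name__
my_estimator = MyClassifier()

# nose-style yield test: each yielded (check, name, estimator) tuple
# becomes its own test case, so individual failures are visible
def test_all_checks():
    for check in _yield_all_checks(name, my_estimator):
        yield check, name, my_estimator

# pytest has dropped yield-test support, and parametrizing over
# _yield_all_checks is awkward because the helper is private and needs the
# estimator instance up front.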

Therefore the request is to provide all checks from check_estimator as a (non-private) iterable, so they can be used separately.

I brought this up in #10728 first, erroneously.

Versions

Linux-4.17.5-1-ARCH-x86_64-with-arch-Arch-Linux
Python 3.6.6 (default, Jun 27 2018, 13:11:40)
[GCC 8.1.1 20180531]
NumPy 1.14.5
SciPy 1.1.0
Scikit-Learn 0.19.1

@rth
Member

rth commented Jul 18, 2018

We could for instance add a parameter evaluate=True to check_estimator; with evaluate=False it would yield (check, name, estimator) tuples instead of running the checks. Downstream projects would then be able to use it, for instance with pytest as follows,

import pytest

from sklearn.utils.estimator_checks import check_estimator

@pytest.mark.parametrize('check, name, estimator',
                         check_estimator(SomeEstimatorClass, evaluate=False))
def test_sklearn_compatible_estimator(check, name, estimator):
    check(name, estimator)

The limitations of this approach are:

  • the name of the estimator is passed as a parametrization parameter. For a single estimator this is somewhat redundant, but it makes it easier to test multiple estimators in the downstream project at once, e.g. with

itertools.chain.from_iterable(check_estimator(Estimator, evaluate=False)
                              for Estimator in [Estimator1, Estimator2, ...])

  • the repr of the estimator instance is a bit verbose and suboptimal in the list of pytest-generated test names, but this can be fixed by customizing pytest options (see the sketch below).
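
For example, a minimal sketch of shorter test IDs, assuming the evaluate=False API proposed above existed: pass an ids callable to parametrize so the check's function name is used where available and pytest's default repr otherwise.

import pytest
from sklearn.utils.estimator_checks import check_estimator

from mypackage import SomeEstimatorClass  # placeholder for the downstream estimator

def _pretty_id(val):
    # returning None tells pytest to fall back to its default ID for this value
    return getattr(val, '__name__', None)

@pytest.mark.parametrize('check, name, estimator',
                         check_estimator(SomeEstimatorClass, evaluate=False),  # proposed API
                         ids=_pretty_id)
def test_sklearn_compatible_estimator(check, name, estimator):
    check(name, estimator)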

Will make a PR to illustrate this better.

@massich
Contributor

massich commented Jul 18, 2018

I'll give it a try

@jnothman
Member

jnothman commented Jul 19, 2018 via email

@rth
Member

rth commented Jul 20, 2018

Another way to potentially handle this is to provide, as an alternative to check_estimator, a pytest fixture that generates the various checks, or something similar... I've not thought this out in detail.

Interesting idea. The problem is that checks are generated based on the estimator, so the fixture would need to know which third-party estimators to test, which might be difficult.

@azrdev
Author

azrdev commented Jul 21, 2018

The problem is that checks are generated based on the estimator

What about separating this decision (does a check apply to this estimator?) from enumerating all checks? The simplest way would be to move it into the check itself, but maybe there's an even better idea?
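
To make the suggestion concrete, a hypothetical sketch (not how the checks are currently structured) where the applicability decision lives inside the check itself:

import pytest
from sklearn.base import is_classifier

def check_classification_example(name, estimator):
    # hypothetical check: it decides by itself whether it applies,
    # instead of relying on the code that enumerates the checks
    if not is_classifier(estimator):
        pytest.skip('%s is not a classifier' % name)
    # ... the actual classification assertions would follow here ...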

@rth
Member

rth commented Jul 24, 2018

What about separating this decision (does a check apply to this estimator?) from enumerating all checks? The simplest way would be to move it into the check itself, but maybe there's an even better idea?

Generally, some of this was discussed in #6715, but it would require significant refactoring of the way the estimator checks work. Estimator tags (#8022) could also be part of the solution in the long term.

We can't just move that decision inside the corresponding check_* functions (and skip when a check does not apply to the estimator), because then we would get an overly verbose list of tests that is harder to follow. E.g. if check_classification_* is currently skipped for some estimator, that typically means there is a known issue and the skip is a workaround. If the estimator is not a classifier in the first place, the check is simply not run. With the proposed change it would be run (and skipped) for all estimators, which would be confusing IMO.

@amueller
Member

I thought I had commented on this. I'm not sure I'm convinced it's good to enable running individual tests because that will encourage not running all tests.

If your motivation is debugging, then really we should provide a better error message.

@rth
Member

rth commented Jul 24, 2018

I thought I had commented on this. I'm not sure I'm convinced it's good to enable running individual tests because that will encourage not running all tests.

Thanks for the feedback. Still, suppose one has a single check that doesn't pass in check_estimator in a scikit-learn-contrib project. With an iterative setup, one could skip the check in question, mark it as a TODO, and run the rest of the checks. Without it, in the current situation, one would just not use check_estimator at all because it would fail. I'm not sure why the latter situation would be better.

It would give developers in scikit-learn contrib projects more flexibility when needed, without waiting for our 6-12 month release cycle. Flexibility is one of the things mentioned as a limiting factor in #6715. Of course, in an ideal world check_estimator would take into account all the needs of contrib projects, but we are not there yet.

Also, honestly, it's annoying to have one test that runs several dozen checks. That's why we parametrize them in scikit-learn (before with yields, now with pytest). Giving contrib projects the possibility to do the same, and trusting developers not to misuse it, would be nice IMO.
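
A sketch of the iterative setup mentioned above, again assuming the proposed evaluate=False mode: the one known-failing check is filtered out by name and left as a TODO, while everything else still runs.

from sklearn.utils.estimator_checks import check_estimator

from mypackage import MyEstimator  # placeholder for the downstream estimator

# TODO: fix this check and remove it from the list (example check name)
KNOWN_FAILURES = {'check_methods_subset_invariance'}

def test_sklearn_checks_except_known_failures():
    for check, name, estimator in check_estimator(MyEstimator, evaluate=False):
        if getattr(check, '__name__', '') in KNOWN_FAILURES:
            continue
        check(name, estimator)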

@rth
Member

rth commented Jul 24, 2018

Also, some projects will never manage to be fully compliant with the scikit-learn API due to other constraints (e.g. hybrid recommendation systems), but at the same time their API is somewhat inspired by scikit-learn; in that case one could imagine testing only the relevant subset of checks from check_estimator.

@amueller
Member

@rth that could be done with data formats and possibly tasks / estimator types.
But you make a good point that more flexibility is probably better and we're not going to cover 100% of use cases with our first version of estimator tags.

@azrdev
Author

azrdev commented Apr 17, 2019

@rth commented on 24 Jul 2018

We can't just move that decision inside the corresponding check_* functions (and skip when a check does not apply to the estimator), because then we would get an overly verbose list of tests that is harder to follow. E.g. if check_classification_* is currently skipped for some estimator, that typically means there is a known issue and the skip is a workaround.

I think pytest.mark.xfail(run=False) is there for exactly such "workaround" cases, to be distinguished from a skip.
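
In a parametrized setup that could look like this sketch (still assuming the proposed evaluate=False mode): known-broken checks get wrapped in pytest.param with an xfail(run=False) marker, so they are reported separately from genuine skips.

import pytest
from sklearn.utils.estimator_checks import check_estimator

from mypackage import MyEstimator  # placeholder for the downstream estimator

XFAIL_CHECKS = {'check_fit2d_1sample'}  # example name of a known-failing check

def _mark_known_failures(triples):
    for check, name, estimator in triples:
        if getattr(check, '__name__', '') in XFAIL_CHECKS:
            yield pytest.param(check, name, estimator,
                               marks=pytest.mark.xfail(run=False))
        else:
            yield check, name, estimator

@pytest.mark.parametrize('check, name, estimator',
                         _mark_known_failures(check_estimator(MyEstimator, evaluate=False)))
def test_sklearn_compatible_estimator(check, name, estimator):
    check(name, estimator)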

@amueller
Member

amueller commented Jun 7, 2019

Some possibly useful pointers:

I think this might be a good entry point:
https://docs.pytest.org/en/latest/usage.html#calling-pytest-from-python-code

Here's how to use hooks:
https://docs.pytest.org/en/latest/example/simple.html#incremental-testing-test-steps

I think you want to work with the collection hooks?
https://docs.pytest.org/en/latest/reference.html#collection-hooks
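
A conftest.py sketch of the collection-hook route, using the private _yield_all_checks helper since there is no public iterable yet (the helper is internal and may change): pytest_generate_tests parametrizes every test that requests check, name, and estimator.

# conftest.py of a downstream project (sketch)
from sklearn.utils.estimator_checks import _yield_all_checks

from mypackage import EstimatorA, EstimatorB  # placeholders for downstream estimators

ESTIMATORS = [EstimatorA, EstimatorB]

def pytest_generate_tests(metafunc):
    # parametrize any test function that asks for these three arguments
    if {'check', 'name', 'estimator'} <= set(metafunc.fixturenames):
        params = []
        for Estimator in ESTIMATORS:
            name = Estimator.__name__
            estimator = Estimator()
            for check in _yield_all_checks(name, estimator):
                params.append((check, name, estimator))
        metafunc.parametrize('check, name, estimator', params)

# any test module can then simply define:
#
# def test_sklearn_compatible(check, name, estimator):
#     check(name, estimator)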

@amueller
Member

Never mind, I was overthinking it, let's go with #11622 (comment)
