TST allow categorisation of tests into API and legacy #29699

adrinjalali · 2024-08-21T13:54:07Z

This is a very minimal PR, allowing us to start working on categorisation of tests. The idea is:

a set of API only checks which will always be run
a set of legacy tests which start with almost all common tests we have
gradually move / create API only tests into the API only category, and document them in the developer guides
gradually move other tests into their own categories, while adding those categories as a boolean parameter to parametrize_with_checks, such as statistical, etc.

cc @glemaitre @thomasjpfan @OmarManzoor @adam2392

github-actions · 2024-08-21T13:55:44Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 38d82bc. Link to the linter CI: here}

adam2392

I assume documentation will come later?

Is there an associated GH issue to track what will move where and such?

glemaitre · 2024-08-21T14:02:34Z

At least for the moment, this is going to solve: #16241 -> it means that we need to have a API documentation page in line with the check that we are doing.

Then, adding tests to other category would be for later.

adrinjalali · 2024-08-21T14:06:46Z

Yes, there will be a bunch of documentation work for this page: https://scikit-learn.org/stable/developers/develop.html

I rather we settle with this API here and merge, and then work on API only tests dedicated in a different PR, and follow up PR to update our documentation. Along with other PRs here: Developer API Third party developer API related my goal is to make it nicer and cleaner for third party developers to develop scikit-learn compatible estimators.

adrinjalali · 2024-08-21T14:12:59Z

Also for some historical context, @NicolasHug had some a lot of work in #16882, #16890, #17252, and #17361. But we ended up reverting them last minute before a release.

adam2392 · 2024-08-21T14:17:37Z

I see! That's exciting. I always did find the parametrize_with_checks just a blackbox, so I think having some more fine-grained control over what's "checked" is useful.

thomasjpfan

@adrinjalali Can you open an issue or comment on an issue that gives an overview on what the categories will be?

thomasjpfan · 2024-08-21T17:35:36Z

sklearn/utils/estimator_checks.py

@@ -513,9 +522,14 @@ def _should_be_skipped_or_marked(estimator, check):
    return False, "placeholder reason that will never be used"


-def parametrize_with_checks(estimators):
+def parametrize_with_checks(estimators, legacy=True):


Does this need to be added to check_estimator?

I'll add it.

adrinjalali

@thomasjpfan

Can you open an issue or comment on an issue that gives an overview on what the categories will be?

We don't really know at this point what those categories are gonna be. API and "statistical" tests are probably the two we have, then there's array API, sample weight, and I'm not sure about the rest. My plan is to start going through them and to create the categories as we go, while having an overview once I start working on it.

adrinjalali · 2024-08-21T17:45:27Z

sklearn/utils/estimator_checks.py

@@ -513,9 +522,14 @@ def _should_be_skipped_or_marked(estimator, check):
    return False, "placeholder reason that will never be used"


-def parametrize_with_checks(estimators):
+def parametrize_with_checks(estimators, legacy=True):


I'll add it.

adrinjalali · 2024-08-22T06:25:54Z

Issue for test categories: #29703

adrinjalali · 2024-08-22T06:27:08Z

At least for the moment, this is going to solve: #16241 -> it means that we need to have a API documentation page in line with the check that we are doing.

@glemaitre I agree, but do you want this in this PR? I rather have that in a followup PR since adding all API tests would be quite large.

sklearn/utils/estimator_checks.py

glemaitre · 2024-09-03T09:41:06Z

I agree, but do you want this in this PR? I rather have that in a followup PR since adding all API tests would be quite large.

Not in this PR, it was just to provide the big picture here.

glemaitre

We should add a test now to make sure that we have less test runnnings with legacy=False.

sklearn/utils/estimator_checks.py

glemaitre · 2024-09-03T09:50:54Z

sklearn/utils/estimator_checks.py

@@ -533,6 +547,11 @@ def parametrize_with_checks(estimators):

        .. versionadded:: 0.24

+    legacy : bool (default=True)
+        Whether to include legacy checks.


I'm wondering if you should add a little note to mention that the legacy checks exists during the transition that we create the categorisation.

sklearn/utils/estimator_checks.py

sklearn/utils/tests/test_estimator_checks.py

glemaitre

We should add new test to make sure that the length of the generator with legacy=True is bigger than legacy=False.

glemaitre · 2024-09-03T14:42:15Z

Enabling auto-merge and trying to make Debian build works

TST allow categorisation of tests into API and legacy

d28c3cb

adrinjalali added the Developer API Third party developer API related label Aug 21, 2024

github-actions bot added the module:utils label Aug 21, 2024

adam2392 approved these changes Aug 21, 2024

View reviewed changes

thomasjpfan reviewed Aug 21, 2024

View reviewed changes

adrinjalali commented Aug 21, 2024

View reviewed changes

add legacy to check_estimator

3474eea

adrinjalali mentioned this pull request Aug 22, 2024

Split common tests into groups #29703

Open

adrinjalali added 2 commits August 22, 2024 17:44

fix tests

3975f17

Merge remote-tracking branch 'upstream/main' into tests/legacy

d460786

glemaitre added the No Changelog Needed label Aug 22, 2024

thomasjpfan reviewed Aug 22, 2024

View reviewed changes

sklearn/utils/estimator_checks.py Outdated Show resolved Hide resolved

This was referenced Aug 23, 2024

TST remove _required_parameters and improve instance generation #29707

Merged

TST Create dedicated dataframe / feature count tests category #29713

Draft

adrinjalali added 2 commits August 29, 2024 10:19

make arg kwonly

87a0e30

Merge remote-tracking branch 'upstream/main' into tests/legacy

ce85bf8

glemaitre self-requested a review September 3, 2024 09:36

glemaitre approved these changes Sep 3, 2024

View reviewed changes

adrinjalali added 2 commits September 3, 2024 13:29

Guillaume's comments

2e0ba19

Merge remote-tracking branch 'upstream/main' into tests/legacy

22f5c29

glemaitre enabled auto-merge (squash) September 3, 2024 14:42

Merge branch 'main' into tests/legacy

38d82bc

glemaitre merged commit 1c3dcb4 into scikit-learn:main Sep 4, 2024
28 checks passed

adrinjalali deleted the tests/legacy branch September 4, 2024 11:26

adrinjalali mentioned this pull request Mar 10, 2025

Add non-strict mode to check_estimator #13969

Closed

Uh oh!

TST allow categorisation of tests into API and legacy #29699

TST allow categorisation of tests into API and legacy #29699

Uh oh!

Conversation

adrinjalali commented Aug 21, 2024

Uh oh!

github-actions bot commented Aug 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✔️ Linting Passed

Uh oh!

adam2392 left a comment

Choose a reason for hiding this comment

Uh oh!

glemaitre commented Aug 21, 2024

Uh oh!

adrinjalali commented Aug 21, 2024

Uh oh!

adrinjalali commented Aug 21, 2024

Uh oh!

adam2392 commented Aug 21, 2024

Uh oh!

thomasjpfan left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

thomasjpfan Aug 21, 2024

Choose a reason for hiding this comment

Uh oh!

adrinjalali Aug 21, 2024

Choose a reason for hiding this comment

Uh oh!

adrinjalali left a comment

Choose a reason for hiding this comment

Uh oh!

adrinjalali Aug 21, 2024

Choose a reason for hiding this comment

Uh oh!

adrinjalali commented Aug 22, 2024

Uh oh!

adrinjalali commented Aug 22, 2024

Uh oh!

Uh oh!

glemaitre commented Sep 3, 2024

Uh oh!

glemaitre left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

glemaitre Sep 3, 2024

Choose a reason for hiding this comment

Uh oh!

adrinjalali Sep 3, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

glemaitre left a comment

Choose a reason for hiding this comment

Uh oh!

glemaitre commented Sep 3, 2024

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Aug 21, 2024 •

edited

Loading

thomasjpfan left a comment •

edited

Loading