CI Make common test xfails strict #32080

betatim · 2025-09-02T12:51:44Z

Follow up to #31951

This PR enables strict xfail mode for just the common estimator checks. This allows us to find common tests that were marked as xfail but have started passing. If a test outcome changes we either need to update our xfail list or it is a bug that needs adressing.

As part of this I found some checks that no longer fail, the xfail list is updated. There are also a few estimators that are tested in different configurations, and fail/pass depending on the configuration. I've implemented that by adding custom logic to pop the check from the xfail list for some configs.

For some of the checks related to sample weights I wonder if. they really pass now or if the check is not strict enough to detect that the estimator doesn't full fill the requirements implied by the check? Maybe @ogrisel knows more about this?

ping @adrinjalali who thought it could be interesting to enable this for the scikit-learn test suite. What do you think?

This allows us to find common tests that were marked as xfail but have started passing. If a test outcome changes we either need to update our xfail list or it is a bug that needs adressing.

github-actions · 2025-09-02T12:52:34Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: cc8f605. Link to the linter CI: here}

betatim · 2025-09-02T12:53:32Z

sklearn/utils/_test_common/instance_generator.py

@@ -845,24 +842,6 @@ def _yield_instances_for_check(check, estimator_orig):


 PER_ESTIMATOR_XFAIL_CHECKS = {
-    AdaBoostClassifier: {
-        # TODO: replace by a statistical test, see meta-issue #16298


This is an instance where the check doesn't fail but I am not sure if this is because AdaBoostClassifier has been fixed or because the check is not "good enough" to detect that sample weight handling is broken?

It's possible that the check is a bit too weak. Still we can remove the XFAIL markers for this and maybe readd it later when needed if we ever make the check stronger.

Make common tests strict

cc8f605

This allows us to find common tests that were marked as xfail but have started passing. If a test outcome changes we either need to update our xfail list or it is a bug that needs adressing.

github-actions bot added module:utils Build / CI labels Sep 2, 2025

betatim commented Sep 2, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

CI Make common test xfails strict #32080

CI Make common test xfails strict #32080

Uh oh!

betatim commented Sep 2, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Sep 2, 2025

Uh oh!

betatim Sep 2, 2025

Uh oh!

ogrisel Sep 2, 2025

Uh oh!

Uh oh!

Uh oh!

CI Make common test xfails strict #32080

Are you sure you want to change the base?

CI Make common test xfails strict #32080

Uh oh!

Conversation

betatim commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Sep 2, 2025

✔️ Linting Passed

Uh oh!

betatim Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

ogrisel Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

betatim commented Sep 2, 2025 •

edited

Loading