FIX MultiOutput* when sub-estimator does not accept metadata #28240

adrinjalali · 2024-01-24T13:47:54Z

Fixes #28239

And adds a common test to check the same issue with all other meta-estimators in the common test file.

cc @glemaitre @OmarManzoor

github-actions · 2024-01-24T13:49:15Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 035441f. Link to the linter CI: here}

sklearn/multioutput.py

glemaitre · 2024-01-24T16:27:36Z

In process_routing we say:

    Assuming this signature: ``fit(self, X, y, sample_weight=None, **fit_params)``,
    a call to this function would be:
    ``process_routing(self, sample_weight=sample_weight, **fit_params)``.

It looks incorrect then because we should not pass sample_weight when they are None.

adrinjalali · 2024-01-25T11:29:29Z

Would we want to enforce this contract?

Only non-None metadata are routed. All metadata whose value is None, will be ignored.

I think it would be very odd for None to actually be a non-default value which has a side effect.

adrinjalali · 2024-02-02T08:12:20Z

Oh, I see why CalibratedClassifierCV doesn't raise the same error. Here:

scikit-learn/sklearn/utils/_metadata_requests.py

Line 1058 in 548fc6f

extra_keys = set(params.keys()) - param_names - self_params

we only raise if anything is passed but is not either requested, or is a self accepted metadata. Therefore it doesn't matter if we pass sample_weight there or no, there will be no errors anyway. And passing it as None is of no consequence since it's not gonna be included in any routed param set, since it's not requested by the children. So the code is good as it is.

glemaitre

LGTM. Thanks @adrinjalali

StefanieSenger · 2024-02-05T11:41:04Z

I think there should also be tests for any meta-estimator, like Pipeline, that takes several sub-estimators or sub-transformers, that might or might not support that metadata.

adrinjalali · 2024-02-05T13:57:20Z

I think there should also be tests for any meta-estimator, like Pipeline, that takes several sub-estimators or sub-transformers, that might or might not support that metadata.

I think those are tested in the common metadata routing tests @StefanieSenger

StefanieSenger · 2024-02-05T16:26:15Z

@adrinjalali
I meant to add tests if Pipeline and other meta-estimators, that accept lists of several sub-estimators, will also accept a mix of consumers and non-consumers, for instance in test_pipeline.py.
Those newly added tests here in test_metaestimators_metadata_routing.py don't do that.

adrinjalali · 2024-02-06T09:03:37Z

Yes, but I think those are already tested in the test_metadata_routing.py. We haven't had an issue yet which comes from having a mix of subestimators that are consumers and non-consumers.

adrinjalali · 2024-02-06T09:04:17Z

cc @OmarManzoor @thomasjpfan for a quick review maybe? This is for the coming patch release.

OmarManzoor

LGTM. Thanks @adrinjalali

…learn#28240)

adrinjalali added 2 commits January 24, 2024 14:45

FIX MultiOutput* when sub-estimator does not accept metadata

fcbcaf5

Merge remote-tracking branch 'upstream/main' into fix/multioutput

13033c7

adrinjalali added this to the 1.4.1 milestone Jan 24, 2024

adrinjalali added 2 commits January 24, 2024 14:51

add changelog

fedee9a

non consuming estimators don't need a registry

cbbdf4e

glemaitre self-requested a review January 24, 2024 16:12

glemaitre reviewed Jan 24, 2024

View reviewed changes

sklearn/multioutput.py Show resolved Hide resolved

glemaitre reviewed Jan 24, 2024

View reviewed changes

sklearn/multioutput.py Show resolved Hide resolved

Merge remote-tracking branch 'upstream/main' into fix/multioutput

4be2d62

Merge remote-tracking branch 'upstream/main' into fix/multioutput

76e0a80

glemaitre self-requested a review February 1, 2024 17:26

Merge branch 'main' into fix/multioutput

035441f

glemaitre approved these changes Feb 2, 2024

View reviewed changes

StefanieSenger mentioned this pull request Feb 5, 2024

FEA Metadata routing for VotingClassifier and VotingRegressor #27584

Merged

OmarManzoor approved these changes Feb 6, 2024

View reviewed changes

OmarManzoor merged commit 7d8a9de into scikit-learn:main Feb 6, 2024

adrinjalali deleted the fix/multioutput branch February 6, 2024 14:18

glemaitre pushed a commit to glemaitre/scikit-learn that referenced this pull request Feb 10, 2024

FIX MultiOutput* when sub-estimator does not accept metadata (scikit-…

59a8a72

…learn#28240)

glemaitre pushed a commit to glemaitre/scikit-learn that referenced this pull request Feb 13, 2024

FIX MultiOutput* when sub-estimator does not accept metadata (scikit-…

04e9e4f

…learn#28240)

glemaitre pushed a commit that referenced this pull request Feb 13, 2024

FIX MultiOutput* when sub-estimator does not accept metadata (#28240)

c3ae535

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIX MultiOutput* when sub-estimator does not accept metadata #28240

FIX MultiOutput* when sub-estimator does not accept metadata #28240

adrinjalali commented Jan 24, 2024

github-actions bot commented Jan 24, 2024 •

edited

Loading

glemaitre commented Jan 24, 2024

adrinjalali commented Jan 25, 2024

adrinjalali commented Feb 2, 2024

glemaitre left a comment

StefanieSenger commented Feb 5, 2024

adrinjalali commented Feb 5, 2024

StefanieSenger commented Feb 5, 2024 •

edited

Loading

adrinjalali commented Feb 6, 2024

adrinjalali commented Feb 6, 2024

OmarManzoor left a comment

FIX MultiOutput* when sub-estimator does not accept metadata #28240

FIX MultiOutput* when sub-estimator does not accept metadata #28240

Conversation

adrinjalali commented Jan 24, 2024

github-actions bot commented Jan 24, 2024 • edited Loading

✔️ Linting Passed

glemaitre commented Jan 24, 2024

adrinjalali commented Jan 25, 2024

adrinjalali commented Feb 2, 2024

glemaitre left a comment

Choose a reason for hiding this comment

StefanieSenger commented Feb 5, 2024

adrinjalali commented Feb 5, 2024

StefanieSenger commented Feb 5, 2024 • edited Loading

adrinjalali commented Feb 6, 2024

adrinjalali commented Feb 6, 2024

OmarManzoor left a comment

Choose a reason for hiding this comment

github-actions bot commented Jan 24, 2024 •

edited

Loading

StefanieSenger commented Feb 5, 2024 •

edited

Loading