FIX `CalibratedClassifierCV` should not ignore `sample_weight` if estimator does not support it #21143
Conversation
Now that I am looking at the calibration code, it seems that we intend to raise a warning (which I ignore while doing my primary tests). @lucyleeow do you remember why it was not controversial to fit a model that discards `sample_weight`?
I think the intention when I refactored this function was to keep all functionality the same, and for any fixes/changes to be done afterwards, separately (not that I can remember any fixes..!). Looking through git blame, it seems that the warning about when sample weight is ignored was added here: 70d49de. And it seems this ignoring of sample weight originates from the start? ecfc93d:

```python
for train, test in cv:
    this_estimator = clone(self.base_estimator)
    if sample_weight is not None and \
            "sample_weight" in inspect.getargspec(
                this_estimator.fit)[0]:
        this_estimator.fit(X[train], y[train],
                           sample_weight[train])
    else:
        this_estimator.fit(X[train], y[train])
```
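For reference, the modern equivalent of that `inspect.getargspec` check uses `sklearn.utils.validation.has_fit_parameter`. The sketch below mirrors the per-fold logic discussed here; the helper name `fit_fold` is made up for illustration and is not the actual code in `calibration.py`:

```python
import warnings

from sklearn.base import clone
from sklearn.utils.validation import has_fit_parameter


def fit_fold(estimator, X, y, train, sample_weight=None):
    """Fit a clone of ``estimator`` on one CV fold, forwarding weights if possible."""
    this_estimator = clone(estimator)
    if sample_weight is not None and has_fit_parameter(this_estimator, "sample_weight"):
        this_estimator.fit(X[train], y[train], sample_weight=sample_weight[train])
    else:
        if sample_weight is not None:
            # This silent drop (plus a warning) is exactly what the issue objects to.
            warnings.warn(
                f"Since {type(this_estimator).__name__}.fit does not accept "
                "sample_weight, the weights are ignored for this fold."
            )
        this_estimator.fit(X[train], y[train])
    return this_estimator
```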
Thanks @lucyleeow for the insights. I doubt that this is a good strategy, though. I will raise this issue at the next dev meeting.
> `CalibratedClassifierCV` should not ignore `sample_weight` if estimator does not support it
I am not sure if passing `sample_weight` here is correct. Maybe the best way to check would be to consider 2 datasets: one where a given sample is duplicated, and one where that same sample is instead given a weight of 2. Intuitively, we would like that calling `fit` with `sample_weight` on the second dataset be equivalent (in expectation) to calling `fit` on the first.

We could have a similar test to check that dropping a sample is equivalent to setting its weight to 0. There is a common test for this latter semantics, but it is XFAILing for `CalibratedClassifierCV`: https://github.com/scikit-learn/scikit-learn/blob/main/sklearn/calibration.py#L437-L440

So I think we should at least add dedicated tests for the handling of `sample_weight` in `CalibratedClassifierCV`. For cases that are not working, we should make sure that we include them as part of the PR prototype for SLEP006 on meta-data routing, e.g. as part of #20350. Whether or not we should keep raising a warning in the meantime is open for discussion.
Thinking a bit more about this, it might be challenging to test with a limited computational budget because the cross-validation strategy might be non-deterministic and the "in expectation" part would require a statistical test. To simplify the problem we could make sure that we run this test with a simplistic, deterministic CV loop (simple 3- or 5-fold CV without shuffling or stratification) and put the duplicated samples and the sample with weight 2 in the same position (e.g. in the last CV fold in both cases). The same strategy could be adapted to check the 0 weight / sample drop equivalence.
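A minimal sketch of that deterministic variant, assuming the current `CalibratedClassifierCV` API: hand-written 2-fold splits are passed via `cv` so that both fits see the same folds, `LogisticRegression` is used as the base estimator, and the tolerance is illustrative only.

```python
import numpy as np
from sklearn.calibration import CalibratedClassifierCV
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=100, random_state=0)

# Dataset A: the last sample carries a weight of 2, everything else weight 1.
sample_weight = np.ones(len(X))
sample_weight[-1] = 2.0

# Dataset B: the same data, but the last sample is physically duplicated.
X_dup = np.vstack([X, X[-1:]])
y_dup = np.concatenate([y, y[-1:]])

# Hand-written, deterministic 2-fold splits so that the duplicate (index 100)
# always travels with the original sample (index 99).
fold0, fold1 = np.arange(50), np.arange(50, 100)
splits_weighted = [(fold0, fold1), (fold1, fold0)]
fold1_dup = np.concatenate([fold1, [100]])
splits_dup = [(fold0, fold1_dup), (fold1_dup, fold0)]

clf_weighted = CalibratedClassifierCV(LogisticRegression(), cv=splits_weighted)
clf_weighted.fit(X, y, sample_weight=sample_weight)

clf_dup = CalibratedClassifierCV(LogisticRegression(), cv=splits_dup)
clf_dup.fit(X_dup, y_dup)

# If the weights are propagated correctly to both the base estimators and the
# calibrators, the two models should agree up to optimizer tolerance.
np.testing.assert_allclose(
    clf_weighted.predict_proba(X), clf_dup.predict_proba(X), atol=1e-3
)
```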
I was indeed starting to set up a 2-fold cross-validation with iris (only the first 2 classes), where it would be easy to check the underlying weights of the classifier and the parameters of the calibrator, to understand exactly what we are doing with the weights.
We can postpone this PR until we have proper dispatching with sample props.
Solved by using meta-data routing.
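For context, a minimal sketch of how the routing-based resolution looks, assuming a scikit-learn version with metadata routing (1.4 or later) and synthetic data; the key point is that the base estimator explicitly requests `sample_weight` instead of `CalibratedClassifierCV` deciding silently:

```python
import numpy as np
import sklearn
from sklearn.calibration import CalibratedClassifierCV
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Opt in to metadata routing (off by default).
sklearn.set_config(enable_metadata_routing=True)

X, y = make_classification(n_samples=200, random_state=0)
sample_weight = np.random.default_rng(0).uniform(0.5, 2.0, size=len(X))

# The base estimator declares that it wants to receive sample_weight in fit.
base = LogisticRegression().set_fit_request(sample_weight=True)

# CalibratedClassifierCV now routes the weights to the base estimator as
# requested instead of silently dropping them.
calibrated = CalibratedClassifierCV(base).fit(X, y, sample_weight=sample_weight)
```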
Partially addresses #21134. Forces an error to be raised when a `Pipeline` is included in meta-estimators, so that `sample_weight` is not silently ignored. In the future, we should address #18159; the test should then no longer raise an error and should instead delegate the weights to the right estimator in the `Pipeline`.
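A sketch of that future behaviour, assuming a scikit-learn version where both `CalibratedClassifierCV` and `Pipeline` support metadata routing: the final step requests `sample_weight`, so the weights can be delegated through the `Pipeline` to the right estimator instead of raising an error.

```python
import numpy as np
import sklearn
from sklearn.calibration import CalibratedClassifierCV
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

sklearn.set_config(enable_metadata_routing=True)

X, y = make_classification(n_samples=200, random_state=0)
sample_weight = np.ones(len(X))

# Only the final classifier requests the weights; the scaler does not.
pipe = Pipeline(
    [
        ("scaler", StandardScaler()),
        ("clf", LogisticRegression().set_fit_request(sample_weight=True)),
    ]
)

# With routing enabled, the weights flow CalibratedClassifierCV -> Pipeline -> clf.
CalibratedClassifierCV(pipe).fit(X, y, sample_weight=sample_weight)
```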