FIX Add error when LeaveOneOut used in CalibratedClassifierCV #29545
Conversation
    assert np.all(proba[:, :i] > 0)
    assert np.all(proba[:, i + 1 :] > 0)
else:
    # Check `proba` are all 1/n_classes
Note that `proba` was 1/n_classes here because the original test data was unique (it consisted of 10 samples belonging to 10 classes); this was not really related to the train subset not containing all classes. I think the estimator ended up being overfit and the calibrator did not respond well to the low predicted values, calibrating them all to the same value. Note that `proba` was the same value even before normalization of the probabilities.
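For context, here is a rough, hypothetical sketch of the kind of degenerate setup described above (10 samples, each in its own class, with an easily overfit base estimator). It uses a plain KFold splitter rather than LeaveOneOut, which this PR now rejects, and whether the calibrated probabilities collapse exactly to 1/n_classes depends on the base estimator and calibration method:

import numpy as np
from sklearn.calibration import CalibratedClassifierCV
from sklearn.model_selection import KFold
from sklearn.tree import DecisionTreeClassifier

# 10 samples, each belonging to its own class, so every CV train split
# necessarily misses several classes and the tree overfits its train subset.
rng = np.random.RandomState(7)
X = rng.randn(10, 5)
y = np.arange(10)

cal_clf = CalibratedClassifierCV(
    DecisionTreeClassifier(random_state=7),
    method="sigmoid",
    cv=KFold(n_splits=3),
    ensemble=True,
)
cal_clf.fit(X, y)

# Inspect whether the per-class calibrators map every decision value to
# (roughly) the same output, i.e. rows close to 1 / n_classes.
print(cal_clf.predict_proba(X))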
I wonder if we even have to check `ensemble=False`, as we use `cross_val_predict` to get the predictions used to calibrate, and at predict time only one estimator is fit using all the data.
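For illustration, here is a minimal sketch of that idea using only the public API (this is not scikit-learn's internal code, and the dataset and estimator are arbitrary placeholders): with ensemble=False, out-of-fold predictions in the spirit of cross_val_predict feed a single calibrator, and the estimator used at predict time is refit on all the data.

from sklearn.calibration import CalibratedClassifierCV
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_predict

X, y = make_classification(
    n_samples=200, n_classes=3, n_informative=6, random_state=0
)

# ensemble=False: one calibrator fit on out-of-fold predictions,
# one final estimator refit on the full training set for predict().
cal = CalibratedClassifierCV(LogisticRegression(max_iter=1000), cv=5, ensemble=False)
cal.fit(X, y)

# The out-of-fold predictions feeding the calibrator are conceptually the
# same kind of predictions that cross_val_predict returns.
oof_proba = cross_val_predict(
    LogisticRegression(max_iter=1000), X, y, cv=5, method="predict_proba"
)
print(oof_proba.shape, cal.predict_proba(X).shape)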
sklearn/calibration.py (Outdated)
if isinstance(self.cv, LeaveOneOut):
    raise ValueError(
        "LeaveOneOut cross-validation does not allow "
        "all classes to be present in test splits."
    )
Let's be explicit by asking people to use an alternative cross-validation strategy.
Amended. Hopefully not too long now?
Nope, this is good.
@@ -441,27 +456,30 @@ def test_calibration_prob_sum(ensemble):
def test_calibration_less_classes(ensemble):
    # Test to check calibration works fine when train set in a test-train
This is a use case where I'm really wondering whether it is valid :). But it is already here, so let's go with it.
Regarding the scope of the PR, I'm happy to already have this in the codebase. I opened a related PR regarding the cross-validation and the fact that we can get a subset of classes in training. I'm not sure this is something we should allow, but it requires much more discussion and I might be overlooking some aspects.
I'll add this PR to the 1.5.2 milestone.
LGTM
Reference Issues/PRs
closes #29000
What does this implement/fix? Explain your changes.
Adds an error when LeaveOneOut is used in CalibratedClassifierCV, since LeaveOneOut cross-validation does not allow all classes to be present in test splits.
Any other comments?
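One possible alternative for users hitting the new error, shown purely for illustration (the amended error message may suggest different wording or strategies), is a splitter whose test folds can contain every class, e.g. StratifiedKFold:

from sklearn.calibration import CalibratedClassifierCV
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold

X, y = make_classification(n_samples=100, random_state=0)
cal = CalibratedClassifierCV(
    LogisticRegression(max_iter=1000), cv=StratifiedKFold(n_splits=5)
)
cal.fit(X, y)
print(cal.predict_proba(X[:3]))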