
FEA add temperature scaling to CalibratedClassifierCV #31068


Open
wants to merge 19 commits into main

Conversation

@virchan virchan (Member) commented Mar 25, 2025

Reference Issues/PRs

Closes #28574

What does this implement/fix? Explain your changes.

This PR adds temperature scaling to scikit-learn's CalibratedClassifierCV:

Temperature scaling can be enabled by setting method = "temperature" in CalibratedClassifierCV:

from sklearn.calibration import CalibratedClassifierCV
from sklearn.datasets import make_classification
from sklearn.frozen import FrozenEstimator
from sklearn.model_selection import train_test_split
from sklearn.svm import LinearSVC

X, y = make_classification(random_state=42)

X_train, X_calib, y_train, y_calib = train_test_split(X, y, random_state=42)

clf = LinearSVC(random_state=42)
clf.fit(X_train, y_train)

# Fit only the temperature calibrator on the held-out calibration split.
cal_clf = CalibratedClassifierCV(FrozenEstimator(clf), method="temperature")
cal_clf.fit(X_calib, y_calib)

This method supports both binary and multi-class classification.
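
For reference, temperature scaling rescales the logits by a single learned inverse temperature beta = 1 / T before applying the softmax. A minimal NumPy sketch of this transformation (values are illustrative, not taken from the implementation):

import numpy as np
from scipy.special import softmax

# logits: decision_function output, shape (n_samples, n_classes).
logits = np.array([[2.0, 0.5, -1.0], [0.2, 0.1, 0.0]])
beta = 0.5  # learned inverse temperature, i.e. temperature T = 1 / beta = 2

# beta < 1 softens over-confident probabilities; beta > 1 sharpens them.
calibrated_proba = softmax(beta * logits, axis=1)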

Any other comments?

Cc @adrinjalali, @lorentzenchr in advance.

github-actions bot commented Mar 25, 2025

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 1a9e307.

@virchan virchan (Member Author) left a comment

A follow-up to my comment on the Array API: I don't think we can support the Array API here, as scipy.optimize.minimize does not appear to support it.

If I missed anything, please let me know—I'd be happy to investigate further.

@@ -401,6 +413,44 @@ def test_sigmoid_calibration():
_SigmoidCalibration().fit(np.vstack((exF, exF)), exY)


def test_temperature_scaling(data):
Member Author

This test verifies that temperature scaling does not affect accuracy and that the optimised temperature is always positive.

I also noticed that the Brier score may improve or worsen depending on the dataset and the classifier being calibrated. Therefore, I did not include temperature scaling in the test_calibration function.

This seems to align with the remark made on page 3245 of "Classifier calibration: a survey on how to assess and improve predicted class probabilities".

If there are any test cases I should add, please let me know—I’d be happy to include them.

Member

I think (single-parameter) temperature scaling provably improves the Brier score (or log loss) by reducing its calibration error term without changing the refinement error term, but only on datasets/tasks with very specific structure, e.g. a balanced binary classification problem where the class means are symmetric around zero and the covariances are identical, as explained in section 5 of https://arxiv.org/html/2501.19195v1#S5.

We could write a test based on this analysis.
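
A rough sketch of such a test, under the assumptions of that analysis (balanced classes, class means symmetric around zero, identical covariances); the deliberately over-confident base model, the FrozenEstimator usage, and the tolerance are illustrative choices, not part of this PR:

import numpy as np
from sklearn.calibration import CalibratedClassifierCV
from sklearn.frozen import FrozenEstimator
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import brier_score_loss
from sklearn.model_selection import train_test_split


def test_temperature_scaling_brier_on_symmetric_gaussians():
    rng = np.random.RandomState(0)
    n_samples, n_features = 2000, 5
    # Balanced binary problem: means symmetric around zero, identical covariances.
    X_pos = rng.normal(loc=1.0, scale=2.0, size=(n_samples // 2, n_features))
    X_neg = rng.normal(loc=-1.0, scale=2.0, size=(n_samples // 2, n_features))
    X = np.vstack([X_pos, X_neg])
    y = np.concatenate([np.ones(n_samples // 2), np.zeros(n_samples // 2)])
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    # Fit an over-confident model on a small subsample, then temperature-scale
    # it on the remaining (held-out) training data.
    clf = LogisticRegression(C=1e4).fit(X_train[:50], y_train[:50])
    cal_clf = CalibratedClassifierCV(
        FrozenEstimator(clf), method="temperature"
    ).fit(X_train[50:], y_train[50:])

    brier_raw = brier_score_loss(y_test, clf.predict_proba(X_test)[:, 1])
    brier_cal = brier_score_loss(y_test, cal_clf.predict_proba(X_test)[:, 1])

    # Per the cited analysis, scaling should reduce (or at least not increase)
    # the calibration term of the Brier score on this kind of data.
    assert brier_cal <= brier_raw + 1e-3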

@virchan virchan marked this pull request as ready for review March 25, 2025 10:55
@ogrisel ogrisel (Member) left a comment

Thanks for the PR. Here is a first pass of feedback:

negative_log_likelihood,
np.array([beta_0]),
args=(logits, labels, max_logits),
method="L-BFGS-B",
Member

In cases where there is a single-element parameter array, it is probably much more efficient to use a dedicated scalar optimizer as discussed in #28574 (comment) and subsequent comments.

Member

But this would deserve some benchmarking to confirm.

Member Author

I ended up choosing minimize over minimize_scalar, even though it's more expensive (e.g., it requires computing the gradient).

This is because minimize_scalar doesn't let us provide the initial guess beta_0 = 1.0 when optimising the inverse temperature beta with method="bounded". Even with method="brent", it expects a bracket (xa, xb, xc) whose middle point satisfies func(xb) < func(xa) and func(xb) < func(xc), which is hard to determine beforehand when fitting the calibrator.
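
A small self-contained sketch of the difference (the quadratic stand-in loss is illustrative, not the actual NLL):

import numpy as np
from scipy.optimize import minimize, minimize_scalar

def nll(beta):
    # Stand-in for the temperature-scaling negative log-likelihood in beta.
    return np.sum((np.asarray(beta) - 2.5) ** 2)

# minimize: accepts an explicit initial guess beta_0 = 1.0, at the cost of a
# gradient-based method on a one-element parameter array.
res_lbfgsb = minimize(nll, x0=np.array([1.0]), method="L-BFGS-B")

# minimize_scalar, method="bounded": no initial guess, only bounds.
res_bounded = minimize_scalar(nll, bounds=(0.1, 10.0), method="bounded")

# minimize_scalar, method="brent": a bracket (xa, xb, xc) must satisfy
# nll(xb) < nll(xa) and nll(xb) < nll(xc), which is hard to guarantee up front.
res_brent = minimize_scalar(nll, bracket=(0.1, 1.0, 10.0), method="brent")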

@ogrisel ogrisel (Member) commented Apr 15, 2025

Why not pass bracket=(.5, 2) or bracket=(0.1, 10.) since we work on a multiplicative scale around the inverse temperature beta_0 = 1.0?

EDIT: thinking about this, we could re-parametrize as theta = log(beta) (hence beta = exp(theta)) and set bracket=(-1, 1): this way, bisections / additive increments in theta space naturally map to multiplicative updates in the beta / temperature space, which could lead to fewer solver iterations.

I think we should add a verbose parameter to CalibratedClassifierCV to display convergence information (and the number of iterations) of the underlying solvers.
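
A minimal sketch of that re-parametrization, with a stand-in loss in place of the real NLL:

import numpy as np
from scipy.optimize import minimize_scalar

def beta_loss(beta):
    # Stand-in for the temperature-scaling loss in the inverse temperature beta.
    return (np.log(beta) - 0.7) ** 2

# theta = log(beta), so beta = exp(theta): additive solver steps in theta map
# to multiplicative updates of beta, and bracket=(-1, 1) corresponds to
# beta in (1/e, e) around the initial guess beta_0 = 1.
res = minimize_scalar(lambda theta: beta_loss(np.exp(theta)), bracket=(-1, 1))
beta_hat = np.exp(res.x)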

Parameters
----------
predictions : ndarray of shape (n_samples,) or (n_samples, n_classes)
The decision function or predict proba for the samples.
@ogrisel ogrisel (Member) commented Mar 25, 2025

Let's be explicit if we expect one or the other and update the code (e.g. variable names) accordingly.

Here I have the impression that this code always expects logits (that is, the output of decision_function and never the output of predict_proba). If the latter, we would need to take the log of it (plus an epsilon to avoid NaNs).

This info is originally present in the method_name variable of:

method_name = _check_response_method(
this_estimator,
["decision_function", "predict_proba"],
).__name__

I think _fit_calibrator should propagate this info to the calibrator to avoid any ambiguity.

Maybe we should also undo pre-processing such as:

if method_name == "predict_proba":
# Select the probability column of the positive class
predictions = _process_predict_proba(
y_pred=predictions,
target_type="binary",
classes=self.classes_,
pos_label=self.classes_[1],
)
predictions = predictions.reshape(-1, 1)

and let the calibrator do what it finds most suitable with either kind of input.

Maybe the calibrators could expose explicit tags stating whether they can handle multiclass natively, or whether the caller should be in charge of reducing a multiclass calibration problem into binary calibration sub-problems with the OvR hack we currently implement for isotonic calibration and Platt scaling.

EDIT: the existing calibrator.__sklearn_tags__().input_tags might be enough to express this:

tags = calibrator.__sklearn_tags__()
supports_multiclass_logits = tags.input_tags.two_d_array and not tags.input_tags.one_d_array

as we have the following for IsotonicRegression:

def __sklearn_tags__(self):
tags = super().__sklearn_tags__()
tags.input_tags.one_d_array = True
tags.input_tags.two_d_array = False
return tags

The _SigmoidCalibration calibrator can be updated accordingly.

Member

Alternatively, we can rename the ambiguous "predictions" variable everywhere to "logits" and take the log of predict_proba + eps as early as possible.

But this would introduce a behavior change.
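
A minimal sketch of that conversion (the estimator and variable names are illustrative):

import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(random_state=0)
proba = LogisticRegression().fit(X, y).predict_proba(X)

# Map predict_proba output onto the log scale, guarding against log(0).
eps = np.finfo(proba.dtype).eps
logits = np.log(proba + eps)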



return beta_minimizer.x[0]


class _SigmoidCalibration(RegressorMixin, BaseEstimator):
"""Sigmoid regression model.
Member

We should override __sklearn_tags__ to set the same input tags as IsotonicRegression:

def __sklearn_tags__(self):
tags = super().__sklearn_tags__()
tags.input_tags.one_d_array = True
tags.input_tags.two_d_array = False
return tags

See:

one_d_array : bool, default=False
Whether the input can be a 1D array.
two_d_array : bool, default=True
Whether the input can be a 2D array. Note that most common
tests currently run only if this flag is set to ``True``.

virchan added 4 commits March 27, 2025 18:14
…fier`.

Updated constructor of `_TemperatureScaling` class.
Updated `test_temperature_scaling` in `test_calibration.py`.
Added `__sklearn_tags__` to `_TemperatureScaling` class.
@virchan virchan (Member Author) left a comment

I'm still working on addressing the feedback, but I also wanted to share some findings related to it and provide an update.


@lorentzenchr lorentzenchr (Member) left a comment

A few computational things seem off.


virchan added 2 commits April 25, 2025 22:16
Update `minimize` in `_temperature_scaling` to `minimize_scalar`.
Update `test_calibration.py` to check that the optimised inverse temperature is between 0.1 and 10.
@virchan virchan (Member Author) left a comment

There are some CI failures; I'll fix those shortly.

I'm also considering adding a verbose parameter to CalibratedClassifierCV to optionally display convergence info when optimising the inverse temperature beta.
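
A rough sketch of what that could look like internally (the helper and its parameters are hypothetical, not code from this PR):

from scipy.optimize import minimize_scalar

def _fit_inverse_temperature(beta_loss, verbose=0):
    # Stand-in for the internal fit of the temperature calibrator.
    res = minimize_scalar(beta_loss, bounds=(0.1, 10.0), method="bounded")
    if verbose:
        print(
            f"temperature scaling: success={res.success}, "
            f"nfev={res.nfev}, beta={res.x:.4f}"
        )
    return res.x

# Example: a toy loss with its minimum at beta = 2.
_fit_inverse_temperature(lambda beta: (beta - 2.0) ** 2, verbose=1)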


return l.sum()

beta_minimizer = minimize_scalar(beta_loss, bounds=(0.1, 10.0), method="bounded")
Member Author

> Why not pass bracket=(.5, 2) or bracket=(0.1, 10.) since we work on a multiplicative scale around inverse temperature beta_0=1.0?

I think this might work: I've set bounds=(0.1, 10.0) and used the test_temperature_scaling test function to confirm that the optimised beta stays within that range.

That said, I do wonder whether users might ask for a way to set a custom initial guess beta_0 when performing temperature scaling.

Comment on lines +1093 to +1104
# Ensure raw_prediction has the same dtype as labels using .astype().
# Without this, dtype promotion rules differ across NumPy versions:
#
# beta = np.float64(0)
# logits = np.array([1, 2], dtype=np.float32)
#
# result = beta * logits
# - NumPy < 2: result.dtype is float32
# - NumPy 2+: result.dtype is float64
#
# This can cause dtype mismatch errors downstream (e.g., buffer dtype).
raw_prediction = xp.astype(beta * logits, dtype_)
Member Author

> And the logits should already have a suitable dtype.

I had to explicitly set the dtype here; otherwise scipy.optimize.minimize_scalar fails when (beta * logits).dtype == np.float32.

This issue is caught by our own test_float32_predict_proba.

I noticed that the sigmoid calibration code handles the same issue, so I reused its fix and comment for consistency.

Happy to adjust if I missed anything!
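
For reference, the promotion difference can be reproduced directly:

import numpy as np

beta = np.float64(0.5)
logits = np.array([1.0, 2.0], dtype=np.float32)

# NumPy < 2 (value-based casting): float32.  NumPy >= 2 (NEP 50): float64.
print((beta * logits).dtype)

# Casting explicitly keeps the dtype consistent across NumPy versions.
raw_prediction = (beta * logits).astype(logits.dtype)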

Development

Successfully merging this pull request may close these issues:

Implement temperature scaling for (multi-class) calibration