Fix RuntimeWarning: invalid value encountered in test_calibration.py #19421


Merged
merged 20 commits into scikit-learn:main from t-kusanagi:fix_test_calibration
Feb 11, 2021

Conversation

t-kusanagi
Contributor

Reference Issues/PRs

#19334

What does this implement/fix? Explain your changes.

The RuntimeWarning was caused by the _CalibratedClassifier.predict_proba function.
The warning occurs when np.sum(proba, axis=1)[:, np.newaxis] has some zero elements.
So I use np.divide to avoid it; without the out parameter, we would still get a warning from assert_allclose(...) in test_calibration_multiclass (see the sketch after the note below).

Note:

Things I've tried that don't work:

  • Use with np.errstate(divide='ignore')
    • With pytest, we still get the warning.
  • np.divide(proba, denominator, where=denominator != 0.0) (without the out parameter)
    • The warning about zero division is resolved, but another warning is raised from assert_allclose(...) in test_calibration_multiclass.
      • proba[np.isnan(proba)] = 1. / n_classes is not enough in this case: some of the values do not become NaN, so the rows of proba cannot sum to 1.0.
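
For illustration, a minimal standalone sketch of the np.divide approach described above (variable names mirror the discussion; the exact code in the PR may differ):

import numpy as np

# Toy input: three samples, one of which has all-zero per-class probabilities,
# the case that previously raised "RuntimeWarning: invalid value encountered".
n_classes = 3
proba = np.array([[0.2, 0.5, 0.3],
                  [0.0, 0.0, 0.0],
                  [0.1, 0.1, 0.8]])

denominator = np.sum(proba, axis=1)[:, np.newaxis]

# Pre-fill the output with the uniform distribution so that rows with a zero
# denominator fall back to 1 / n_classes, and only divide where the
# denominator is non-zero: no 0/0 is ever evaluated, so no warning is raised
# and there is no NaN left to patch up afterwards.
uniform_proba = np.full_like(proba, 1.0 / n_classes)
proba = np.divide(proba, denominator, out=uniform_proba,
                  where=denominator != 0)

print(proba)  # the all-zero row has become [1/3, 1/3, 1/3]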

@jeremiedbb
Member

The reason why np.divide(proba, denominator, where=denominator != 0.0) did not work is because of the next line in the code:

# XXX : for some reason all probas can be 0
proba[np.isnan(proba)] = 1. / n_classes

Since you no longer divide by 0, you don't get NaN values there, so that replacement line does nothing and the probas end up unnormalized.
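
A small standalone illustration of that point (not taken from the PR): with where= but no out=, NumPy allocates the result with empty memory and simply skips the masked entries, so they hold arbitrary values rather than NaN, and the NaN-replacement line has nothing to fix.

import numpy as np

proba = np.zeros((1, 3))                             # all-zero probabilities
denominator = np.sum(proba, axis=1)[:, np.newaxis]   # == 0

# The skipped entries stay uninitialized (arbitrary memory), not NaN.
result = np.divide(proba, denominator, where=denominator != 0)

# Hence the fallback below almost never triggers and the row usually
# does not sum to 1.
result[np.isnan(result)] = 1.0 / 3
print(result.sum(axis=1))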

Co-authored-by: Jérémie du Boisberranger <34657725+jeremiedbb@users.noreply.github.com>
@ogrisel
Member

ogrisel commented Feb 10, 2021

Interesting, I did not know about the where parameter of np.divide. I agree with @jeremiedbb that we can collapse the code to remove any np.nan occurrence from the start.

If this is not already the case, we could probably add a specific test to "calibrate" such a badly behaving classifier that always predicts zeros in predict_proba (a sketch of such a classifier follows the list below) and check that:

  • we get uniform 1/n_classes probabilities as a result;
  • we do not get a warning, using sklearn.utils._testing.assert_no_warnings.
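
For reference, such a degenerate classifier could be sketched as follows (the class name is hypothetical, not from the PR):

import numpy as np
from sklearn.base import BaseEstimator, ClassifierMixin


class ZeroProbaClassifier(ClassifierMixin, BaseEstimator):
    """Badly behaving classifier whose predict_proba is always all zeros."""

    def fit(self, X, y):
        self.classes_ = np.unique(y)
        return self

    def predict_proba(self, X):
        # Deliberately invalid: every row sums to 0 instead of 1.
        return np.zeros((len(X), len(self.classes_)))

    def predict(self, X):
        return np.full(len(X), self.classes_[0])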

@t-kusanagi
Contributor Author

The reason why np.divide(proba, denominator, where=denominator != 0.0) did not work is because of the next line in the code:

# XXX : for some reason all probas can be 0
proba[np.isnan(proba)] = 1. / n_classes

Since you no longer divide by 0, you don't get NaN values there, so that replacement line does nothing and the probas end up unnormalized.

Thanks, I had overlooked that this replacement step only handles the division-by-zero case.

@jeremiedbb
Member

we do not get a warning, using sklearn.utils._testing.assert_no_warnings.

@ogrisel I don't think we want to check that no warning is issued. We don't want to check that every line of code doesn't raise a warning. I think that the fact that the warning disappears with this PR is enough.

@ogrisel
Member

ogrisel commented Feb 10, 2021

I think that the fact that the warning disappears with this PR is enough.

Alright, but don't we want to explicitly check that we get uniform probabilities when the base classifier predicts only zeros?

@jeremiedbb
Member

jeremiedbb commented Feb 10, 2021

Alright, but don't we want to explicitly check that we get uniform probabilities when the base classifier predicts only zeros?

I didn't argue against that :)

@t-kusanagi
Contributor Author

I didn't argue against that :)

OK, now I'm going to add this test.

@t-kusanagi
Contributor Author

OK, now I'm going to add this test.

Since it is difficult to control the behavior of the CalibratedClassifierCV class and the test code would become large, I added test_calibration_zero_probability, which is just a test for _CalibratedClassifier.predict_proba.
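
Roughly, such a test can be sketched as below. The ZeroCalibrator helper is hypothetical, and the constructor arguments of the private _CalibratedClassifier (base_estimator, calibrators, classes) are an assumption that may not match every scikit-learn version; this is an illustration of the idea, not the exact test that was merged.

import numpy as np
from numpy.testing import assert_allclose
from sklearn.calibration import _CalibratedClassifier
from sklearn.datasets import make_blobs
from sklearn.dummy import DummyClassifier


class ZeroCalibrator:
    # Calibrator stub: the calibrated probability for every sample is 0,
    # so the per-class probabilities summed in predict_proba are all zero.
    def predict(self, X):
        return np.zeros(X.shape[0])


def test_calibration_zero_probability():
    X, y = make_blobs(n_samples=50, n_features=10, centers=10, random_state=7)
    clf = DummyClassifier().fit(X, y)

    cal_clf = _CalibratedClassifier(
        base_estimator=clf,
        calibrators=[ZeroCalibrator() for _ in clf.classes_],
        classes=clf.classes_,
    )

    proba = cal_clf.predict_proba(X)

    # Every probability should fall back to the uniform 1 / n_classes.
    assert_allclose(proba, 1.0 / clf.classes_.shape[0])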

Member

@ogrisel left a comment

Thanks for the new test. Here are some suggestions to improve it a bit. Otherwise LGTM.

t-kusanagi and others added 4 commits February 10, 2021 23:00
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
Member

@ogrisel left a comment

LGTM once @jeremiedbb's remaining comments have been addressed.

t-kusanagi and others added 3 commits February 11, 2021 10:13
Co-authored-by: Jérémie du Boisberranger <34657725+jeremiedbb@users.noreply.github.com>
Co-authored-by: Jérémie du Boisberranger <34657725+jeremiedbb@users.noreply.github.com>
Co-authored-by: Jérémie du Boisberranger <34657725+jeremiedbb@users.noreply.github.com>
t-kusanagi and others added 3 commits February 11, 2021 10:15
Co-authored-by: Jérémie du Boisberranger <34657725+jeremiedbb@users.noreply.github.com>
Co-authored-by: Jérémie du Boisberranger <34657725+jeremiedbb@users.noreply.github.com>
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
Member

@jeremiedbb left a comment

lgtm. Thanks @t-kusanagi

@jeremiedbb merged commit 6489daf into scikit-learn:main on Feb 11, 2021
@t-kusanagi deleted the fix_test_calibration branch on February 11, 2021 09:24
@ogrisel
Member

ogrisel commented Feb 11, 2021

Thank you very much for the fix and the nice new test @t-kusanagi.

@ogrisel
Member

ogrisel commented Feb 11, 2021

ping @lucyleeow to keep you in the loop on your favorite meta-estimator :)

@glemaitre mentioned this pull request on Apr 22, 2021