[MRG] FIX Correct brier_score_loss when there's only one class in y_true #13628

qinhanmin2014 · 2019-04-12T14:38:20Z

closes #8459, closes #9300, closes #9301, closes #9562, closes #11245
Correct brier_score_loss when there's only one class in y_true.
This is an ugly solution, but will fix existing bugs, and avoid backward incompatibility.
Remove some redundant code in calibration_curve and brier_score_loss.

qinhanmin2014 · 2019-04-13T03:25:08Z

ping @jnothman

jnothman

Okay. We could also raise an error if it's all not 1 and pos_label is not specified. Maybe in the future?

Add what's new?

jnothman · 2019-04-15T01:12:36Z

sklearn/calibration.py


    if normalize:  # Normalize predicted values into interval [0, 1]
        y_prob = (y_prob - y_prob.min()) / (y_prob.max() - y_prob.min())
    elif y_prob.min() < 0 or y_prob.max() > 1:
        raise ValueError("y_prob has values outside [0, 1] and normalize is "
                         "set to False.")

-    y_true = _check_binary_probabilistic_predictions(y_true, y_prob)


why are we no longer validating y_prob?

y_prob is already validated above.

jnothman · 2019-04-15T01:15:04Z

sklearn/metrics/classification.py

    if pos_label is None:
-        pos_label = y_true.max()
+        if (np.array_equal(labels, [0, 1]) or


Share this code with _binary_clf_curve? And document the behaviour?

This seems difficult since the definition of pos_label=None is different?

qinhanmin2014 · 2019-04-15T02:44:18Z

We could also raise an error if it's all not 1 and pos_label is not specified. Maybe in the future?

Maybe in the future if someone complains? I guess it's not so important.

jnothman

Update the parameter documentation please

qinhanmin2014 · 2019-04-15T10:05:40Z

Update the parameter documentation please

docstring updated. IMO it's not so easy to describe current behavior.

jnothman · 2019-04-15T10:14:04Z

sklearn/metrics/classification.py

-        Label of the positive class. If None, the maximum label is used as
-        positive class
+        Label of the positive class.
+        When ``pos_label=None``, if y_true is in {-1, 1} or {0, 1},


Defaults to the greater label unless y_true is all 0 or all -1 in which case pos_label defaults to 1.

How about that?

qinhanmin2014 · 2019-04-15T10:23:31Z

How about that?

Much better, I also simplify the code.

qinhanmin2014 · 2019-04-17T16:05:42Z

ping @jnothman since the previous PR is tagged as 0.21 by you.

qinhanmin2014 · 2019-04-23T13:58:43Z

ping @jnothman since the previous PR is tagged as 0.21 by you :)

jnothman

This is as far as I got today!!

jnothman · 2019-04-23T08:50:13Z

sklearn/metrics/classification.py

+    labels = np.unique(y_true)
+    if len(labels) > 2:
+        raise ValueError("Only binary classification is supported. "
+                         "Provided labels %s." % labels)


Suggested change

"Provided labels %s." % labels)

"Labels in y_true: %s." % labels)

jnothman

Thanks @qinhanmin2014

glemaitre · 2019-04-26T08:48:30Z

doc/whats_new/v0.21.rst

@@ -439,6 +439,10 @@ Support for Python 3.4 and below has been officially dropped.
  and now it returns NaN and raises :class:`exceptions.UndefinedMetricWarning`.
  :issue:`12855` by :user:`Pawel Sendyk <psendyk>`.

+- |Fix| Fixed a bug where :func:`metrics.brier_score_loss` will sometimes
+  return incorrect result when there's only one class in ``y_true``.
+  :issue:`13628` by :user:`Hanmin Qin <qinhanmin2014>`.


Suggested change

:issue:`13628` by :user:`Hanmin Qin <qinhanmin2014>`.

:pr:`13628` by :user:`Hanmin Qin <qinhanmin2014>`.

glemaitre · 2019-04-26T09:38:07Z

Thanks @qinhanmin2014 I made the change in the what's new

…cikit-learn#13628)

…_true (scikit-learn#13628)" This reverts commit d01f763.

…cikit-learn#13628)

qinhanmin2014 added 2 commits April 12, 2019 22:33

FIX Correct brier_score_loss when there's only one class in y_true

7daee60

code cov

175b174

qinhanmin2014 changed the title ~~FIX Correct brier_score_loss when there's only one class in y_true~~ [MRG] FIX Correct brier_score_loss when there's only one class in y_true Apr 13, 2019

jnothman reviewed Apr 15, 2019

View reviewed changes

DOC what's new

9c01f3e

jnothman reviewed Apr 15, 2019

View reviewed changes

DOC parameter description

9f95d0e

jnothman reviewed Apr 15, 2019

View reviewed changes

Joel's suggestion

7e4c7a5

Merge branch 'master' into brier_score

45a3b5c

jnothman reviewed Apr 23, 2019

View reviewed changes

Joel's comment

7f9656f

jnothman approved these changes Apr 24, 2019

View reviewed changes

qinhanmin2014 added this to the 0.21 milestone Apr 24, 2019

glemaitre reviewed Apr 26, 2019

View reviewed changes

glemaitre self-requested a review April 26, 2019 09:15

glemaitre added 2 commits April 26, 2019 11:18

Merge remote-tracking branch 'origin/master' into pr/qinhanmin2014/13628

f536622

update whats new

4c2eda0

glemaitre merged commit 296ee0c into scikit-learn:master Apr 26, 2019

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

FIX Correct brier_score_loss when there's only one class in y_true (s…

d01f763

…cikit-learn#13628)

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

Revert "FIX Correct brier_score_loss when there's only one class in y…

bc753be

…_true (scikit-learn#13628)" This reverts commit d01f763.

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

Revert "FIX Correct brier_score_loss when there's only one class in y…

00fd5e0

…_true (scikit-learn#13628)" This reverts commit d01f763.

qinhanmin2014 deleted the brier_score branch April 29, 2019 01:21

marcelobeckmann pushed a commit to marcelobeckmann/scikit-learn that referenced this pull request May 1, 2019

FIX Correct brier_score_loss when there's only one class in y_true (s…

a3a3135

…cikit-learn#13628)

marcelobeckmann pushed a commit to marcelobeckmann/scikit-learn that referenced this pull request May 1, 2019

FIX Correct brier_score_loss when there's only one class in y_true (s…

c699f8d

…cikit-learn#13628)

koenvandevelde pushed a commit to koenvandevelde/scikit-learn that referenced this pull request Jul 12, 2019

FIX Correct brier_score_loss when there's only one class in y_true (s…

6b455b5

…cikit-learn#13628)

qinhanmin2014 mentioned this pull request Oct 28, 2019

plot_roc_curve doesn't correctly infer pos_label #15303

Closed

	"Provided labels %s." % labels)
	"Labels in y_true: %s." % labels)

	:issue:`13628` by :user:`Hanmin Qin <qinhanmin2014>`.
	:pr:`13628` by :user:`Hanmin Qin <qinhanmin2014>`.

Uh oh!

[MRG] FIX Correct brier_score_loss when there's only one class in y_true #13628

[MRG] FIX Correct brier_score_loss when there's only one class in y_true #13628

Uh oh!

Conversation

qinhanmin2014 commented Apr 12, 2019

Uh oh!

qinhanmin2014 commented Apr 13, 2019

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

jnothman Apr 15, 2019

Choose a reason for hiding this comment

Uh oh!

qinhanmin2014 Apr 15, 2019

Choose a reason for hiding this comment

Uh oh!

jnothman Apr 15, 2019

Choose a reason for hiding this comment

Uh oh!

qinhanmin2014 Apr 15, 2019

Choose a reason for hiding this comment

Uh oh!

qinhanmin2014 commented Apr 15, 2019

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

qinhanmin2014 commented Apr 15, 2019

Uh oh!

jnothman Apr 15, 2019

Choose a reason for hiding this comment

Uh oh!

qinhanmin2014 commented Apr 15, 2019

Uh oh!

qinhanmin2014 commented Apr 17, 2019

Uh oh!

qinhanmin2014 commented Apr 23, 2019

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

jnothman Apr 23, 2019

Choose a reason for hiding this comment

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

glemaitre Apr 26, 2019

Choose a reason for hiding this comment

Uh oh!

glemaitre commented Apr 26, 2019

Uh oh!

Uh oh!