[MRG] Adds plot_precision_recall_curve #14936


Merged: 30 commits, Nov 11, 2019

Conversation

thomasjpfan (Member)

Reference Issues/PRs

Related to #7116

What does this implement/fix? Explain your changes.

This PR adds plot_precision_recall_curve.

Any other comments?

Only supports binary classifiers.
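
A minimal usage sketch of the new API (the dataset and estimator here are illustrative, not part of this PR):

    from sklearn.datasets import make_classification
    from sklearn.linear_model import LogisticRegression
    from sklearn.metrics import plot_precision_recall_curve

    X, y = make_classification(n_classes=2, random_state=0)
    clf = LogisticRegression().fit(X, y)

    # Returns a PrecisionRecallDisplay; extra kwargs go to matplotlib's plot.
    disp = plot_precision_recall_curve(clf, X, y)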

@glemaitre (Member) left a comment:

Looks good, only a couple of changes.


Parameters
-----------
precision : ndarray of shape (n_thresholds + 1, )

Suggested change:
- precision : ndarray of shape (n_thresholds + 1, )
+ precision : ndarray of shape (n_thresholds + 1,)

@glemaitre (Member) left a comment:

You can also add an entry in what's new


if y_pred.ndim != 1:
    if y_pred.shape[1] > 2:
        raise ValueError("Estimator should solve a "
                         "binary classification problem")

isn't it possible to use check_classification_targets?
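
For reference, a hedged sketch of what that might look like; note that check_classification_targets validates y rather than y_pred, so type_of_target would be needed for the binary check:

    from sklearn.utils.multiclass import check_classification_targets, type_of_target

    check_classification_targets(y)  # raises ValueError for non-classification targets
    if type_of_target(y) != "binary":
        raise ValueError("Estimator should solve a binary classification problem")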

@amueller (Member):

conflicts ;)


y_pred = prediction_method(X)

if is_predict_proba and y_pred.ndim != 1:

If is_predict_proba is true, y_pred.ndim is never 1, right?
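
(Context for readers: with a fitted binary classifier such as the clf from the sketch above, predict_proba is always 2-D while decision_function is 1-D:)

    # predict_proba returns (n_samples, n_classes) even for binary problems,
    # so y_pred.ndim is 2 whenever is_predict_proba is True.
    print(clf.predict_proba(X).shape)      # (n_samples, 2) -> ndim == 2
    print(clf.decision_function(X).shape)  # (n_samples,)   -> ndim == 1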

plot_precision_recall_curve(clf, X, y)

msg = "Estimator should solve a binary classification problem"
y_binary = y == 1

I don't understand why this raises this error, both semantically and in terms of what the code does. I thought the code checked y_pred, which we're not changing here, right?

@amueller (Member):

looks good apart from nitpicks

@glemaitre (Member):

The error raised does not match.

@thomasjpfan (Author):

CC @NicolasHug

@qinhanmin2014 (Member):

I agree that this is a blocker, but we need to figure out a solution for #15303

@NicolasHug (Member):

The user guide link of plot_precision_recall_curve is wrong: there's no point in linking to the visualization API UG. Also, some of the links are broken.

@thomasjpfan

@amueller (Member) commented on Nov 6, 2019:

see #15405 (comment)

The easy fix is removing it and inferring it from the estimator. The better fix is to actually ensure that we correctly slice predict_proba / decision_function.
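
A rough sketch of that "better fix" (illustrative only; pos_label is the hypothetical parameter under discussion, clf a fitted binary classifier):

    import numpy as np

    # Slice the predict_proba column matching pos_label; decision_function
    # scores would instead need their sign flipped when pos_label is classes_[0].
    class_idx = np.flatnonzero(clf.classes_ == pos_label)[0]
    y_score = clf.predict_proba(X)[:, class_idx]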

@thomasjpfan (Author):

Went with removing pos_label and inferring it from the estimator.

@amueller (Member) commented on Nov 6, 2019:

Then we should do the same for plot_roc_curve and open an issue to do the fix for the next release?

@qinhanmin2014 (Member) left a comment:

Perhaps it's better to keep this consistent with plot_roc_curve/RocCurveDisplay (API and code).

@@ -71,6 +71,7 @@ Functions

.. autosummary::

metrics.plot_precision_recall_curve

Alphabetical order?

@@ -82,5 +83,6 @@ Display Objects

.. autosummary::

metrics.PrecisionRecallDisplay

Alphabetical order?

@@ -79,6 +79,8 @@

from ._plot.roc_curve import plot_roc_curve
from ._plot.roc_curve import RocCurveDisplay
from ._plot.precision_recall import plot_precision_recall_curve

rename the file to precision_recall_curve.py?

Axes object to plot on. If `None`, a new figure and axes is
created.

label_name : str, default=None

why is it different from RocCurveDisplay?


Parameters
-----------
precision : ndarray of shape (n_thresholds + 1,)

n_thresholds is not defined in this context.
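
(The shape comes from precision_recall_curve, whose precision and recall arrays are one element longer than thresholds; y_true and y_score here are assumed inputs:)

    from sklearn.metrics import precision_recall_curve

    precision, recall, thresholds = precision_recall_curve(y_true, y_score)
    # hence the (n_thresholds + 1,) shape documented above
    assert precision.shape == (thresholds.shape[0] + 1,)
    assert recall.shape == (thresholds.shape[0] + 1,)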

line_kwargs.update(**kwargs)

self.line_, = ax.plot(self.recall, self.precision, **line_kwargs)
ax.set(xlabel="Recall", ylabel="Precision", ylim=[0.0, 1.05],

rely on default xlim/ylim?

@thomasjpfan (Author) replied:

For x, going without the explicit limit is fine. I think for y it is kind of important because of the scaling:

With ylim explicitly set: [screenshot]

Not set: [screenshot]
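
A minimal sketch of the comparison (assuming precision and recall arrays are already computed):

    import matplotlib.pyplot as plt

    fig, (ax_set, ax_auto) = plt.subplots(1, 2)
    ax_set.plot(recall, precision)
    ax_set.set(xlabel="Recall", ylabel="Precision", ylim=[0.0, 1.05])
    ax_auto.plot(recall, precision)  # autoscaled y-axis magnifies small changes
    ax_auto.set(xlabel="Recall", ylabel="Precision")
    plt.show()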

precision : ndarray of shape (n_thresholds + 1,)
Precision values.

recall : ndarray of shape (n_thresholds + 1,)

n_thresholds is not defined in this context.

:term:`predict_proba` is tried first and if it does not exist
:term:`decision_function` is tried next.

label_name : str, default=None

not consistent with plot_roc_curve

@thomasjpfan (Author) replied:

Changed this to `name` to be consistent with plot_roc_curve.

@NicolasHug (Member) left a comment:

Looks good, but it needs to link to the UG, with small updates.


It is recommended to use :func:`~sklearn.metrics.plot_precision_recall_curve`
to create a visualizer. All parameters are stored as attributes.


Add link to Visualization UG

"""Plot Precision Recall Curve for binary classifers.

Extra keyword arguments will be passed to matplotlib's `plot`.
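
(For example, matplotlib line properties can be forwarded directly; clf, X, y as in the sketch at the top of this thread:)

    disp = plot_precision_recall_curve(clf, X, y, linestyle="--", alpha=0.8)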


@NicolasHug (Member) left a comment:

Not sure why tests are failing but LGTM

@qinhanmin2014 (Member) left a comment:

I think tests are failing because we no longer set xlim and ylim manually, but we didn't update the tests.

I feel a little uncomfortable that plot_roc_curve and plot_precision_recall_curve are written in different ways, e.g., we introduce is_predict_proba in plot_precision_recall_curve but do not introduce it in plot_roc_curve. If we keep these two functions consistent, it will be much easier to maintain, but perhaps it's not so important.

@thomasjpfan (Author):

> If we keep these two functions consistent, it will be much easier to maintain, but perhaps it's not so important.

I refactored the response method checking into a _check_classifer_response_method helper that can be used by plot_roc_curve. We can have a follow-up PR to have plot_roc_curve use it as well, to keep the error messages and code consistent.
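
A hedged sketch of what such a helper could look like (a reconstruction, not necessarily the exact merged code):

    def _check_classifer_response_method(estimator, response_method):
        """Return the prediction method matching `response_method`,
        raising ValueError if the estimator lacks a suitable one."""
        if response_method not in ("predict_proba", "decision_function", "auto"):
            raise ValueError("response_method must be 'predict_proba', "
                             "'decision_function' or 'auto'")
        error_msg = "response method {} is not defined"
        if response_method != "auto":
            prediction_method = getattr(estimator, response_method, None)
            if prediction_method is None:
                raise ValueError(error_msg.format(response_method))
        else:
            # Try predict_proba first, then fall back to decision_function.
            predict_proba = getattr(estimator, "predict_proba", None)
            decision_function = getattr(estimator, "decision_function", None)
            prediction_method = predict_proba or decision_function
            if prediction_method is None:
                raise ValueError(
                    error_msg.format("decision_function or predict_proba"))
        return prediction_method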

@qinhanmin2014 (Member) left a comment:

Should we rename the files to _precision_recall_curve.py and _roc_curve.py?

@@ -0,0 +1,40 @@
def _check_classifer_response_method(estimator, response_method):

Is it good to include things in __init__.py? Perhaps base.py?
Let's update plot_roc_curve in this PR?

@thomasjpfan (Author):

> Should we rename the files to _precision_recall_curve.py and _roc_curve.py?

Since they are both in _plot, either way works for me.

> Let's update plot_roc_curve in this PR?

Done

@@ -180,18 +181,8 @@ def plot_roc_curve(estimator, X, y, sample_weight=None,
    else:
        raise ValueError(classification_error)

    if response_method != "auto":

We also need to remove the following lines above:

if response_method not in ("predict_proba", "decision_function", "auto"):
        raise ValueError("response_method must be 'predict_proba', "
                         "'decision_function' or 'auto'")

@qinhanmin2014 qinhanmin2014 merged commit 968252d into scikit-learn:master Nov 11, 2019
panpiort8 pushed a commit to panpiort8/scikit-learn that referenced this pull request Mar 3, 2020