[MRG + 1] Sort labels in precision_recall_fscore_support #4147

Merged

Conversation

amueller
Member

Fixes #3670.
The code already stores and restores the original label ordering, so we can discard it here.
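To illustrate the symptom from #3670, a minimal sketch (hypothetical data, not code from this PR): the two calls below should agree, but without this fix the per-label results for the unsorted labels were misaligned.

import numpy as np
from sklearn.metrics import precision_recall_fscore_support

y_true = [1, 1, 2, 2, 3, 3]
y_pred = [1, 2, 2, 3, 3, 1]

# Per-label scores with the labels given in sorted order.
p1, r1, f1, s1 = precision_recall_fscore_support(
    y_true, y_pred, labels=[1, 2, 3], average=None)

# The same labels in a different order: with the fix they are sorted
# internally, so the results match the call above.
p2, r2, f2, s2 = precision_recall_fscore_support(
    y_true, y_pred, labels=[3, 1, 2], average=None)

np.testing.assert_array_equal(p1, p2)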

@amueller
Member Author

Possibly of interest to @jnothman and @arjoly

@amueller
Member Author

I guess this is also fixed by #2610, but I'm not sure what the state of that is... (actually I'm not sure it's fixed there, but it's definitely related)

@amueller force-pushed the precision_recall_unsorted_indices branch from 4e5221b to b2ffd3e on January 22, 2015 21:23
@jnothman
Member

I've not looked at the fix, but I think you should probably do so, even if I get #2610 up...

@@ -303,6 +303,18 @@ def test_precision_recall_f1_score_multiclass():
     assert_array_equal(s, [24, 20, 31])
 
 
+def test_precision_refcall_f1_score_multilabel_unordered_labels():
Member
Might as well test this in the multiclass case and for different averaging strategies, too, no?
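Something along these lines, perhaps (a sketch of the suggestion with made-up data, not the test that was eventually committed):

import numpy as np
from numpy.testing import assert_array_almost_equal
from sklearn.metrics import precision_recall_fscore_support

# Multiclass data; the scores should not depend on the order of `labels`.
y_true = [1, 3, 3, 2]
y_pred = [1, 1, 3, 2]

for average in [None, 'micro', 'macro', 'weighted']:
    sorted_result = precision_recall_fscore_support(
        y_true, y_pred, labels=[1, 2, 3], average=average)
    unsorted_result = precision_recall_fscore_support(
        y_true, y_pred, labels=[3, 1, 2], average=average)
    for a, b in zip(sorted_result, unsorted_result):
        if a is None:  # support is None whenever an average is requested
            assert b is None
        else:
            assert_array_almost_equal(a, b)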

Member Author
Sure, will add.

Member
This could be made into an invariance test.

Member Author
I'll look into it.

Member Author
If I use average="samples" then support is None. Is that expected?

Member Author
I guess it is...

Member
Yes, it is.
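For reference, the behavior in question (made-up data): support is only meaningful per label, so every averaging strategy returns None in its place.

import numpy as np
from sklearn.metrics import precision_recall_fscore_support

y_true = np.array([[1, 0, 1], [0, 1, 1]])
y_pred = np.array([[1, 0, 0], [0, 1, 1]])

# Averaged scores: there is no single aggregate support, so it is None.
p, r, f, s = precision_recall_fscore_support(y_true, y_pred, average='samples')
print(s)  # None

# Per-label scores: support is the number of true instances per label.
p, r, f, s = precision_recall_fscore_support(y_true, y_pred, average=None)
print(s)  # [1 1 2]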

@arjoly
Member

arjoly commented Jan 23, 2015

Thanks @amueller

@amueller
Member Author

Was that a 👍 @arjoly ? ;)

@@ -847,7 +847,7 @@ def precision_recall_fscore_support(y_true, y_pred, beta=1.0, labels=None,
     if labels is None:
         labels = unique_labels(y_true, y_pred)
     else:
-        labels = np.asarray(labels)
+        labels = np.sort(labels)
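As an aside, np.sort accepts any array-like and returns a sorted ndarray copy, which is why the separate np.asarray call becomes redundant:

import numpy as np

labels = [4, 1, 2, 3]   # a plain list works too
print(np.sort(labels))  # [1 2 3 4], returned as an ndarray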
Member
Is there a regression test for this?

Member
Don't take this comment into account. :-)

@arjoly
Member

arjoly commented Jan 26, 2015

Besides Joel's comment and my small addendum, +1

@amueller
Member Author

@arjoly I tried

import numpy as np
# assert_equal and the metric registries come from scikit-learn's test
# utilities (sklearn/metrics/tests/test_common.py at the time)
from sklearn.utils.testing import assert_equal
from sklearn.metrics.tests.test_common import (ALL_METRICS,
                                               MULTILABELS_METRICS,
                                               METRICS_WITH_LABELS)

y_true = np.array([[1, 1, 0, 0], [1, 1, 0, 0]])
y_pred = np.array([[0, 0, 1, 1], [0, 1, 1, 0]])
labels = np.array([4, 1, 2, 3])

for name in set(MULTILABELS_METRICS).intersection(METRICS_WITH_LABELS):
    metric = ALL_METRICS[name]
    score_labels = metric(y_true, y_pred, labels=labels)
    score = metric(y_true, y_pred)
    assert_equal(score_labels, score)

but that already passes on master; I'm not sure why :-/

@amueller
Member Author

Shouldn't the same problem pop up in precision_score and so on? They use the same code path, right?
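For context (a sketch with made-up data): precision_score, recall_score and f1_score all delegate to precision_recall_fscore_support, so they share its labels handling.

from sklearn.metrics import precision_score

y_true = [1, 3, 3, 2]
y_pred = [1, 1, 3, 2]

# This call reaches the same labels-sorting code path inside
# precision_recall_fscore_support.
print(precision_score(y_true, y_pred, labels=[3, 1, 2], average=None))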

@amueller force-pushed the precision_recall_unsorted_indices branch 2 times, most recently from e2abb81 to bfea61c on January 26, 2015 21:11
@amueller
Member Author

OK, never mind, I forgot to add average=None. Added the test now.
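The reason average=None matters, in a single-metric illustration (made-up data, distinct from the test): an averaged score collapses the per-label results into one number, so label order cannot affect it; only average=None exposes the alignment between scores and labels.

import numpy as np
from sklearn.metrics import precision_score

y_true = np.array([[1, 0, 1, 0], [0, 1, 1, 0], [1, 0, 0, 1]])
y_pred = np.array([[1, 0, 0, 0], [0, 1, 1, 1], [1, 1, 0, 1]])

# Averaged: insensitive to label order, which is why the earlier loop
# passed even on master.
print(precision_score(y_true, y_pred, average='macro'))

# Per label: this is the array whose ordering the regression test checks.
print(precision_score(y_true, y_pred, average=None))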

@coveralls

Coverage Status

Coverage increased (+0.0%) to 94.78% when pulling bfea61c on amueller:precision_recall_unsorted_indices into 50b83ae on scikit-learn:master.

@amueller force-pushed the precision_recall_unsorted_indices branch from bfea61c to 3566167 on January 26, 2015 21:46
@amueller changed the title from "[MRG] Sort labels in precision_recall_fscore_support" to "[MRG + 1] Sort labels in precision_recall_fscore_support" on Jan 26, 2015
@amueller force-pushed the precision_recall_unsorted_indices branch from 3566167 to fd9b03a on January 26, 2015 22:07
larsmans added a commit that referenced this pull request on Jan 27, 2015:
FIX Sort labels in precision_recall_fscore_support
@larsmans merged commit 92e1e39 into scikit-learn:master on Jan 27, 2015
@arjoly
Member

arjoly commented Jan 27, 2015

Great thanks @amueller !

@amueller deleted the precision_recall_unsorted_indices branch on January 27, 2015 21:59
@amueller
Member Author

Thanks @larsmans and @arjoly for the reviews :)

def test_no_averaging_labels():
    # test labels argument when not using averaging
    # in multi-class and multi-label cases
    y_true_multilabel = np.array([[1, 1, 0, 0], [1, 1, 0, 0]])
Member
Why are you referencing label 4 when the only labels available are 0, 1, 2, 3? It's a wonder that this works... Or rather, it's no surprise this breaks #4287...
