
[WIP] rewrite precision_recall_fscore_support #1990


Closed
jnothman wants to merge 30 commits

Conversation

jnothman (Member)
precision_recall_fscore_support was getting gargantuan, because the similarities between the different input formats and metric variations weren't being exploited; rather, almost everything was special-cased. As a result, some bugs and inconsistencies crept in (admittedly on my watch as a reviewer).

This implementation:

  • is much smaller and less deeply nested, and should be faster in some cases: LabelEncoder is used so that per-label counts can be taken with bincount (building on #1985, "FIX helper to check multilabel types", and #1987, "ENH support multilabel targets in LabelEncoder"); a sketch of the bincount idea follows this list.
  • deprecates pos_label and introduces neg_label, which makes micro-averaging meaningful in the multiclass case by allowing a majority class to be ignored (see #1983, "ENH P/R/F should be able to ignore a majority class in the multiclass case").
  • neg_label has not yet been tested; to support it, multilabel indicator matrices are assumed to be represented with values <1 and 1 (or False and True).
  • has not yet implemented support for the labels argument, largely because I don't know what it means (see #1989, "DOC clarify the use of label in P/R/F metric family"). It's not hard to implement, but I wish labels were deprecated in favour of stating a convention regarding label ordering in the average=None case.
  • has not yet fixed some broken tests for multilabel average='samples'
  • has not yet updated the documentation or signatures of the derivative functions (precision_score, etc.) with respect to pos_label/neg_label.
  • currently assumes P and R go to 0 when their denominators are 0. I think this is incorrect behaviour, but it is backward-compatible (except with the average='samples' implementation; again, whoops): arguably precision should be perfect when nothing is retrieved, and recall should be perfect when there are no instances to retrieve. (Not realising that scikit-learn had adopted the 0 convention, I suggested in the documentation that 1 should be used.) The decision made elsewhere is to return 0 and warn; this is not yet implemented, but a sketch of that convention also follows this list.
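
As a rough illustration of the bincount point above, here is a minimal sketch (not the code in this PR; the variable names are illustrative only) of how LabelEncoder reduces per-label counting to vectorized bincount calls:

```python
# Minimal sketch of the LabelEncoder + bincount idea; not the PR's code.
import numpy as np
from sklearn.preprocessing import LabelEncoder

y_true = np.array(["cat", "dog", "cat", "bird", "dog"])
y_pred = np.array(["cat", "cat", "cat", "bird", "dog"])

le = LabelEncoder().fit(np.hstack([y_true, y_pred]))
t = le.transform(y_true)
p = le.transform(y_pred)
n_labels = len(le.classes_)

# One bincount per quantity, instead of a Python loop over labels:
tp = np.bincount(t[t == p], minlength=n_labels)   # true positives per label
pred_per_label = np.bincount(p, minlength=n_labels)
true_per_label = np.bincount(t, minlength=n_labels)

precision = tp / np.maximum(pred_per_label, 1)  # 0 where nothing is predicted
recall = tp / np.maximum(true_per_label, 1)     # 0 where a label never occurs
```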
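And a sketch of the "return 0 and warn" convention mentioned in the last bullet; _zero_safe_divide is a hypothetical helper name, not scikit-learn API:

```python
# Hedged sketch of the "return 0 and warn" convention discussed above;
# _zero_safe_divide is a hypothetical helper, not part of scikit-learn.
import warnings
import numpy as np

def _zero_safe_divide(numerator, denominator, metric_name):
    """Elementwise division returning 0.0 (with a warning) where the
    denominator is 0, rather than propagating NaN or raising."""
    numerator = np.asarray(numerator, dtype=float)
    denominator = np.asarray(denominator, dtype=float)
    result = np.zeros_like(denominator)
    nonzero = denominator != 0
    result[nonzero] = numerator[nonzero] / denominator[nonzero]
    if not nonzero.all():
        warnings.warn("%s is ill-defined and set to 0.0 for labels with a "
                      "zero denominator" % metric_name)
    return result

# e.g. precision = _zero_safe_divide(tp, pred_per_label, "precision")
```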

jnothman (Member, Author)
The remaining test failures result from problems in the previous implementation, the currently unhandled labels parameter, or my stubborn refusal to implement the label indicator matrix's pos_label application.

neg_label is also yet to be tested.

jnothman (Member, Author)
Rebased on #1988 to correct values in test cases.

jnothman (Member, Author) commented Jul 8, 2013

Status of this PR:

jnothman closed this Jul 27, 2013