labels argument of classification_report is not useful when y is a list of strings

In scikit-learn 0.14.1 it was possible to have `y_true` and `y_pred` lists of strings, and pass a list of strings as a `labels` argument to `classification_report`, and it worked as expected: only labels from this list were included to the report. This no longer works in scikit-learn master. 

It was never documented that it should work: docs say that `labels` is an "Optional list of label indices to include in the report." So, according to docs, it was undefined what happens if `y` consists of strings and `labels` argument is passed - caller doesn't have correct indices to pass in this case.

It seems it is better to either raise an error if `labels` is passed when `y` is not pre-transformed by a LabelEncoder, or to restore and document 0.14.1 behavior. What do you think?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

labels argument of classification_report is not useful when y is a list of strings #3123

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

labels argument of classification_report is not useful when y is a list of strings #3123

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions