Evaluation metrics for multi label classifiers #558

Closed
amueller opened this issue Jan 16, 2012 · 13 comments

@amueller
Member

As far as I can tell, these are completely missing.
I feel this makes the multi label classifiers much less useful.

I am not sure what common measures there are, but two that seem natural to me would be Hamming loss (how many classes per example were correct?) and 0-1 loss (for how many examples were all classes correct?).

At least these are two losses that are commonly used in structured prediction, afaik.
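
For concreteness, a minimal sketch of both losses on 0/1 label indicator matrices (the matrix format and values here are just for illustration):

```python
import numpy as np

# y_true and y_pred are 0/1 label indicator matrices: one row per
# example, one column per class.
y_true = np.array([[1, 0, 1], [0, 1, 0]])
y_pred = np.array([[1, 0, 0], [0, 1, 0]])

# Hamming loss: fraction of individual label assignments that are wrong.
hamming = np.mean(y_true != y_pred)                   # 1 wrong cell of 6 -> 0.1667

# 0-1 (subset) loss: fraction of examples with at least one wrong label.
zero_one = np.mean(np.any(y_true != y_pred, axis=1))  # 1 example of 2 -> 0.5
```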

@mblondel
Member

We thought that multi-label metrics warranted a separate pull request, as the multi-label branch had already been pending for a long time. Plus, the classifiers are useful even without evaluation metrics... :)

Yes, Hamming loss is a popular evaluation metric. Others are precision and recall. I implemented them in the test_multiclass.py file, but they need to be vectorized and merged into the metrics module. A question is whether we should have dedicated functions (say, multilabel_precision and multilabel_recall) or just "overload" the existing ones.
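
A rough sketch of what example-based versions could look like on 0/1 indicator matrices (multilabel_precision/multilabel_recall are just the candidate names from above, not merged code):

```python
import numpy as np

def multilabel_precision(Y_true, Y_pred):
    # Per example: |true ∩ predicted| / |predicted|, averaged over examples.
    # Assumes Y_true and Y_pred are 0/1 integer arrays.
    tp = np.sum(Y_true & Y_pred, axis=1)
    n_pred = np.sum(Y_pred, axis=1)
    return np.mean(np.where(n_pred > 0, tp / np.maximum(n_pred, 1), 0.0))

def multilabel_recall(Y_true, Y_pred):
    # Per example: |true ∩ predicted| / |true|, averaged over examples.
    tp = np.sum(Y_true & Y_pred, axis=1)
    n_true = np.sum(Y_true, axis=1)
    return np.mean(np.where(n_true > 0, tp / np.maximum(n_true, 1), 0.0))
```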

Also note that we need to support both lists of tuples and label indicator matrices as input. Both formats are currently supported by LabelBinarizer (and thus by OneVsRest), and both have their pros and cons.
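
The two formats side by side (values are illustrative only):

```python
# Format 1: a sequence of label tuples, one tuple per example.
y_tuples = [(0, 2), (1,), ()]          # the third example has no labels

# Format 2: a label indicator matrix, one column per class.
y_indicator = [[1, 0, 1],
               [0, 1, 0],
               [0, 0, 0]]
```

Roughly, the tuple format is compact when labels are sparse, while the indicator matrix is what vectorized metric code most naturally consumes.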

@amueller
Member Author

I didn't know you were working on this. Please don't take this as criticism of (the merging of) the multi-label branch.

I just wanted to raise awareness that this is a feature that still needs to be implemented.

The question about whether to create new functions or use the old ones is a good one.

As far as I can tell, the current score functions don't fail gracefully when given multi-label input (zero_one thinks everything is wrong; precision raises an "unhashable type" error). I think we have to do better input validation anyway, since not all classification metrics will support multi-label classification.

I think I would prefer separate functions for multi-label, and maybe branch from the existing functions where necessary/possible.
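
A rough sketch of that branching idea (the detection heuristic below is a made-up placeholder that assumes array-like input, not a proposed final check):

```python
import numpy as np

def _is_label_indicator_matrix(y):
    # Made-up heuristic: a 2-D array containing only 0s and 1s.
    y = np.asarray(y)
    return y.ndim == 2 and np.isin(y, (0, 1)).all()

def zero_one_loss(y_true, y_pred):
    if _is_label_indicator_matrix(y_true):
        # Multi-label branch: an example is wrong unless all labels match.
        return np.mean(np.any(np.asarray(y_true) != np.asarray(y_pred), axis=1))
    # Existing single-label behaviour: fraction of misclassified examples.
    return np.mean(np.asarray(y_true) != np.asarray(y_pred))
```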

@mblondel
Member

No sweat. I added a smiley at the end of my first paragraph :p

@amueller
Member Author

:)

@satra
Member

satra commented Jan 16, 2012

could either of you elaborate on the difference between multilabel and multiclass? are these synonymous or not?

we were working on multiclass metrics (#443) till we ran into possible issues with delayed initialization of these metrics for cross-validation and other testing, i.e. grid searches.

@mblondel
Member

> we were working on multiclass metrics (#443) till we ran into possible issues with delayed initialization of these metrics for cross-validation and other testing, i.e. grid searches.

Multi-label is when an instance can be labeled with 0, 1, or more labels. For example, a newspaper article can be labeled with both "economy" and "politics".

@amueller
Member Author

@satra:
There is an explanation at the top of http://scikit-learn.org/dev/modules/multiclass.html.
Do you think this explanation is sufficiently clear and prominent?

@satra
Member

satra commented Jan 16, 2012

@mblondel: thank you. now i am all squared away; the metrics additions do not cover this.

@satra
Member

satra commented Jan 16, 2012

@amueller thank you. i think the docs are good (i should read the docs more!). they do define multilabel and multiclass.

does this explicitly mean multiclass only, or can that module also do multilabel: "For example, it is possible to use these estimators to turn a binary classifier or a regressor into a multiclass classifier."? and on a side note, perhaps the docs should also point to the tree module as also being able to do multiclass. (sorry for spamming this thread - i'll stop now).

@amueller
Member Author

I am not sure I understood your question. The one-vs-rest and one-vs-one meta-estimators can generate a multiclass or multi-label (OvR only) classifier from any given binary classifier. Was that the question? If not, could you reformulate?
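
For what it's worth, a small end-to-end sketch of the OvR multi-label route, using current estimator names (the synthetic-data helper is just for the demo):

```python
from sklearn.datasets import make_multilabel_classification
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier

# OneVsRestClassifier fits one binary LogisticRegression per label and
# predicts a 0/1 label indicator matrix.
X, Y = make_multilabel_classification(n_samples=100, n_classes=3,
                                      random_state=0)
clf = OneVsRestClassifier(LogisticRegression()).fit(X, Y)
print(clf.predict(X[:2]))   # one row of 0/1 labels per example
```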

@satra
Member

satra commented Jan 16, 2012

i meant to ask whether the following sentence in the docs should be augmented to say:

"For example, it is possible to use these estimators to turn a binary classifier or a regressor into a multiclass or multilabel classifier."

or whether those estimators could only turn things multiclass.

from your reply it seems it would be good to point out that only ovr can be used with a binary classifier to do multilabel.

@arjoly
Member

arjoly commented Jan 11, 2013

I have an implementation of several measures for multi-label classification. However, I had to hack the label binarizer.

To avoid writing one function per format, I wrote several check functions; see this gist.

Am I doing it wrong?
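
A hypothetical sketch of what such a check function might do, normalizing either format to an indicator matrix (the name and signature here are made up):

```python
import numpy as np

def to_indicator(y, classes):
    # Accept either a sequence of label tuples or an indicator matrix,
    # and always return a dense 0/1 indicator matrix.
    if len(y) and isinstance(y[0], (tuple, list, set)):
        index = {c: j for j, c in enumerate(classes)}
        out = np.zeros((len(y), len(classes)), dtype=int)
        for i, labels in enumerate(y):
            for label in labels:
                out[i, index[label]] = 1
        return out
    return np.asarray(y, dtype=int)   # assume it is already an indicator matrix

# to_indicator([(0, 2), (1,), ()], classes=[0, 1, 2])
# -> array([[1, 0, 1],
#           [0, 1, 0],
#           [0, 0, 0]])
```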

@arjoly
Member

arjoly commented Jul 22, 2013

There are now several multi-label metrics.
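
For readers landing here later, the standard metric functions now accept label indicator matrices, e.g.:

```python
import numpy as np
from sklearn.metrics import accuracy_score, f1_score, hamming_loss

y_true = np.array([[1, 0, 1], [0, 1, 0]])
y_pred = np.array([[1, 0, 0], [0, 1, 0]])

print(hamming_loss(y_true, y_pred))               # per-label error rate
print(accuracy_score(y_true, y_pred))             # subset (0-1) accuracy
print(f1_score(y_true, y_pred, average="micro"))  # micro-averaged F1
```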

@arjoly arjoly closed this as completed Jul 22, 2013