
[MRG] Multi-label metrics: accuracy, hamming loss and zero-one loss #1606


Closed
arjoly wants to merge 43 commits into master from multilabel-metrics

Conversation

@arjoly
Member

arjoly commented Jan 22, 2013

This pull request intends to bring three new features:

  • a tested and generalized unique_labels function;
  • multi-label support for the accuracy_score and zero_one_loss functions;
  • the Hamming loss metric (hamming_loss) with multi-label support.

Before merging, I would like to suggest adding a new module where multi-label utilities such as unique_labels and _is_label_indicator_matrix are collected.

Furthermore, I have to reorganise (cosmit) some of the functions into a multi-label category in metrics.py, but I will wait until the reviews are done.

This pull request also tackles issue #558. Reviews and comments are welcome! :-)
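
For concreteness, a sketch of the intended multi-label behaviour (module paths as they ended up in this PR; expected outputs in the comments):

import numpy as np
from sklearn.metrics import accuracy_score, hamming_loss, zero_one_loss
from sklearn.utils.multiclass import unique_labels

# Two samples, three labels, encoded as a label indicator matrix.
Y_true = np.array([[1, 1, 0],
                   [0, 1, 1]])
Y_pred = np.array([[1, 0, 0],
                   [0, 1, 1]])

accuracy_score(Y_true, Y_pred)  # 0.5: only the second sample matches exactly
zero_one_loss(Y_true, Y_pred)   # 0.5: the complement of the subset accuracy
hamming_loss(Y_true, Y_pred)    # 0.1666...: one wrong label out of six
unique_labels([0, 1], [1, 2])   # array([0, 1, 2])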


Parameters
----------
y_true : array-like or list of labels or label binary matrix
Member

Very happy that you decided to support both list of labels and label binary matrix. Regarding the name of the latter, maybe label indicator matrix or class membership matrix would be more explicit?

Member Author

Thanks "label indicator matrix" is better name than "label binary matrix".

@mblondel
Member

I think I'm +1 for moving _is_label_indicator_matrix and _is_multilabel to the metrics module (and of course, to make them public).

@arjoly
Member Author

arjoly commented Jan 22, 2013

Since those functions don't assess the performance of an estimator, I am not sure that the metrics module is the best place. I was thinking about a sklearn.multilabel module, a sklearn.utils.multilabels module, or putting those functions in sklearn.multiclass.

@larsmans
Member

Let's put them in sklearn.multiclass.

@mblondel
Member

+1 for multiclass

---------
- :func:`metrics.accuracy_score` and :func:`metrics.zero_one_loss` support
multi-label classification. A new metric :func:`metrics.hamming_loss` is
added with multi-label support.
Member

Add your name here: credit where it belongs!

@GaelVaroquaux
Member

+1 for multiclass too.

@arjoly
Member Author

arjoly commented Jan 23, 2013

When I put is_label_indicator_matrix and is_multilabel into the multiclass module, I get a circular import.
Do you advise doing a lazy import?

@mblondel
Member

Another possible place would be in the utils.

@amueller
Member

what is the circle?

@arjoly
Member Author

arjoly commented Jan 23, 2013

what is the circle?

In preprocessing, LabelBinarizer needs is_multilabel and is_label_indicator_matrix, which are (will be) in multiclass.
In multiclass, unique_labels needs LabelBinarizer from preprocessing.
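
A minimal, self-contained reproduction of that kind of cycle (the package and its contents are synthetic, with names mirroring the ones above):

import pathlib, subprocess, sys, tempfile

# Build a throwaway package containing the import cycle, then import it
# in a subprocess to observe the failure.
with tempfile.TemporaryDirectory() as d:
    pkg = pathlib.Path(d, "pkg")
    pkg.mkdir()
    pkg.joinpath("__init__.py").write_text("")
    pkg.joinpath("preprocessing.py").write_text(
        "from pkg.multiclass import is_multilabel\n"      # LabelBinarizer needs it
        "class LabelBinarizer: ...\n")
    pkg.joinpath("multiclass.py").write_text(
        "from pkg.preprocessing import LabelBinarizer\n"  # unique_labels needs it
        "def is_multilabel(y): ...\n")
    out = subprocess.run([sys.executable, "-c", "import pkg.preprocessing"],
                         cwd=d, capture_output=True, text=True)
    # On recent Pythons the last traceback line reads roughly:
    # ImportError: cannot import name 'LabelBinarizer' from partially
    # initialized module 'pkg.preprocessing' (... circular import)
    print(out.stderr.splitlines()[-1])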

@arjoly
Member Author

arjoly commented Jan 23, 2013

Maybe the best place is in preprocessing. The narrative doc says:

The sklearn.preprocessing package provides several common utility functions and transformer classes to change raw feature vectors into a representation that is more suitable for the downstream estimators.

So it would be logical to find functions there that check or analyze raw data.

@GaelVaroquaux
Member

In preprocessing, LabelBinarizer needs is_multilabel and
is_label_indicator_matrix, which are in multiclass.
In multiclass, unique_labels needs LabelBinarizer from preprocessing.

OK, I think that tells me that we need to move things into utils.

@arjoly
Member Author

arjoly commented Jan 23, 2013

All right! I will create a new utils module.

@arjoly
Member Author

arjoly commented Jan 23, 2013

There is now a sklearn.utils.multiclass module (to rename to sklearn.utils.multilabels?).

I have pulled the unique_labels functionality out of LabelBinarizer and concentrated everything in unique_labels to get rid of the circular import problem.

@mblondel
Member

Could you add multilabel support to precision / recall / f1 score? Once this is done, the multilabel tests in the multiclass module can be updated to use the metrics directly:
https://github.com/arjoly/scikit-learn/blob/ed98486d0c6b0072afe3b8b96a764037c90d2ad5/sklearn/tests/test_multiclass.py#L35

@arjoly
Member Author

arjoly commented Jan 23, 2013

I intended to do that in my next pull request.
But ok, I will have a look at that tomorrow.

@amueller
Member

+1 for a separate PR

@arjoly
Member Author

arjoly commented Jan 24, 2013

+1 for a separate PR

The voice of reason: small and reviewable PRs.

Don't worry @mblondel, I intend to add another PR with precision, recall and F-score.
It is pretty high on my todo list.

Perhaps one thing that could change is the name of the new utils module:

  • sklearn.utils.classification
  • sklearn.utils.multilabels
  • sklearn.utils.multiclass

@arjoly
Member Author

arjoly commented Jan 24, 2013

I rebased on top of master.

@mblondel
Member

No worries.

Could you discuss the relationship between hamming loss and zero-one loss in the docstring? Thanks.

@arjoly
Member Author

arjoly commented Jan 25, 2013

@mblondel I think that I have taken your remarks into account.

By the way, I added some more invariance tests.

@mblondel
Member

In the multiclass (not multilabel) case, they are the same, right?

@arjoly
Member Author

arjoly commented Jan 25, 2013

No, they differ. In the hamming loss, you divide each error by the number of labels.

One small example:

In [22]: y2 = np.random.randint(0, 4, size=(5, ))

In [23]: y1 = np.random.randint(0, 4, size=(5, ))

In [24]: y1
Out[24]: array([2, 0, 3, 2, 2])

In [25]: y2
Out[25]: array([3, 1, 2, 1, 2])

In [26]: hamming_loss(y1, y2)
Out[26]: 0.40000000000000002

In [27]: zero_one_loss(y1, y2)
Out[27]: 0.80000000000000004

But thinking about it, the hamming loss is never larger than the zero-one loss. I will correct this.
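
For later readers, the inequality in notation of my own choosing (n samples, L labels, \triangle the symmetric difference):

\mathrm{HL} = \frac{1}{n} \sum_{i=1}^{n} \frac{|y_i \,\triangle\, \hat{y}_i|}{L} \;\le\; \frac{1}{n} \sum_{i=1}^{n} \mathbf{1}[y_i \ne \hat{y}_i] = L_{0-1}

since each per-sample term on the left is at most 1 and vanishes exactly when the whole label set is predicted correctly.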

@mblondel
Member

Did you decide to add this normalization or is it always implemented like this in multilabel papers? http://en.wikipedia.org/wiki/Hamming_distance uses the unnormalized count.

@mblondel
Member

I'm asking because it is important that our implementation of the metrics is as standard as possible. We could add a normalize option (the question is, what should the default value be?).

@arjoly
Member Author

arjoly commented Jan 25, 2013

The following papers agree on normalization by the number of labels:

  1. Grigorios Tsoumakas, Ioannis Katakis. Multi-Label Classification: An Overview. International Journal of Data Warehousing & Mining, 3(3), 1-13, July–September 2007.
  2. Jesse Read, Bernhard Pfahringer, Geoff Holmes, Eibe Frank. Classifier Chains for Multi-label Classification. Machine Learning Journal, Springer, Vol. 85(3), 2011.
  3. Min-Ling Zhang, Zhi-Hua Zhou. ML-KNN: A Lazy Learning Approach to Multi-Label Learning.
  4. Gjorgji Madjarov, Dragi Kocev, Dejan Gjorgjevikj, Sašo Džeroski. An Extensive Experimental Comparison of Methods for Multi-Label Learning. Pattern Recognition, Vol. 45(9), 2012.
  5. Wei Gao, Zhi-Hua Zhou. On the Consistency of Multi-Label Learning. JMLR.

@mblondel
Member

Great. Maybe you can cite the first one then.

@arjoly
Member Author

arjoly commented Jan 25, 2013

Great. Maybe you can cite the first one then.

Done

@arjoly
Member Author

arjoly commented Mar 2, 2013

I will have time this week to work on the precision, recall and F-measure metrics to support the multi-label format. Furthermore, I would like to add the Jaccard similarity measure (an example-based accuracy measure).

What do you advise? I will need some of the functions in utils.multiclass.

@amueller
Member

amueller commented Mar 2, 2013

Maybe do a PR on top of this one? We really should try to get this one in :-/

@arjoly
Member Author

arjoly commented Mar 2, 2013

If I do a PR on top of this one, will I have problems if I rebase this one on top of master?

@larsmans
Member

larsmans commented Mar 2, 2013

This one merges cleanly, no reason to rebase (though I'd like to rebase -i it prior to the actual merge to squash some commits).

You can branch off current master, merge this branch into your new branch, then add the functionality you want.
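
As a sketch, that workflow in git commands (remote and branch names are illustrative):

git checkout master
git pull upstream master            # start from current master
git checkout -b multilabel-prf      # hypothetical name for the follow-up branch
git merge multilabel-metrics        # bring in this PR's branch
# ...commit the new metrics on top, then open the follow-up PR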

@arjoly
Member Author

arjoly commented Mar 2, 2013

This one merges cleanly, no reason to rebase (though I'd like to rebase -i it prior to the actual merge to squash some commits).

You can branch off current master, merge this branch into your new branch, then add the functionality you want.

I will do as you suggest! Thanks!

@larsmans
Member

larsmans commented Mar 2, 2013

Btw, it's easier if you first squash the commits using a rebase -i. It doesn't have to be all in one commit, but lots of microcommits make it harder to cherry-pick when an intermediate release is done.

@amueller
Member

amueller commented Mar 2, 2013

Or someone can give this one a second +1 and we merge it ;)

@larsmans
Member

larsmans commented Mar 2, 2013

I'll try to review the PR this afternoon. I'll merge it if I think it's ready.

@amueller
Member

amueller commented Mar 2, 2013

awesome, thanks :)

@larsmans
Member

larsmans commented Mar 2, 2013

We've got an inconsistency in the documentation. The dev docs say utils is off-limits to end users, while the multiclass docs now advise their use. I'm going to move the latter remark to the dev docs.

@@ -599,13 +667,16 @@ classification loss (:math:`L_{0-1}`) over :math:`n_{\text{samples}}`. By
default, the function normalizes over the samples. To get the sum of the
:math:`L_{0-1}`, set ``normalize`` to ``False``.

In multilabel classification, the :func:`zero_one_loss` function corresponds
to the subset zero-one loss: the subset of labels must be correctly predicted.
Member

I don't get this sentence.

@larsmans
Member

larsmans commented Mar 2, 2013

Ok, pushed to master after squashing. Thanks @arjoly for tackling this important problem: evaluation can be dull and it can make your head hurt, but it's crucial for a machine learning toolkit.

@larsmans closed this Mar 2, 2013
@arjoly
Member Author

arjoly commented Mar 2, 2013

Thanks to all reviewers !!!

@larsmans
Member

larsmans commented Mar 2, 2013

We have a failure on the Numpy 1.3/Scipy 0.7 build bot: ValueError: 0-d arrays can't be concatenated. It seems limited to the model_selection.rst doctests.

@arjoly
Member Author

arjoly commented Mar 3, 2013

I will have a look this afternoon.

@arjoly
Member Author

arjoly commented Mar 3, 2013

I am not able to install numpy 1.3 with python 2.6. :-$
So it is a bit hard to investigate :-(

@ogrisel
Member

ogrisel commented Mar 3, 2013

I think you can reproduce it with numpy 1.3 on python 2.7. I don't see why it would be specific to 2.6.

@larsmans
Member

larsmans commented Mar 4, 2013

No, I suspect it's Numpy-specific.

@arjoly
Member Author

arjoly commented Mar 4, 2013

I am working on it.
I have the proper numpy version now, but the installation of scipy 0.7 failed...

@larsmans
Member

larsmans commented Mar 4, 2013

I bet SciPy has very little to do with this, so you can try a later version first to see if you get the failures.

(Otherwise, try finding an old version of a Linux distro that has these versions and install it in a VM.)

@arjoly
Member Author

arjoly commented Mar 4, 2013

I haven't been able to get python 2.7, numpy 1.3 and scipy 0.7 installed together.
I constantly got the following error:

error: Command "g++ -pthread -fno-strict-aliasing -I/home/ajoly/opt/local/include -DNDEBUG
-g -fwrapv -O3 -Wall  -fPIC -I/home/ajoly/git/numpy-1.3/numpy/core/include 
-I/home/ajoly/opt/python/include/python2.7 -c scipy/sparse/sparsetools/csr_wrap.cxx
 -o build/temp.linux-i686-2.7/scipy/sparse/sparsetools/csr_wrap.o" failed with exit status 1

Same with scipy 0.6 or any scipy 0.7.x version...

With python 2.6, I am not able to install numpy 1.3 due to a problem with unicode characters (the ucs2/ucs4 build issue).

Lastly, scipy 0.8 needs at least numpy 1.4...
This makes me crazy...


I suppose that a simple np.asarray could solve the problem, but I am not able to investigate it.
Or we could handle the multiclass case directly...
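
For reference, a guess at the failure mode (unverified on NumPy 1.3, for the reasons above): np.concatenate rejecting 0-d arrays, which np.atleast_1d, rather than a bare np.asarray, would work around:

import numpy as np

a = np.array(1)   # 0-d arrays, e.g. from indexing down to a scalar
b = np.array(2)

# On NumPy 1.3 this raised "ValueError: 0-d arrays can't be concatenated";
# recent NumPy versions raise a similar error for 0-d inputs.
# np.concatenate([a, b])

# Promoting to 1-d first avoids it (np.asarray alone would not help,
# since it keeps a 0-d array 0-d).
np.concatenate([np.atleast_1d(a), np.atleast_1d(b)])   # array([1, 2])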

@arjoly
Member Author

arjoly commented Mar 4, 2013

Any suggestion for a Linux distro that ships (and, if possible, allows easy installation of) python 2.6, numpy 1.3 and scipy 0.7?

@amueller
Member

amueller commented Mar 4, 2013

Ubuntu Lucid should do.

@arjoly deleted the multilabel-metrics branch March 7, 2013 10:37