[MRG+1] Invariance tests for clustering metrics #8102 #8135
Conversation
Invariance tests for clustering metrics #8102
This code forms a common test for the different clustering metrics, based on their common properties.
3rd version
Please sort out your PEP8 compliance and I hope to review in a few days' time.
@jnothman Okay, I will try to sort out the PEP8 compliance.
What is 1.py?
Thanks!!
Please still fix PEP8 issues
Please also test input format invariance: each should accept an array or a list in its place.
Label permutation and format invariance should be tested for unsupervised metrics too.
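For illustration, a minimal sketch of such a format-invariance check might look like this (v_measure_score and the toy labels are just stand-ins, not taken from the PR):

import numpy as np

from sklearn.metrics import v_measure_score

labels_true = [0, 0, 1, 1]
labels_pred = [1, 1, 0, 0]

# The score should not depend on whether the inputs are lists or arrays.
score_from_lists = v_measure_score(labels_true, labels_pred)
score_from_arrays = v_measure_score(np.array(labels_true), np.array(labels_pred))
assert score_from_lists == score_from_arrays
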
1.py
Outdated
@author: user
"""

import numpy as np
We conventionally follow the following import order:
- standard library
- other external libraries
- sklearn
- perhaps testing imports come last
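An illustrative layout following that convention (the specific imports are placeholders, not the PR's actual ones; sklearn.utils.testing was the testing-utilities module at the time):

# standard library
from functools import partial

# other external libraries
import numpy as np

# sklearn
from sklearn.metrics.cluster import adjusted_rand_score

# testing imports come last
from sklearn.utils.testing import assert_almost_equal
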
test_common.py
Outdated
@author: anki08
"""

import numpy as np
We conventionally follow the following import order:
- standard library
- other external libraries
- sklearn
- perhaps testing imports come last
test_common.py
Outdated
@@ -0,0 +1,219 @@
# -*- coding: utf-8 -*-
Why is this not in sklearn/metrics/cluster/tests? Where it is, the tests are not being run by make test or the continuous integration servers.
test_common.py
Outdated
"adjusted_mutual_info_score" | ||
] | ||
|
||
#METRICS where permutations oflabels dont change score |
This should be true of all clustering metrics.
test_common.py
Outdated
"normalized_mutual_info_score" | ||
] | ||
|
||
#metrics which result in 0 when a class is split across different clusters |
I don't get this, nor is CLASS_BASED_METRICS used.
It is basically for supervised metrics which give a high score if the clusters are both homogeneous and complete.
test_common.py
Outdated
#test function for mtericshaving the property of not changing the score when
#the labels are permuted.
def permute_labels():
for name in METRICS_NORMALIZED_OUTPUT:
Wrong metric set
test_common.py
Outdated
assert_equal(adjusted_mutual_info_score(labels_a, labels_b), 0.0)
assert_equal(normalized_mutual_info_score(labels_a, labels_b), 0.0)

#test function for mtericshaving the property of not changing the score when
I think you're confusing what I meant by invariant to label permutation. I mean that for some (binary) y_true, y_pred:
metric(y_true, y_pred) == metric(1 - y_true, y_pred) == metric(y_true, 1 - y_pred) == metric(1 - y_true, 1 - y_pred)
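A minimal sketch of that property, assuming adjusted_rand_score as an example metric:

import numpy as np
from numpy.testing import assert_almost_equal

from sklearn.metrics import adjusted_rand_score

y_true = np.array([0, 0, 1, 1, 0, 1])
y_pred = np.array([1, 0, 1, 1, 0, 0])

# Relabelling either side (swapping 0 and 1) must not change the score.
score = adjusted_rand_score(y_true, y_pred)
assert_almost_equal(score, adjusted_rand_score(1 - y_true, y_pred))
assert_almost_equal(score, adjusted_rand_score(y_true, 1 - y_pred))
assert_almost_equal(score, adjusted_rand_score(1 - y_true, 1 - y_pred))
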
For unsupervised metrics the input format is completely different: the parameters are X and labels. How will the label permutation work there? silhouette_score needs X.shape to execute, which is not present in labels, so it does not follow format invariance.
I just mean that metric(X, y_pred) = metric(X, 1 - y_pred)
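A sketch of what that could look like for an unsupervised metric, assuming silhouette_score and made-up data:

import numpy as np

from sklearn.metrics import silhouette_score

rng = np.random.RandomState(0)
X = rng.rand(8, 3)
y_pred = np.array([0, 1, 0, 1, 0, 1, 1, 0])

# Swapping the two cluster labels leaves the partition, and hence the score, unchanged.
np.testing.assert_allclose(silhouette_score(X, y_pred),
                           silhouette_score(X, 1 - y_pred))
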
test_common.py
Outdated
# If classes members are completely split across different clusters,
#the assignment is totally in-complete, hence the score of these metrics is 0
#they are perfect when the clusters are both homoneneous and complete
def class_based_clusters():
should start with test_
test_common.py
Outdated
#the assignment is totally in-complete, hence the score of these metrics is 0
#they are perfect when the clusters are both homoneneous and complete
def class_based_clusters():
for name in METRICS_NORMALIZED_OUTPUT:
wrong set of metrics
test_common.py
Outdated
var_2 = metric([0, 1, 0, 1, 0, 1], [2, 0, 1, 1, 0, 2])
assert_equal(var_1,var_2)

# If classes members are completely split across different clusters,
This seems much like test_exactly_zero_info_score...?
Added format invariance and changed label permutation tests
Please remove the two old files and only work on one.
test_common.py
Outdated
#test function for metrics whose output in range 0 to 1
def test_normalized_output():
for name in METRICS_NORMALIZED_OUTPUT:
What I meant is that the lower_bound will not be true for asymmetric measures, but probably is for symmetric ones.
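For example, a sketch with homogeneity_score (one of the asymmetric measures in question) and the same style of labellings as in the test:

from sklearn.metrics import homogeneity_score

labels_single_class = [0, 0, 0, 0, 0, 0]   # everything in one class
labels_all_clusters = [0, 1, 2, 3, 4, 5]   # every sample in its own cluster

# homogeneity_score is asymmetric: singleton clusters are trivially "pure",
# so one argument order gives 1.0 while the other gives the lower bound 0.0.
scores = [homogeneity_score(labels_single_class, labels_all_clusters),
          homogeneity_score(labels_all_clusters, labels_single_class)]
assert 0.0 in scores and 1.0 in scores
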
fowlkes_mallows and calinski_harabaz, at least, are not present in your imports.
]

# Metrics where permutations of labels dont change score( 0 and 1 exchchanged)
METRICS_PERMUTE_LABELS = ["homogeneity_score","v_measure_score",
This is true of all clustering in scikit-learn
I.e. it should not have its own group. Just test this property on all.
]

# Input parameters can be both in the form of arrays and lists
METRICS_WITH_FORMAT_INVARIANCE = ["homogeneity_score","v_measure_score",
This is true of all clustering in scikit-learn
for name in CLASS_BASED_METRICS:
metric = ALL_METRICS[name]
assert_equal(metric([0, 0, 0, 0],[0, 1, 2, 3]),0.0)
assert_equal(metric([0, 0, 1, 1],[0, 0, 1, 1]),1.0)
Surely this is testing being normalised, not class-based
metric = ALL_METRICS[name]
assert_equal(metric([0, 0, 0, 0],[0, 1, 2, 3]),0.0)
assert_equal(metric([0, 0, 1, 1],[0, 0, 1, 1]),1.0)
assert_equal(metric([0, 0, 1, 1],[0, 0, 1, 1]),1.0)
This appears to be a duplicate.
I could not run fowlkes_mallows and calinski_harabaz on my laptop, therefore I did not add them. Error:
NameError: name 'calinski_harabaz_score' is not defined
You apparently have an old version of scikit-learn installed. You need to be working with the current master.
"normalized_mutual_info_score","adjusted_rand_score" | ||
] | ||
|
||
# If classes members are completely split across different clusters, |
I still don't get what the point of this group is. adjusted_rand_accuracy and mutual_info_score and fowlkes_mallows_score would also pass the test as it stands.
I would really appreciate it if you used more meaningful commit messages. "Add files via upload" gives me no indication of what's changed. There are a lot of PRs to keep track of.
…Added fowlkes_mallows and calinsky_harabaz . Removed the groups for permute_labels and invariance_format
Please fix your flake8 issues. They make it much harder to focus on the actual work.
Maybe autopep8 would help on this occasion, though usually I'd recommend against it.
Thank you for that. Much appreciated. If you feel like this is basically what you hope to have merged (with some likely changes under review), please change the title prefix from [WIP] to [MRG].
Okay. Thanks a lot for your help.
I can't figure out why the code is failing in CircleCI. Could you please help me, @tguillemot?
@anki08 Don't waste your time, the problem appears in several PRs. It is not related to your work.
@anki08 CI problems seem solved now, can you try a rebase?
It's failing even now:
Don't worry about the CI.
I think this may be mergeable, though we should merge in master to check that CI runs fine, and ideally another core dev should give it a quick look.
What's with the weird sphinx error?
remove outdated comment
We might need to merge in master to get the updated circle script.
@glemaitre, want to review this?
I'll look at it.
I have some pytest updates in fact. The tests themselves seem ok.
for name in SYMMETRIC_METRICS:
metric = SUPERVISED_METRICS[name]
assert_almost_equal(metric(y1, y2), metric(y2, y1),
err_msg="%s is not symmetric" % name)
Shall we add: "... %s was expected to be symmetric"?
for name in NON_SYMMETRIC_METRICS:
metric = SUPERVISED_METRICS[name]
assert_not_equal(metric(y1, y2), metric(y2, y1),
msg="%s is symmetric" % name)
Same suggestion as before, but for the non-symmetric case.
assert_not_equal(metric(y1, y2), metric(y2, y1),
msg="%s is symmetric" % name)

assert_equal(sorted(SYMMETRIC_METRICS + NON_SYMMETRIC_METRICS),
assert sorted(SYMMETRIC_METRICS + NON_SYMMETRIC_METRICS) == sorted(SUPERVISED_METRICS)
for name in NON_SYMMETRIC_METRICS:
metric = SUPERVISED_METRICS[name]
assert_not_equal(metric(y1, y2), metric(y2, y1),
Question: can we have surprises if some numerical error means the equality is not triggered? In this regard, I would write:
assert metric(y1, y2) != pytest.approx(metric(y2, y1))
If we go in this direction, I would suggest replacing assert_almost_equal by
assert x == pytest.approx(y)
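A self-contained sketch of that style, with v_measure_score and homogeneity_score standing in for the symmetric and non-symmetric groups (the toy labellings are illustrative):

import pytest

from sklearn.metrics import homogeneity_score, v_measure_score

y1 = [0, 0, 1, 1, 2, 2]
y2 = [0, 0, 1, 2, 2, 2]

# Symmetric metric: both argument orders agree up to floating point noise.
assert v_measure_score(y1, y2) == pytest.approx(v_measure_score(y2, y1))

# Non-symmetric metric: the two orders are expected to genuinely differ,
# not merely by rounding error.
assert homogeneity_score(y1, y2) != pytest.approx(homogeneity_score(y2, y1))
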
upper_bound_2 = [0, 0, 0, 1, 1, 1]
for name in NORMALIZED_METRICS:
metric = SUPERVISED_METRICS[name]
assert_greater(metric([0, 0, 0, 1, 1], [0, 0, 0, 1, 2]), 0.0)
Replace:
- assert_greater(x, y) by assert x > y
- assert_less(x, y) by assert x < y
- assert_equal(x, y) by assert x == y
score_1 = metric(y_pred, y_label)
assert_almost_equal(score_1, metric(1 - y_pred, y_label),
err_msg="%s failed labels permutation" % name)
assert_almost_equal(score_1, metric(1 - y_pred, 1 - y_label),
we can use approx
y_true = [0, 0, 0, 0, 1, 1, 1, 1]
y_pred = [0, 1, 2, 3, 4, 5, 6, 7]

def generate_formats(y):
we can parametrize using pytest
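For instance, a sketch of one way to parametrize (here over input containers; the fixture data and metric choice are illustrative, and parametrizing over metric names works the same way):

import numpy as np
import pytest

from sklearn.metrics import adjusted_rand_score

y_true = [0, 0, 0, 0, 1, 1, 1, 1]
y_pred = [0, 1, 2, 3, 4, 5, 6, 7]


# Each input container becomes its own test case in pytest's report.
@pytest.mark.parametrize("container", [list, np.array], ids=["list", "array"])
def test_format_invariance(container):
    score = adjusted_rand_score(container(y_true), container(y_pred))
    assert score == pytest.approx(adjusted_rand_score(y_true, y_pred))
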
for name in UNSUPERVISED_METRICS:
metric = UNSUPERVISED_METRICS[name]
X = np.random.randint(10, size=(8, 10))
randint(..., dtype=np.float64) will avoid an astype afterwards.
Oops, my comment is wrong.
@glemaitre aside from cosmetics, does this look good to you? I'd like to merge this so we can be more confident about PRs like #10827.
@jnothman The only thing that I can see is that the upper bound is not tested, while the lower bound is. Otherwise, I am fine with the principle. Checking the other common tests for metrics, I see that we don't test:
- infinite or nan input
- single sample
Should these be considered in the tests?
assert_equal(metric(upper_bound_1, upper_bound_2), 1.0,
msg="%s has upper_bound greater than 1" % name)

lower_bound_1 = [0, 0, 0, 0, 0, 0]
Just out of curiosity, shall we test the upper bound as well?
My bad, it was just above.
metric = SUPERVISED_METRICS[name]
score = [metric(lower_bound_1, lower_bound_2),
metric(lower_bound_2, lower_bound_1)]
assert_true(0.0 in score,
assert_allclose
Non-finite and single sample both sound like good ideas.