
[MRG+1] SimpleImputer(strategy="constant") #11211


Merged
merged 33 commits on Jun 20, 2018

Conversation

jeremiedbb
Member

This is WIP to implement the constant strategy for the SimpleImputer as described in #11208.

@@ -94,6 +104,13 @@ class SimpleImputer(BaseEstimator, TransformerMixin):
each column.
- If "most_frequent", then replace missing using the most frequent
value along each column.
- If "constant", then replace missing values with fill_value
Member

style: end with "."

if self.missing_values == "NaN" or np.isnan(self.missing_values):
force_all_finite = "allow-nan"
else:
force_all_finite = True
@ogrisel ogrisel Jun 6, 2018

Actually this should be:

if isinstance(self.missing_values, numbers.Real) and np.isnan(self.missing_values):
    force_all_finite = "allow-nan"
else:
    force_all_finite = True

or even:

force_all_finite = True if self.missing_values is not np.nan else 'allow-nan'
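A quick numpy sketch of why the isinstance guard matters (the helper name is illustrative, not from the PR): np.isnan raises a TypeError on strings, and the shorter `is np.nan` identity test misses other NaN objects such as float('nan').

```python
import numbers
import numpy as np

def needs_allow_nan(missing_values):
    # Guard with isinstance: np.isnan("NaN") raises a TypeError,
    # so only call it on real numbers.
    return bool(isinstance(missing_values, numbers.Real)
                and np.isnan(missing_values))

print(needs_allow_nan(np.nan))        # True
print(needs_allow_nan(0))             # False
print(needs_allow_nan("NaN"))         # False, and no TypeError
print(needs_allow_nan(float("nan")))  # True: catches any NaN, not just np.nan
```

Note that `float("nan") is np.nan` is False, which is why the identity-based one-liner is weaker than the isinstance/isnan check.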

@@ -115,16 +132,41 @@ class SimpleImputer(BaseEstimator, TransformerMixin):
Notes
-----
Columns which only contained missing values at `fit` are discarded upon
`transform`.
`transform` is strategy is not "constant"
Member

is -> if


@jorisvandenbossche jorisvandenbossche left a comment

minor comments

if value_to_mask == "NaN" or np.isnan(value_to_mask):
return np.isnan(X)
if value_to_mask is np.nan:
# nan values are never equal to themselves
Member

I would add that this way it also works for object dtypes (in case somebody wonders why np.isnan is not used).
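The self-inequality trick can be sketched as a standalone illustration (the helper name mirrors the private masking code being discussed, but this is not the PR's actual implementation):

```python
import numpy as np

def get_mask(X, value_to_mask):
    if value_to_mask is np.nan:
        # NaN is never equal to itself, so X != X marks exactly the
        # NaN entries; unlike np.isnan, this also works on object
        # dtype arrays (e.g. strings mixed with np.nan).
        return X != X
    return X == value_to_mask

X = np.array(["a", np.nan, "b"], dtype=object)
print(get_mask(X, np.nan).tolist())  # [False, True, False]
print(get_mask(X, "a").tolist())     # [True, False, False]
```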

@@ -80,10 +82,10 @@ class SimpleImputer(BaseEstimator, TransformerMixin):

Parameters
----------
missing_values : integer or "NaN", optional (default="NaN")
missing_values : real number, string, np.nan or None,
optional (default=np.nan).
Member

Sphinx does not really like this way of wrapping the line...
If you want it to render nicely we can do

    missing_values : real number, string, np.nan or None, \
optional (default=np.nan).
         The placeholder for ...


@ogrisel ogrisel left a comment

This is starting to look really good. As discussed in person, the main thing missing is a narrative doc update.

Here are some further comments in the code:


def test_imputation_constant_object():
# Test imputation using the constant strategy
# on objects
Member

nitpick: comment fits on one line.

Member

This comment has been copied several times.

else:
dtype = object

return np.full(X.shape[1], fill_value, dtype=dtype)
Member

Maybe this should always be:

return np.full(X.shape[1], fill_value, dtype=X.dtype)

independently of fill_value, no? If fill_value is inconsistent with X.dtype after validation, fit should have already raised an informative error message above.

if self.strategy == "constant":
if (X.dtype.kind in ("i", "f")
and not isinstance(fill_value, numbers.Real)):
raise ValueError(
Member

This should probably be a TypeError because we check isinstance in the condition.

and not isinstance(fill_value, numbers.Real)):
raise ValueError(
"fill_value={0} is invalid. Expected a numerical value "
"to numerical data".format(fill_value))
Member

We should also issue informative error messages in the following cases:

elif X.dtype.kind == "O" and not isinstance(fill_value, six.string_types):
    raise TypeError("fill_value={0} is invalid. Expected an str instance"
                    " when imputing categorical data.".format(fill_value))
else:
    raise ValueError("SimpleImputer cannot work on data with dtype={0}:"
                     " expecting numerical or categorical data with"
                     " dtype=object".format(X.dtype))

We should also add tests to check that those exceptions and their message are raised on invalid inputs.

git grep "with pytest.raises" to find examples of how to test exceptions in the scikit-learn test suite using pytest.
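A minimal sketch of such a test, assuming pytest is available; the validation function here is an illustrative stand-in, not the PR's actual code:

```python
import numbers
import numpy as np
import pytest

def validate_fill_value(X, fill_value):
    # Illustrative stand-in for the fit-time validation discussed above.
    if (X.dtype.kind in ("i", "u", "f")
            and not isinstance(fill_value, numbers.Real)):
        raise ValueError(
            "fill_value={0} is invalid. Expected a numerical value "
            "when imputing numerical data".format(fill_value))

def test_constant_invalid_fill_value():
    X = np.array([[1.0, 2.0]])
    # match= takes a regex that is searched against str(exc)
    with pytest.raises(ValueError, match="fill_value=x is invalid"):
        validate_fill_value(X, "x")

test_constant_invalid_fill_value()
print("ok")
```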

def _dense_fit(self, X, strategy, missing_values):
# Constant
elif strategy == "constant":

Member

style consistency: no need for a blank line here.

force_all_finite="allow-nan"
if self.missing_values is np.nan else True)
Member

style: please define a local variable to improve readability:

force_all_finite = "allow-nan" if self.missing_values is np.nan else True

@ogrisel ogrisel left a comment

One more comment.

if X.dtype.kind == "O":
most_frequent = np.empty(X.shape[0], dtype=object)
else:
most_frequent = np.empty(X.shape[0])
Member

Maybe this should be:

most_frequent = np.empty(X.shape[0], dtype=X.dtype)

Member Author

@ogrisel actually, we don't always want most_frequent to have the same dtype as X. For example, if there is a column full of NaNs in an integer array, X has integer dtype, whereas most_frequent (which stores the statistics column-wise) will have a np.nan for the column of NaNs.

Member

As discussed here: if you have an integer column of all -1 and specify missing_values=-1, then most_frequent will be NaN (due to implementation details, to signal this case), and X.dtype would then clash with NaN.
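The clash is easy to reproduce with numpy: a float statistics array can hold the NaN sentinel, while an integer array cannot.

```python
import numpy as np

# The statistics array defaults to float64, which can hold NaN
# as the "column had only missing values" sentinel:
stats = np.empty(2)
stats[0] = np.nan  # fine

# Reusing X.dtype would break for integer input: NaN cannot be
# stored in an integer array.
int_stats = np.empty(2, dtype=np.int64)
try:
    int_stats[0] = np.nan
except ValueError as exc:
    print("ValueError:", exc)
```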

Member

ok

@ogrisel
Member

ogrisel commented Jun 8, 2018

BTW there are still some tests broken because of the switch from "NaN" to np.nan as the default missing value marker.

We decided not to use a string as the default marker to avoid weird behavior in case a user has the "NaN" string in a CSV file, for instance. I think it's less surprising not to have a "magic" string in SimpleImputer, and now is the time to change this because the new SimpleImputer class is still unreleased.

@glemaitre glemaitre added this to the 0.20 milestone Jun 8, 2018
@jeremiedbb
Member Author

The switch from "NaN" to np.nan for the default value of missing_values does not seem to be a good idea after all. Setting a parameter default to np.nan is not possible right now: there is a general test for all estimators that checks the default parameters.

    # We get the default parameters from init and then
    # compare these against the actual values of the attributes.

(from estimator_checks.py in check_parameters_default_constructible)

It results in a nan != nan error.

I see 3 solutions:

  • Modify check_parameters_default_constructible to allow np.nan. I don't think that one is a good idea.
  • Go back to "NaN" for the default value.
  • Set the default value to None and call fit with missing_values=np.nan if X is numerical, and leave None otherwise.

Which solution should we apply?
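The nan != nan failure described above is easy to reproduce; the robust comparison for such a check is identity or an explicit NaN test rather than ==:

```python
import math
import numpy as np

default = np.nan  # the default in the signature
stored = np.nan   # what __init__ stored on the estimator

# Naive equality flags a mismatch even though both are NaN:
print(default == stored)  # False: NaN never compares equal to itself
# Identity works here because both names refer to the np.nan object:
print(default is stored)  # True
# An explicit NaN check is the robust fix for a default-parameter check:
print(default == stored or (math.isnan(default) and math.isnan(stored)))  # True
```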

@ogrisel
Member

ogrisel commented Jun 11, 2018

Hmm, pandas.read_csv uses object dtype columns with np.nan for missing values when parsing CSV files with string contents in a column. So using None as the default missing value for non-numerical columns will not work out of the box for this kind of pipeline.

Maybe we should go back to the "NaN" special magic string. But this means that we will not be able to properly handle columns that have this token as a legitimate value and missing values in the same column.

Another option would be to introduce a module level MISSING singleton marker:

class MissingValueMarker:
    pass


MISSING = MissingValueMarker()


class SimpleImputer(strategy='...', missing_values=MISSING, ...):
    ...

This marker could by default match np.nan both for floating point and object dtype data. For integer-dtyped data it would not match anything, though, and the user would be required to pass a specific integer value explicitly.
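A sketch of how such a sentinel could resolve at fit time (hypothetical helper; as the following comments show, the idea was ultimately dropped in favor of np.nan):

```python
import numpy as np

class MissingValueMarker:
    """Module-level sentinel: 'pick a dtype-appropriate default'."""
    pass

MISSING = MissingValueMarker()

def resolve_missing_values(missing_values, X):
    # Sketch of the proposed resolution: the sentinel defaults to
    # np.nan for float and object data, while integer data would
    # require an explicit marker from the user.
    if missing_values is not MISSING:
        return missing_values
    if X.dtype.kind in ("f", "O"):
        return np.nan
    raise ValueError("integer data requires an explicit missing_values marker")

print(resolve_missing_values(MISSING, np.array([1.0, np.nan])))  # nan
print(resolve_missing_values(-1, np.array([1, -1])))             # -1
```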

@jeremiedbb
Member Author

jeremiedbb commented Jun 11, 2018

I said "and leave None otherwise", but we could call fit with missing_values=np.nan for all dtypes.
So instead of

 if sparse.issparse(X):
      self.statistics_ = self._sparse_fit(X,
                                          self.strategy,
                                          self.missing_values,
                                          fill_value)

we'd have something like

missing_values = np.nan if self.missing_values is None else self.missing_values

 if sparse.issparse(X):
     self.statistics_ = self._sparse_fit(X,
                                         self.strategy,
                                         missing_values,
                                         fill_value)

is it fine?

@ogrisel
Member

ogrisel commented Jun 11, 2018

The problem with this approach is that nobody could use None as the missing value marker in that case. There might be pipelines where None is used upstream as the missing value marker for object dtyped data even if it's not the default behavior of pandas dataframes.

@jeremiedbb
Member Author

That's a problem indeed.

"Another option would be to introduce a module level MISSING singleton marker"

It seems that's the only way to avoid the problem of having a default value different from the np.nan missing_values marker. And it's more elegant.
I'm going with this solution if it's OK with you.

@jnothman
Member

jnothman commented Jun 11, 2018 via email

@jeremiedbb
Member Author

Yes, it's the only remaining issue regarding np.nan as the default value. It only happens when checking the constructor. Afterwards, the imputer works fine with NaNs.

@jorisvandenbossche
Member

Yes, agree with @jnothman. np.nan is the logical default, so it seems a bit stupid to introduce some singleton marker that will be replaced with np.nan afterwards anyhow, just to satisfy the current estimator checks.


if not isinstance(fill_value, six.string_types):
raise TypeError(
"fill_value={0} is invalid. Expected an str instance "
"when imputing categorical data.".format(fill_value))
Member

I am not sure it is good / needed to be that strict here. For example, you can have Categorical data in pandas with integer categories (or other non-string values) where it can make sense to fill with another value (and column.dtype.kind will be "O" for categorical columns)
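A numpy-only illustration of the point: object dtype does not imply strings, so a strict str check on fill_value can reject reasonable fills (the pandas Categorical scenario above is the motivating case; here a plain object array stands in for it).

```python
import numpy as np

# An object-dtype column does not have to contain strings, e.g. a
# categorical column with integer categories can surface as object
# dtype when converted to a numpy array:
col = np.array([1, 2, None], dtype=object)
print(col.dtype.kind)  # O

# A strict "fill_value must be a string" check would reject an
# integer fill for such a column, even though it can make sense:
fill_value = 3
print(isinstance(fill_value, str))  # False -> the strict check raises
```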

Member

We should have a test for this specific use case: imputing a categorical column with a pandas categorical value.

Member Author

maybe

if not (isinstance(fill_value, six.string_types) or isinstance(fill_value, numbers.Real) or fill_value is None)

is enough?

Member

We talked IRL with @jorisvandenbossche: if the categorical dataframe has integer values with missing values, then the check_array conversion will yield a floating point array, so it's already covered by the existing tests.

Let's stay strict in these checks for now. If a user reports a valid use case for being laxer, we can always change that later.

@jnothman jnothman left a comment

Please also avoid @ referencing me in commit messages... if it gets merged, it sends me spam notifications every time someone does something silly (like rebasing) with public forks of scikit-learn.

def _validate_input(self, X):
allowed_strategies = ["mean", "median", "most_frequent", "constant"]
if self.strategy not in allowed_strategies:
raise ValueError("Can only use these strategies: {0} "
Member

No test coverage

if invalid_mask.any():
missing = np.arange(X.shape[1])[invalid_mask]
if self.verbose:
warnings.warn("Deleting features without "
Member

Not tested

@sklearn-lgtm

This pull request introduces 3 alerts when merging 20456f4 into bb38539 - view on LGTM.com

new alerts:

  • 3 for Comparison of identical values

Comment posted by LGTM.com

@jnothman
Member

Please add an entry to the change log at doc/whats_new/v0.20.rst. Like the other entries there, please reference this pull request with :issue: and credit yourself (and other contributors, if applicable) with :user:.

Or perhaps modify the existing entry introducing SimpleImputer. Note the differences:

  • strategy='constant'
  • strategy='most_frequent' with string columns
  • missing_values="NaN" should now be missing_values=np.nan
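The differences listed above, sketched as a usage example against the merged API (assumes scikit-learn >= 0.20, where SimpleImputer lives in sklearn.impute):

```python
import numpy as np
from sklearn.impute import SimpleImputer

# missing_values now defaults to np.nan instead of the "NaN" string.
num = SimpleImputer(strategy="constant", fill_value=0)
X_num = np.array([[1.0, np.nan], [np.nan, 3.0]])
print(num.fit_transform(X_num))

# strategy="constant" (and "most_frequent") also work on
# object-dtype string columns.
cat = SimpleImputer(strategy="constant", fill_value="missing")
X_cat = np.array([["a", np.nan], [np.nan, "b"]], dtype=object)
print(cat.fit_transform(X_cat))
```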

@@ -80,10 +95,10 @@ class SimpleImputer(BaseEstimator, TransformerMixin):

Parameters
----------
missing_values : integer or "NaN", optional (default="NaN")
missing_values : real number, string, np.nan or None, \
optional (default=np.nan)
Member

Please replace these two lines with:

    missing_values : number, string, np.nan (default) or None

what we currently have here is unnecessarily verbose.

" value when imputing numerical"
" data".format(fill_value))

elif X.dtype.kind == "O":
Member

Hmmm... object dtype does not necessarily mean string data. And string data does not necessarily mean object dtype (numpy.asarray with strings will not result in object data; we need to test this case). So I'm not sure this validation or its error message is quite right. I don't think we need to support numeric data in object arrays necessarily, but we do need to support string arrays if we're going to support object arrays, IMO, and we need to make sure the error messages are appropriate to the various cases.

Member

+1 for adding support and tests for accepting string arrays, even if in most cases that is not a recommended data structure for representing categorical variables.

As for the input validation on object dtype array, we can just drop the elif X.dtype.kind == "O" case and not raise any TypeError and accept any fill_value when X.dtype.kind == "O".

Member

Note that there are some complexities to deal with in that case, e.g. the fact that filling with new values might truncate silently:

In [5]: a = np.array(['a', 'b', 'a'])

In [6]: mask = a == 'b'

In [7]: a[mask] = 'missing'

In [8]: a
Out[8]: 
array(['a', 'm', 'a'],
      dtype='<U1')

But that's then up to the user's responsibility?

Member

object dtype does not necessarily mean string data

On this part (so not about supporting string dtype), I think we briefly discussed this and our feeling was: "let's be more restrictive than theoretically needed for now, if there is request for it we can always relax later".

For example, a situation where you can get the case of array of object dtype consisting of mixed types (and not only strings), is if you pass a subset of columns of a dataframe with both string and numerical columns to the SimpleImputer. But, normally you will want a different fill value for both types of columns, and you will need a ColumnTransformer anyhow to have a specific SimpleImputer for each type of columns.

Member Author

Can't we force dtype=object as soon as the input dtype is not numeric?
Something like:

if X.dtype.kind not in ("i", "f"):
    X = X.astype(object)

right after check_array?

Member

Thanks @jorisvandenbossche, indeed imputing a string array is not simple to implement and probably useless in practice.

I think I prefer to raise an informative error message asking the user to provide either numerical data with integer or floating point data types, or categorical data represented with integer or object datatypes.

If we discover a common pipeline where the implicit conversion from string dtype to object dtype suggested above by @jeremiedbb is helpful we might want to consider it later, but to me it sounds like a YAGNI.

@@ -94,6 +113,13 @@ class SimpleImputer(BaseEstimator, TransformerMixin):
each column.
- If "most_frequent", then replace missing using the most frequent
value along each column.
- If "constant", then replace missing values with fill_value.

Member

Please note here (or in a Notes section) that most_frequent and constant work with strings or numeric data, while the others only work with numeric data.

@jeremiedbb
Member Author

jeremiedbb commented Jun 18, 2018

outdated



raise TypeError("The SimpleImputer does not support this datatype"
" ({0}). Please provide either numeric data or"
" categorical data represented by integer or "
"object datatypes.".format(X.dtype))
Member

I am wondering: do we usually raise TypeError or ValueError when the object type is valid, but the dtype is not?

Also, the message is a bit confusing: one could get the impression that float data is not supported.

"""
SimpleImputer does not work on data with dtype {0}. Please provide either a numeric array (with a floating point or integer dtype) or categorical data represented either as an array with integer dtype or an array of string values with an object dtype.
"""

Better to be explicit in the error message; verbosity is not a problem here.

Member

numpy raises TypeError for bad dtypes. E.g. np.log(np.asarray(['hello']))

Member

numpy is not very consistent:

>>> np.array(['a', 'b'], dtype=object).astype(np.uint8)
Traceback (most recent call last):
  File "<ipython-input-6-3200a6709d46>", line 1, in <module>
    np.array(['a', 'b'], dtype=object).astype(np.uint8)
ValueError: invalid literal for int() with base 10: 'a'

It's true that if you consider the individual scalar operation the TypeError raised by a ufunc might make sense. If you consider that the array as a whole is invalid, then ValueError makes more sense. I think both are fine in this case.


not isinstance(fill_value, numbers.Real)):
raise TypeError("'fill_value'={0} is invalid. Expected a numerical"
" value when imputing numerical"
" data".format(fill_value))
@ogrisel ogrisel Jun 20, 2018

As we switched to ValueError above, we should probably be consistent and do it here as well.

I think ValueError is fine because this is already what is raised by most models that use the standard check_X_y validation:

>>> from sklearn.linear_model import LogisticRegression
>>> LogisticRegression().fit([['invalid'], ['invalid']], [0, 1])
Traceback (most recent call last):
  File "<ipython-input-6-91496e8d832f>", line 1, in <module>
    LogisticRegression().fit([['invalid', 'invalid']], [0, 1])
  File "/home/ogrisel/code/scikit-learn/sklearn/linear_model/logistic.py", line 1217, in fit
    order="C")
  File "/home/ogrisel/code/scikit-learn/sklearn/utils/validation.py", line 671, in check_X_y
    ensure_min_features, warn_on_dtype, estimator)
  File "/home/ogrisel/code/scikit-learn/sklearn/utils/validation.py", line 494, in check_array
    array = np.asarray(array, dtype=dtype, order=order)
  File "/home/ogrisel/.virtualenvs/py36/lib/python3.6/site-packages/numpy/core/numeric.py", line 492, in asarray
    return array(a, dtype, copy=False, order=order)
ValueError: could not convert string to float: 'invalid'


@ogrisel ogrisel merged commit 0d8a04b into scikit-learn:master Jun 20, 2018
@ogrisel
Member

ogrisel commented Jun 20, 2018

Thank you very much @jeremiedbb!

@amueller
Member

I like the new example, but I feel it could be simplified. Can we not remove the fill_value and handle_unknown parameters?
Also, I forgot why remainder='passthrough' is the default. What was the reason for that again?
What was the reason for going from make_pipeline to Pipeline? It seems it makes stuff quite a bit longer...

@glemaitre
Member

What was the reason for going from make_pipeline to Pipeline? It seems it makes stuff quite a bit longer...

It makes the example a bit longer, but the parameter names to be set are more user-friendly for an example:

  • Initial name: 'columntransformer__pipeline-0__simpleimputer__strategy'
  • Now: 'preprocessor__num__imputer__strategy'

@jorisvandenbossche
Member

Also, I forgot why remainder='passthrough' by default. What was the reason for that again?

Should we reconsider this? If we want to have 'drop' in most of our examples, that might be an indication.
I also had a colleague who had not dropped 'y' from their data, and because it was passed through in the ColumnTransformer, noticed very good results ...

@jnothman
Member

jnothman commented Jul 3, 2018 via email
