FIX wrong usage and occurrence of string tag #14043

glemaitre · 2019-06-07T20:51:59Z

The right tag to use should be string and not str. However, there are 2 occurrences within the code base.

I am unsure if we should backport this in 0.21.3

rth

Thanks for catching this!

jnothman · 2019-06-10T12:55:19Z

.... can we add some tag validation in check_estimator???

rth · 2019-06-11T10:27:26Z

.... can we add some tag validation in check_estimator???

I thought about that, but then it means that if some contrib packages add their own tags check_estimator would fail (at least until #13969 is implemented).

thomasjpfan · 2019-06-11T14:10:09Z

In check_estimators, we can add an optional keyword valid_tags, which check_estimators can use to validate tags.

amueller · 2019-06-11T19:38:17Z

What we actually should be doing is using these tags to generate data and run the tests with this generated data.

qinhanmin2014 · 2019-06-12T13:28:05Z

I thought about that, but then it means that if some contrib packages add their own tags check_estimator would fail (at least until #13969 is implemented).

+1

qinhanmin2014

let's merge this one first?

qinhanmin2014 · 2019-06-12T13:30:05Z

sklearn/utils/estimator_checks.py

@@ -663,7 +663,8 @@ def check_dtype_object(name, estimator_orig):
        if "Unknown label type" not in str(e):
            raise

-    if 'str' not in tags['X_types']:
+    tags = _safe_tags(estimator)


this seems redundant?

qinhanmin2014 · 2019-06-12T13:53:56Z

We already have tags = _safe_tags(estimator_orig) above?

codecov · 2019-06-12T14:29:25Z

Codecov Report

Merging #14043 into master will not change coverage.
The diff coverage is 100%.

@@           Coverage Diff           @@
##           master   #14043   +/-   ##
=======================================
  Coverage   96.81%   96.81%           
=======================================
  Files         393      393           
  Lines       71911    71911           
  Branches     7887     7887           
=======================================
  Hits        69623    69623           
  Misses       2265     2265           
  Partials       23       23

Impacted Files	Coverage Δ
sklearn/impute/_base.py	`98.32% <ø> (ø)`	⬆️
sklearn/utils/estimator_checks.py	`94.02% <100%> (ø)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 227ebc4...c74e110. Read the comment docs.

glemaitre · 2019-06-13T09:25:52Z

We already have tags = _safe_tags(estimator_orig) above?

Oh I see. Sorry about that. I did not see it.

FIX wrong usage and occurrence of string tag

496eab9

rth approved these changes Jun 8, 2019

View reviewed changes

rth added this to the 0.21.3 milestone Jun 8, 2019

Merge branch 'master' into is/string_tag

91623dd

qinhanmin2014 approved these changes Jun 12, 2019

View reviewed changes

fix

c74e110

jnothman approved these changes Jun 13, 2019

View reviewed changes

fix

19c7313

qinhanmin2014 approved these changes Jun 13, 2019

View reviewed changes

qinhanmin2014 merged commit 8fe89ea into scikit-learn:master Jun 13, 2019

qinhanmin2014 mentioned this pull request Jun 13, 2019

Add a test to ensure that estimators do not have invalid estimator tags #14082

Open

jnothman pushed a commit to jnothman/scikit-learn that referenced this pull request Jun 24, 2019

FIX wrong usage and occurrence of string tag (scikit-learn#14043)

7e874b4

koenvandevelde pushed a commit to koenvandevelde/scikit-learn that referenced this pull request Jul 12, 2019

FIX wrong usage and occurrence of string tag (scikit-learn#14043)

93ebfe3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIX wrong usage and occurrence of string tag #14043

FIX wrong usage and occurrence of string tag #14043

glemaitre commented Jun 7, 2019

rth left a comment

jnothman commented Jun 10, 2019

rth commented Jun 11, 2019

thomasjpfan commented Jun 11, 2019

amueller commented Jun 11, 2019

qinhanmin2014 commented Jun 12, 2019

qinhanmin2014 left a comment

qinhanmin2014 Jun 12, 2019

qinhanmin2014 commented Jun 12, 2019

codecov bot commented Jun 12, 2019 •

edited

Loading

glemaitre commented Jun 13, 2019

FIX wrong usage and occurrence of string tag #14043

FIX wrong usage and occurrence of string tag #14043

Conversation

glemaitre commented Jun 7, 2019

rth left a comment

Choose a reason for hiding this comment

jnothman commented Jun 10, 2019

rth commented Jun 11, 2019

thomasjpfan commented Jun 11, 2019

amueller commented Jun 11, 2019

qinhanmin2014 commented Jun 12, 2019

qinhanmin2014 left a comment

Choose a reason for hiding this comment

qinhanmin2014 Jun 12, 2019

Choose a reason for hiding this comment

qinhanmin2014 commented Jun 12, 2019

codecov bot commented Jun 12, 2019 • edited Loading

Codecov Report

glemaitre commented Jun 13, 2019

codecov bot commented Jun 12, 2019 •

edited

Loading