BUG Fix zero division error in GBDTs #14024
Conversation
@@ -2352,8 +2352,8 @@ def check_decision_proba_consistency(name, estimator_orig):
             hasattr(estimator, "predict_proba")):

         estimator.fit(X, y)
-        a = estimator.predict_proba(X_test)[:, 1]
-        b = estimator.decision_function(X_test)
+        a = estimator.predict_proba(X_test)[:, 1].round(decimals=10)
+        b = estimator.decision_function(X_test).round(decimals=10)
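For context, a minimal sketch of what the surrounding check does. The rank comparison via scipy.stats.rankdata is assumed from sklearn.utils.estimator_checks; the real implementation may differ in its details:

```python
import numpy as np
from scipy.stats import rankdata

def check_decision_proba_consistency_sketch(estimator, X, y, X_test):
    # predict_proba and decision_function must rank the test samples
    # identically; rounding to 10 decimals discards floating point
    # noise before the ranks are compared.
    estimator.fit(X, y)
    a = estimator.predict_proba(X_test)[:, 1].round(decimals=10)
    b = estimator.decision_function(X_test).round(decimals=10)
    np.testing.assert_array_equal(rankdata(a), rankdata(b))
```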
That's not great :-/ Can we avoid that? Basically you're saying after the "fix" the estimator is not consistent any more because of floating point issues?
Because scipy's expit isn't precise enough. But I feel like 1e-10 is a decent precision. I had issues with this test before and IMO it's too strict.
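A small illustration of the precision issue being described (the specific values, 15.0 and a 1e-12 gap, are chosen for this sketch and are not taken from the PR): two distinct decision values whose difference expit cannot resolve in float64 produce tied probabilities, so the unrounded arrays rank differently.

```python
import numpy as np
from scipy.special import expit
from scipy.stats import rankdata

# Two distinct decision values whose gap is compressed below float64
# resolution by the sigmoid: the probabilities tie, the decision
# values do not.
b = np.array([15.0, 15.0 + 1e-12, 0.0])
a = expit(b)

print(a[0] == a[1])                              # True: tied probabilities
print(np.array_equal(rankdata(a), rankdata(b)))  # False: ranks disagree

# Rounding both arrays to 10 decimals also ties the near-equal
# decision values, so the rank comparison passes again.
print(np.array_equal(rankdata(a.round(10)), rankdata(b.round(10))))  # True
```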
Agreed it's fine.
Should we be rounding as a non-copying operation? Is this copying?
It's copying.
I can't tell from the docs (nor the code) whether it copies if we pass out=.
I don't think that's important though?
Using out= will not copy if it's into the same array.
I forgot this was in estimator checks. Yes, not important.
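For reference, a quick numpy demonstration of the copy semantics discussed above:

```python
import numpy as np

a = np.linspace(0, 1, 5)

# ndarray.round returns a new array by default: the original buffer
# is left untouched.
b = a.round(decimals=10)
print(np.shares_memory(a, b))  # False: this is a copy

# Passing out= writes the result into an existing array; using the
# input itself rounds in place with no extra allocation.
c = a.round(decimals=10, out=a)
print(np.shares_memory(a, c))  # True: no copy was made
```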
LGTM
sklearn/ensemble/_hist_gradient_boosting/tests/test_gradient_boosting.py
LGTM once #14024 (comment) is addressed.
Comment was addressed.
Fixes #14018. Probably also fixes #14014.
The non-regression test fails on master.