Use scipy.special.xlogy to avoid indefinite limit in 0 for x*log(y) #12915

rth · 2019-01-03T12:14:20Z

x*log(y) is undefined when both x and y are zero,

>>> import numpy as np
>>> 0*np.log(0)
__main__:1: RuntimeWarning: divide by zero encountered in log
__main__:1: RuntimeWarning: invalid value encountered in double_scalars
nan

currently we avoid this by clipping x and y to a very small value (e.g. 1e-10) instead of 0.

A cleaner solution is to use scipy.special.xlogy,

>>> from scipy.special import xlogy
>>> xlogy(0, 0)
0.0

which produces the correct limit in 0 and has a comparable performance otherwise.

rth · 2019-01-03T12:27:34Z

sklearn/neural_network/tests/test_mlp.py

@@ -297,7 +297,7 @@ def test_multilabel_classification():
                        max_iter=150, random_state=0, activation='logistic',
                        learning_rate_init=0.2)
    mlp.fit(X, y)
-    assert_equal(mlp.score(X, y), 1)
+    assert_greater(mlp.score(X, y), 0.97)


This fails otherwise, as the score is 0.98 in this case.

Testing that mean test accuracy is exactly 1 on this random dataset with those parameters doesn't sound too robust, and the change of the numerical accuracy in log loss might have been enough to break this condition.

qinhanmin2014 · 2019-01-03T15:05:46Z

I guess we don't need tests and what's new here.

…y) (scikit-learn#12915)

…r x*log(y) (scikit-learn#12915)" This reverts commit 6b476ca.

…y) (scikit-learn#12915)

rth added 2 commits January 3, 2019 14:08

Use scipy.special.xlogy

cf47ea3

Fix tests

f60c733

rth commented Jan 3, 2019

View reviewed changes

Lint

4d5a92b

TomDLT approved these changes Jan 3, 2019

View reviewed changes

qinhanmin2014 approved these changes Jan 3, 2019

View reviewed changes

qinhanmin2014 merged commit 836a812 into scikit-learn:master Jan 3, 2019

rth deleted the xlogy branch January 3, 2019 15:08

adrinjalali pushed a commit to adrinjalali/scikit-learn that referenced this pull request Jan 7, 2019

MNT Use scipy.special.xlogy to avoid indefinite limit in 0 for x*log(…

3e10ea7

…y) (scikit-learn#12915)

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

MNT Use scipy.special.xlogy to avoid indefinite limit in 0 for x*log(…

6b476ca

…y) (scikit-learn#12915)

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

Revert "MNT Use scipy.special.xlogy to avoid indefinite limit in 0 fo…

f48f3a9

…r x*log(y) (scikit-learn#12915)" This reverts commit 6b476ca.

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

Revert "MNT Use scipy.special.xlogy to avoid indefinite limit in 0 fo…

8ea30d9

…r x*log(y) (scikit-learn#12915)" This reverts commit 6b476ca.

koenvandevelde pushed a commit to koenvandevelde/scikit-learn that referenced this pull request Jul 12, 2019

MNT Use scipy.special.xlogy to avoid indefinite limit in 0 for x*log(…

a7297cb

…y) (scikit-learn#12915)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Use scipy.special.xlogy to avoid indefinite limit in 0 for x*log(y) #12915

Use scipy.special.xlogy to avoid indefinite limit in 0 for x*log(y) #12915

Uh oh!

rth commented Jan 3, 2019 •

edited

Loading

Uh oh!

rth Jan 3, 2019 •

edited

Loading

Uh oh!

qinhanmin2014 commented Jan 3, 2019

Uh oh!

Uh oh!

Uh oh!

Use scipy.special.xlogy to avoid indefinite limit in 0 for x*log(y) #12915

Use scipy.special.xlogy to avoid indefinite limit in 0 for x*log(y) #12915

Uh oh!

Conversation

rth commented Jan 3, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rth Jan 3, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

qinhanmin2014 commented Jan 3, 2019

Uh oh!

Uh oh!

rth commented Jan 3, 2019 •

edited

Loading

rth Jan 3, 2019 •

edited

Loading