Multinomial Bayes issue #5814

mladenk42 · 2015-11-13T20:03:56Z

Given the digits dataset (available in sklearn.datasets), we split it into train and test set.
We fit a MultinomialNB classifier on the train set, and generate predictions on that same train
set. When this is done without smoothing, the classification performance is rather low.
This is contrary to expectations, as the classifier has already seen all possible feature values.
So not including smoothing shouldn't really make such a big difference.

It seems the BernoulliNB class from sklearn also has this problem.

My inefficient but straightforward implementation performs expectedly well on the training set.
Code to reproduce the issue with some more tests is available at http://pastebin.com/2hsrA8xL
I hope the issue is not caused by some trivial implementation detail I've overlooked.

agramfort · 2015-11-14T08:08:13Z

please send a PR so the diff is readable

mladenk42 · 2015-11-14T13:58:55Z

Might be a silly question but where do i send it :)? The scikit-learn-issues mailing list?
Thanks.

agramfort · 2015-11-14T16:52:18Z

I mean open a pull request (PR) on github.

mladenk42 · 2015-11-15T13:07:27Z

Since it's my first time submitting an error report I'm completely lost :(

It says here that PR-s are used to tell others about changes I pushed to a repository. I didn't push anything O_o, nor do I want to. The demo script I submitted (via pastebin link) doesn't fix the issue, or change anything in sklearn code. It's just there to reproduce the issue, so someone who knows the sklearn code can debug it more easily. Perhaps I'm misunderstanding something.

absolutelyNoWarranty · 2015-11-19T10:16:34Z

It seems when alpha=0.0 and the data has features which never change, feature_log_prob_ has -inf which causes the calculations down the line to become all nan.

Given
X = np.array([[1, 0],[1, 1]])
y = np.array([0, 1])

Compare

nb = BernoulliNB(alpha=0.)
nb.fit(X, y)

nb.predict_proba(X)
Out[135]:
array([[ nan,  nan],
       [ nan,  nan]])

nb.feature_log_prob_
Out[139]:
array([[  0., -inf],
       [  0.,   0.]])

and

nb = BernoulliNB(alpha=1e-15)
nb.fit(X, y)
nb.predict_proba(X)

Out[136]:
array([[  1.0000e+00,   8.8818e-16],
       [  1.0000e-15,   1.0000e+00]])

amueller · 2016-09-14T19:21:39Z

We should probably throw an error when that happens if that creates issues.

yl565 · 2016-09-24T16:36:28Z

@amueller Please see my PR to this issue.

jmschrei · 2017-06-19T19:34:06Z

Fixed via #9131

…9131) * Fix #5814 * Fix pep8 in naive_bayes.py:716 * Fix sparse matrix incompatibility * Fix python 2.7 problem in test_naive_bayes * Make sure the values are probabilities before log transform * Improve docstring of `_safe_logprob` * Clip alpha solution * Clip alpha solution * Clip alpha in fit and partial_fit * Add what's new entry * Add test * Remove .project * Replace assert method * Update what's new * Format float into %.1e * Update ValueError msg

…cikit-learn#9131) * Fix scikit-learn#5814 * Fix pep8 in naive_bayes.py:716 * Fix sparse matrix incompatibility * Fix python 2.7 problem in test_naive_bayes * Make sure the values are probabilities before log transform * Improve docstring of `_safe_logprob` * Clip alpha solution * Clip alpha solution * Clip alpha in fit and partial_fit * Add what's new entry * Add test * Remove .project * Replace assert method * Update what's new * Format float into %.1e * Update ValueError msg

amueller added the Bug label Sep 14, 2016

amueller added the Need Contributor label Sep 14, 2016

amueller added this to the 0.19 milestone Sep 14, 2016

yl565 added a commit to yl565/scikit-learn that referenced this issue Sep 23, 2016

Fix scikit-learn#5814

4ce3e8f

yl565 mentioned this issue Sep 23, 2016

[MRG] Fix MultinomialNB and BernoulliNB alpha=0 bug #7477

Closed

yl565 added a commit to yl565/scikit-learn that referenced this issue Oct 17, 2016

Fix scikit-learn#5814

d3bb0ec

amueller removed the Need Contributor label Mar 3, 2017

herilalaina pushed a commit to herilalaina/scikit-learn that referenced this issue Jun 15, 2017

Fix scikit-learn#5814

ddb5383

herilalaina mentioned this issue Jun 15, 2017

[MRG+1] Fix MultinomialNB and BernoulliNB alpha=0 bug (continuation) #9131

Merged

jmschrei closed this as completed in #9131 Jun 19, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multinomial Bayes issue #5814

Multinomial Bayes issue #5814

mladenk42 commented Nov 13, 2015

agramfort commented Nov 14, 2015 via email

mladenk42 commented Nov 14, 2015

agramfort commented Nov 14, 2015 via email

mladenk42 commented Nov 15, 2015

absolutelyNoWarranty commented Nov 19, 2015

amueller commented Sep 14, 2016

yl565 commented Sep 24, 2016

jmschrei commented Jun 19, 2017

Multinomial Bayes issue #5814

Multinomial Bayes issue #5814

Comments

mladenk42 commented Nov 13, 2015

agramfort commented Nov 14, 2015 via email

mladenk42 commented Nov 14, 2015

agramfort commented Nov 14, 2015 via email

mladenk42 commented Nov 15, 2015

absolutelyNoWarranty commented Nov 19, 2015

amueller commented Sep 14, 2016

yl565 commented Sep 24, 2016

jmschrei commented Jun 19, 2017