Fix predict method for multiclass multioutput ensemble models #12834

elsander · 2018-12-19T22:36:36Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

This PR fixes a bug where the predict method would fail for multiclass multioutput ensemble models, if any of the dependent variables were strings. The underlying issue was preallocating the predict output using np.zeros, which would then error when string predictions were inserted. I replaced the function call with a more dtype-agnostic call to np.empty.

Any other comments?

adrinjalali · 2018-12-20T10:23:48Z

sklearn/ensemble/forest.py

@@ -547,7 +547,8 @@ def predict(self, X):

        else:
            n_samples = proba[0].shape[0]
-            predictions = np.zeros((n_samples, self.n_outputs_))
+            predictions = np.empty((n_samples, self.n_outputs_),
+                                   dtype='object')


wouldn't it be better to have dtype=self.classes_.dtype or something?

jnothman · 2018-12-20T10:28:05Z

sklearn/ensemble/tests/test_forest.py

+
+    with np.errstate(divide="ignore"):
+        proba = est.predict_proba(X_test)
+        assert_equal(len(proba), 2)


With the adoption of pytest, we are phasing out use of test helpers assert_equal, assert_true, etc. Please use bare assert statements, e.g. assert x == y, assert not x, etc.

elsander · 2018-12-31T21:47:09Z

Sorry for the delay! I committed a couple of changes to address the code review comments.

adrinjalali

Thanks @elsander , LGTM!

jnothman

LGTM!

Please add an entry to the change log at doc/whats_new/v0.21.rst. Like the other entries there, please reference this pull request with :issue: and credit yourself (and other contributors if applicable) with :user:

…-learn#12834)

…scikit-learn#12834)" This reverts commit 06da503.

…-learn#12834)

BUG fix predict method for multiclass multioutput ensemble models

74cb798

adrinjalali reviewed Dec 20, 2018

View reviewed changes

jnothman reviewed Dec 20, 2018

View reviewed changes

code review fixes

d23a766

adrinjalali approved these changes Jan 2, 2019

View reviewed changes

jnothman approved these changes Jan 2, 2019

View reviewed changes

Liz Sander and others added 2 commits January 2, 2019 09:43

DOC update changelog

e6c4c51

Update v0.21.rst

87af947

jnothman merged commit 6581b0d into scikit-learn:master Jan 2, 2019

rth pushed a commit to rth/scikit-learn that referenced this pull request Jan 3, 2019

FIX predict method for multiclass multioutput ensemble models (scikit…

d47a0b7

…-learn#12834)

adrinjalali pushed a commit to adrinjalali/scikit-learn that referenced this pull request Jan 7, 2019

FIX predict method for multiclass multioutput ensemble models (scikit…

e7f6d4f

…-learn#12834)

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

FIX predict method for multiclass multioutput ensemble models (scikit…

06da503

…-learn#12834)

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

Revert "FIX predict method for multiclass multioutput ensemble models (…

d2a1e28

…scikit-learn#12834)" This reverts commit 06da503.

xhluca pushed a commit to xhluca/scikit-learn that referenced this pull request Apr 28, 2019

Revert "FIX predict method for multiclass multioutput ensemble models (…

a48fe96

…scikit-learn#12834)" This reverts commit 06da503.

koenvandevelde pushed a commit to koenvandevelde/scikit-learn that referenced this pull request Jul 12, 2019

FIX predict method for multiclass multioutput ensemble models (scikit…

6b9ec56

…-learn#12834)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix predict method for multiclass multioutput ensemble models #12834

Fix predict method for multiclass multioutput ensemble models #12834

Uh oh!

elsander commented Dec 19, 2018

Uh oh!

adrinjalali Dec 20, 2018

Uh oh!

jnothman Dec 20, 2018

Uh oh!

elsander commented Dec 31, 2018

Uh oh!

adrinjalali left a comment

Uh oh!

jnothman left a comment

Uh oh!

Uh oh!

Uh oh!

Fix predict method for multiclass multioutput ensemble models #12834

Fix predict method for multiclass multioutput ensemble models #12834

Uh oh!

Conversation

elsander commented Dec 19, 2018

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

adrinjalali Dec 20, 2018

Choose a reason for hiding this comment

Uh oh!

jnothman Dec 20, 2018

Choose a reason for hiding this comment

Uh oh!

elsander commented Dec 31, 2018

Uh oh!

adrinjalali left a comment

Choose a reason for hiding this comment

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!