[MRG+1] Multioutput bagging #4848
Conversation
bump

    proba[k] += all_proba[j][k]

    for k in range(self.n_outputs_):
        proba[k] /= self.n_estimators
Do we actually need a loop here?
It's needed to get a probability. Do you have a suggestion?
Why not simply

    proba /= self.n_estimators

unless proba is not a NumPy array?
It's a list of the probability arrays, one per output, so it's not a single NumPy array.
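The distinction discussed above can be sketched as follows: for multi-output data, proba is a plain Python list of per-output arrays, so in-place division must be applied per element (variable names here are illustrative, not the PR's exact code):

```python
import numpy as np

n_estimators = 10
n_samples = 5
n_classes = [2, 4, 3]  # the class count may differ per output
n_outputs = len(n_classes)

# A list of (n_samples, n_classes_k) arrays, one per output, holding
# summed votes. `proba /= n_estimators` would fail on the list itself.
proba = [np.full((n_samples, k), float(n_estimators)) for k in n_classes]

# The loop divides each per-output array in place.
for k in range(n_outputs):
    proba[k] /= n_estimators
```

With a single-output ndarray the loop would indeed be unnecessary; it is the list-of-arrays representation that forces it.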
Other than my comment, +1 for merge
Thanks for the review @glouppe !
AppVeyor is working well!
Can you please add a new entry in
sklearn/ensemble/bagging.py (outdated)

        proba_est = estimator.predict_proba(X[:, features])
        if n_outputs == 1:
            proba_est = [proba_est]
    except AttributeError:
Why not use hasattr as previously?
I can switch to hasattr.
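The two detection styles under discussion can be sketched with toy classes (these are not the PR's code; the diff above catches AttributeError around the predict_proba call, while the review suggests an upfront hasattr check):

```python
class SoftClassifier:
    """Toy estimator exposing predict_proba (illustrative only)."""
    def predict_proba(self, X):
        return [[0.5, 0.5] for _ in X]

class HardClassifier:
    """Toy estimator with predict only (illustrative only)."""
    def predict(self, X):
        return [0 for _ in X]

def supports_proba(estimator):
    # hasattr-based check, as the reviewer suggests: decide up front
    # instead of catching AttributeError after the call.
    return hasattr(estimator, "predict_proba")
```

Both approaches behave the same for plain attribute lookup; hasattr makes the intent explicit and avoids accidentally swallowing an AttributeError raised inside predict_proba itself.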
@ogrisel I have taken into account your comments.
bump. Fix merge conflict.
Just out of curiosity: what is your application for multi-output?
I am still pretty torn on multi-output stuff. It adds so much complexity, breaks the API for
I admit I have the same issue. I don't know if I would implement multi-output in RFs if I had to redo it from scratch...
I am doing a lot of multi-label learning and multi-output regression. Using trees or variations as base learners, I am stuck with the current tree API and also often face issue #2451.
So downgrading trees to multi-label would help you?
I think that we will see more and more usage of multi-label in the future. Not to say that I have an opinion on this actual PR: I haven't looked at it.
@GaelVaroquaux I am totally for multi-label; the question is more multi-output multi-class, which doesn't seem to be a common setting.
I am with you, completely. My remark was more in the sense of multi-output trees. But we should wait a bit: Jacob is experimenting with some rewrites of the tree modules that seem to be giving a factor 2 speed-up in RFs.
Given that it's an interface problem and that I am already working around it, it wouldn't help me.
Maybe, instead of having a list of arrays, one per output, we should have a three-dimensional array of shape (n_samples, n_classes, n_outputs). If the number of classes is not identical across outputs, we could set the missing probabilities to zero. At least we would get fast NumPy operations and preserve the whole feature. For
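The proposed packing could look something like this (pack_proba is a hypothetical helper, not part of the PR): per-output probability arrays are stacked into one 3-D array, zero-padding outputs that have fewer classes, so that averaging becomes a single vectorized operation.

```python
import numpy as np

def pack_proba(proba_list):
    """Pack a list of (n_samples, n_classes_k) arrays into one
    (n_samples, max_n_classes, n_outputs) array, zero-padding
    outputs with fewer classes. Illustrative sketch only."""
    n_samples = proba_list[0].shape[0]
    max_classes = max(p.shape[1] for p in proba_list)
    packed = np.zeros((n_samples, max_classes, len(proba_list)))
    for k, p in enumerate(proba_list):
        packed[:, :p.shape[1], k] = p
    return packed

# Two outputs: one binary, one with 3 classes.
proba_list = [np.full((4, 2), 0.5), np.full((4, 3), 1.0 / 3)]
packed = pack_proba(proba_list)
# Averaging over estimators is now one vectorized op, e.g.:
# packed /= n_estimators
```

The padded entries stay at zero, so they never contribute probability mass; the trade-off is that each output carries max_n_classes columns even when it has fewer classes.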
Closing as superseded by #8547. |
This pull request brings multi-output support (#3449) to the bagging meta estimators.
It differs from #3449 in that the averaging implementation is shared between single-output and multi-output data.
I haven't implemented a multi-output decision function, as no base estimator currently supports this.
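For context, scikit-learn forests already follow the multi-output predict_proba convention this PR brings to bagging: a list of arrays, one per output. A minimal sketch with made-up data (the data and parameters here are illustrative):

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.RandomState(0)
X = rng.rand(20, 4)
# Two targets: one binary, one with 3 classes.
y = np.column_stack([
    np.array([0, 1] * 10),       # binary output
    np.array([0, 1, 2, 0] * 5),  # 3-class output
])

clf = RandomForestClassifier(n_estimators=5, random_state=0).fit(X, y)
# predict_proba returns one (n_samples, n_classes_k) array per output.
proba = clf.predict_proba(X)
```

Each list element sums to 1 row-wise, exactly as in the single-output case; the bagging averaging in this PR has to preserve that per-output structure.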