Add stacking-meta-model #6674


Closed · wants to merge 2 commits

Conversation

@tsterbak commented Apr 18, 2016

A wrapper that allows combining models into a two-stage stacking model.
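
For context, a rough sketch of the two-stage idea; this is not the code in this PR, and the base estimators, the use of cross_val_predict for out-of-fold predictions, and LogisticRegression as the meta-model are all illustrative assumptions:

import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_predict
from sklearn.svm import SVC

X, y = make_classification(n_samples=500, random_state=0)

# First stage: several base classifiers.
stage_one = [RandomForestClassifier(random_state=0), SVC(probability=True, random_state=0)]

# Out-of-fold predicted probabilities from each base classifier
# become the features for the second stage.
meta_features = np.column_stack([
    cross_val_predict(clf, X, y, cv=5, method="predict_proba")[:, 1]
    for clf in stage_one
])

# Second stage: a meta-estimator fit on the stacked predictions.
meta_model = LogisticRegression()
meta_model.fit(meta_features, y)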


@tsterbak (Author)

PR to #4816

@jnothman (Member)

Please add tests!


        return y_out

    def run_gridsearch(self,X,y,params):
Member

What's this meant to do? It doesn't look quite right for scikit-learn's API.

Author

It may not be perfect yet, but it can already be used with, for example, the cross_val_score function.
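
For reference, any object implementing the estimator API (a parameter-only __init__ plus fit and predict) can be passed to cross_val_score. Below is a toy, self-contained stacker along those lines; the class name SimpleStacker and its parameters are chosen purely for illustration and are not this PR's implementation:

import numpy as np
from sklearn.base import BaseEstimator, ClassifierMixin, clone
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB


class SimpleStacker(ClassifierMixin, BaseEstimator):
    """Toy two-stage stacker following the scikit-learn fit/predict API."""

    def __init__(self, base_estimators=None, meta_estimator=None):
        self.base_estimators = base_estimators
        self.meta_estimator = meta_estimator

    def fit(self, X, y):
        # Fit each first-stage model, then stack their probabilities
        # as features for the second-stage (meta) model.
        self.base_ = [clone(est).fit(X, y) for est in self.base_estimators]
        stacked = np.column_stack([est.predict_proba(X)[:, 1] for est in self.base_])
        self.meta_ = clone(self.meta_estimator).fit(stacked, y)
        return self

    def predict(self, X):
        stacked = np.column_stack([est.predict_proba(X)[:, 1] for est in self.base_])
        return self.meta_.predict(stacked)


X, y = make_classification(random_state=0)
model = SimpleStacker(
    base_estimators=[RandomForestClassifier(random_state=0), GaussianNB()],
    meta_estimator=LogisticRegression(),
)
print(cross_val_score(model, X, y, cv=3))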

Author

This evaluates the model on test data.

@jnothman (Member)

Please adhere to PEP8. Using flake8 might help.

# libraries
import numpy as np

# scikit-learn base libraries
Member

You can remove these comments

@MechCoder (Member) commented Apr 28, 2016

From the definition of the StackingClassifier here (http://machine-learning.martinsewell.com/ensembles/stacking/), self.stage_two_clfs should be self.stage_two_clf and there should be no weights argument.

I can understand the use case, but you should be applying a VotingClassifier on top of the StackingClassifier; the StackingClassifier should not be doing it internally. This would be the API for that use case:

# Classifier1/Classifier2 are placeholders for arbitrary base classifiers,
# and StackingClassifier is the class proposed in this PR.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression

X, y = make_classification()
estimators1 = [Classifier1(), Classifier2()]
estimator2 = LogisticRegression()
estimator3 = RandomForestClassifier()
sc1 = StackingClassifier(estimators1, estimator2)
sc2 = StackingClassifier(estimators1, estimator3)
vc = VotingClassifier([("sc1", sc1), ("sc2", sc2)], weights=[0.3, 0.7])
vc.fit(X, y)

Also, we should rename self.stage_one_clfs to something that contains "estimators". And we should take a list of tuples, where each tuple is a (string, estimator) pair, as done in VotingClassifier, for GridSearch support. See https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/ensemble/voting_classifier.py#L227
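
To illustrate why the (string, estimator) tuples matter: the names become parameter prefixes that GridSearchCV can address via the name__param syntax, as VotingClassifier already supports. A small self-contained example (not this PR's code):

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

X, y = make_classification(random_state=0)

vc = VotingClassifier(estimators=[
    ("lr", LogisticRegression()),
    ("rf", RandomForestClassifier(random_state=0)),
])

# The names in the (name, estimator) tuples become parameter prefixes,
# so nested hyperparameters can be tuned directly with GridSearchCV.
grid = GridSearchCV(
    vc,
    param_grid={"lr__C": [0.1, 1.0], "rf__n_estimators": [50, 100]},
    cv=3,
)
grid.fit(X, y)
print(grid.best_params_)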


# fit the second stage models
for clf in self.stage_two_clfs:
    clf.fit(self.__X,self.__y)
@MechCoder (Member) commented Apr 28, 2016

Shouldn't this be just clf.fit(y_pred, y), according to http://machine-learning.martinsewell.com/ensembles/stacking/?

@MechCoder (Member) commented Apr 28, 2016

@tsterbak Thanks for the effort!

Please do consider adding a bit of documentation, tests, and PEP8 compliance.

@tsterbak (Author)

@MechCoder Thanks for your comments!! I will change the code asap.

@tsterbak (Author)

@MechCoder about the API:
I think you are right in some ways, but doing the voting internally and computing the first and second stages in one object is computationally more efficient: the first-stage models only have to be fit and predicted once in order to predict with all the second-stage models.

More suggestions are welcome! :)
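
A minimal sketch of that efficiency argument; the estimator choices are illustrative, and in-sample first-stage predictions are used here for brevity where out-of-fold predictions would be used in practice:

import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import SVC

X, y = make_classification(n_samples=500, random_state=0)

# First stage: fit and predict only once.
stage_one = [RandomForestClassifier(random_state=0), SVC(probability=True, random_state=0)]
first_stage_preds = np.column_stack(
    [clf.fit(X, y).predict_proba(X)[:, 1] for clf in stage_one]
)

# Second stage: every model reuses the cached first-stage predictions,
# so adding another second-stage model does not re-fit the first stage.
stage_two = [LogisticRegression(), GaussianNB()]
for clf in stage_two:
    clf.fit(first_stage_preds, y)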

@MechCoder (Member)

All right, I can have a closer look after the code is cleaned up.

@yl565 (Contributor) commented Aug 7, 2016

@MechCoder Hi, is anyone working on this currently? If not, I'd like to open a PR.

@rth (Member) commented Sep 2, 2016

Is anybody planning to work on this? Otherwise I would be happy to look into it.

@yl565 (Contributor) commented Sep 13, 2016

ping @jnothman

@MechCoder (Member)

Sure, please go ahead. There were some questions with regards to the API, but those can be discussed during the course of the pull request.

@ivallesp (Contributor) commented Sep 22, 2016

I would like to participate in this project. In fact, I almost won the BNP competition (22nd place) with my own implementation, which I would like to adapt to sklearn... If possible, I would like to work on it!!

@JivanRoquet

@ivallesp I'm currently working on a stacking (aka Super Learner) implementation too - would you like to collaborate?

@yl565 (Contributor) commented Nov 20, 2016

@JivanRoquet I already have a PR, #7427, for a stacking classifier; I'm just waiting for #7674 to be merged, after which I can use the new _BaseComposition class. If you have any suggestions, please let me know.

@JivanRoquet

@yl565 awesome, thanks! I can see you provided a StackingClassifier.

My work is related to a StackingRegressor. Your PR is very helpful in that regard, so both implementations can be consistent. I'll try to adapt what I've already done to the conventions you follow.

@ivallesp (Contributor)

@JivanRoquet Sure! I would love to collaborate on this.

@AlJohri commented Oct 3, 2017

@jnothman should this PR be closed, with #7427 being the more up-to-date one?

EDIT: the even more up-to-date version is #8960.

@amueller added the Superseded (PR has been replaced by a newer PR) label on Aug 5, 2019
Labels
Superseded (PR has been replaced by a newer PR)

9 participants