-
-
Notifications
You must be signed in to change notification settings - Fork 25.8k
AverageRegressor? #10743
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@amueller, I'd like to work on this issue, if no other PR is available. |
@amueller I think the most similar to this new The baggingRegressor fits the base regressors on a random subset of the data while using the same base_estimator :
The new |
Is this something that's done much? The soft voting makes VotingClassifier
much more powerful if the models have different biases.
I think we'd be better off merging the StackingTransformer (#8960) unless
this is well justified by the literature.
|
@jnothman, I see Averaging frequently in Kaggle competitions as one of the ensembling techniques, however stacking has proven to be more valuable than simple averaging. I think we can consider averaging a corner case of stacking where instead of using a new estimator to aggregate the predictions of previous estimators we use a simple function I guess the argument for adding AverageRegressor as a separate regressor is to keep consistency with other API as VotingClassifier and, also, to cover the most famous ensembling techniques that are currently used. |
As far as I know, there are four categories of ensembling techniques. For each of them, sklearn implements the following:
So, I think that it's relevant to implement the |
if there is not other existing object to do this I think it's useful.
It is at least as useful than VotingClassifier
|
Can I start working on a PR for this, or should I wait until consensus is reached? |
I think it would likely be merged
|
@amueller @jnothman @agramfort, I understand that the decision hasn't been made yet, but I thought it might be useful to have a concrete PR in case we decide in favor of including The work has been pushed here: #10868. I am looking forward to see your reviews. |
I forked #10868. |
Should we add the regressor equivalent of VotingClassifier, which would just be computing averages? (I vaguely remember seeing that somewhere but now couldn't find issue or PR)
The text was updated successfully, but these errors were encountered: