DOC add comments regarding to make a balanced random forest from a BalancedBaggingClassifier #372

glemaitre · 2017-11-22T17:25:07Z

I think that we could add note mentioning that we can achieve a balanced random forest classifier by setting max_features='auto' of the decision tree. I don't think that we should implement a new estimator since scikit-learn is going to do it.

The text was updated successfully, but these errors were encountered:

glemaitre · 2017-11-22T17:25:20Z

@chkoar WDYT?

chkoar · 2017-11-22T19:16:05Z

I don't have a strong opinion on this. I believe that a robust predictors/ensemble methods module could bring traction to the package. For that reason I would may implement BRF even as a shortcut.

glemaitre · 2017-11-22T19:22:16Z

Yep but it needs to follow the random forest API and not the Bagging classifier. In that regards this why scikit learn has a PR there. We could give an hand actually to do this one. It might go faster together :-)

chkoar · 2017-11-27T10:08:55Z

@glemaitre is a PR stalled in scikit-learn where we could contribute to be finished?

glemaitre · 2017-11-27T10:10:19Z

it is already merged in #373

chkoar · 2017-11-27T10:15:29Z

I asked because you said .

In that regards this why scikit learn has a PR there.

glemaitre · 2017-11-27T10:20:10Z

Oh yes there one PR that needs love. We wanted also to balanced at each node instead of tree to see the difference. So there is plenty of things

chkoar · 2017-11-27T14:35:20Z

which one?

glemaitre · 2017-11-27T14:42:07Z

scikit-learn/scikit-learn#8732

chkoar · 2017-12-26T17:22:47Z

We wanted also to balanced at each node instead of tree to see the difference.

@glemaitre in each tree? Does this task need modification in the Cython level?

glemaitre · 2017-12-26T18:02:28Z

‎Yep you need to do that in cython

potash · 2018-01-29T21:49:21Z

@chkoar, @glemaitre I'm not familiar with imblearn but in sklearn you can balance each tree simply by changing the sample_indices that get passed. This is how I implemented it in scikit-learn/scikit-learn#8732.

So does BalancedBaggingClassifier(max_features="auto") balance at each tree or does it just balance the data once? If the latter, I think it is confusing to call that a "balanced random forest" in the documentation because in the Breiman paper that refers to balancing each tree.

glemaitre · 2018-01-29T21:55:12Z

The implementation is a pipeline of a random under sampler with an estimator. So if you pass an estimator which is a tree, it will balance each subset and then fit a tree on each subset. Therefore, BalancedBaggingClassifier become a BalancedRandomForest with max_feature='auto' as mentioned in the documentation.

potash · 2018-01-29T21:56:38Z

@glemaitre OK thanks for the clarification

glemaitre mentioned this issue Nov 22, 2017

[MRG] EHN add note to create balanced RF #373

Merged

glemaitre closed this as completed in #373 Nov 24, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC add comments regarding to make a balanced random forest from a BalancedBaggingClassifier #372

DOC add comments regarding to make a balanced random forest from a BalancedBaggingClassifier #372

glemaitre commented Nov 22, 2017

glemaitre commented Nov 22, 2017

chkoar commented Nov 22, 2017

glemaitre commented Nov 22, 2017 via email

chkoar commented Nov 27, 2017

glemaitre commented Nov 27, 2017

chkoar commented Nov 27, 2017

glemaitre commented Nov 27, 2017 via email

chkoar commented Nov 27, 2017

glemaitre commented Nov 27, 2017

chkoar commented Dec 26, 2017

glemaitre commented Dec 26, 2017 via email

potash commented Jan 29, 2018

glemaitre commented Jan 29, 2018

potash commented Jan 29, 2018

DOC add comments regarding to make a balanced random forest from a BalancedBaggingClassifier #372

DOC add comments regarding to make a balanced random forest from a BalancedBaggingClassifier #372

Comments

glemaitre commented Nov 22, 2017

glemaitre commented Nov 22, 2017

chkoar commented Nov 22, 2017

glemaitre commented Nov 22, 2017 via email

chkoar commented Nov 27, 2017

glemaitre commented Nov 27, 2017

chkoar commented Nov 27, 2017

glemaitre commented Nov 27, 2017 via email

chkoar commented Nov 27, 2017

glemaitre commented Nov 27, 2017

chkoar commented Dec 26, 2017

glemaitre commented Dec 26, 2017 via email

potash commented Jan 29, 2018

glemaitre commented Jan 29, 2018

potash commented Jan 29, 2018