
[MRG] FIX solve consistency between predict and predict_proba in AdaBoost #14114


Merged: 10 commits merged into scikit-learn:master on Jul 16, 2019

Conversation

@glemaitre (Member) commented on Jun 18, 2019

closes #14084
closes #2974

Compute the probabilities in AdaBoostClassifier as specified in "Multi-class AdaBoost" (Zhu et al., 2009).
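For context, the referenced paper maps the aggregated decision values f_k(x) to class probabilities through a softmax scaled by 1/(K-1). A minimal sketch of that mapping, with illustrative names only (the two-class case needs the symmetric handling discussed further down in this thread):

    from scipy.special import softmax

    def proba_from_decision(decision, n_classes):
        # decision: array of shape (n_samples, n_classes) with the aggregated
        # decision values f_k(x); the probability of class k is
        # exp(f_k(x) / (K - 1)) / sum_j exp(f_j(x) / (K - 1))
        return softmax(decision / (n_classes - 1), axis=1)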

@glemaitre changed the title from "[WIP] FIX solve consistency between predict and predict_proba in AdaBoost" to "[MRG] FIX solve consistency between predict and predict_proba in AdaBoost" on Jun 19, 2019
@glemaitre (Member Author)

@NicolasHug @amueller I am playing with something that is not my strong suit. Could you have a look at the PR and check whether the proposed fix seems theoretically reasonable?

@NicolasHug (Member) left a comment


As far as I can tell the changes are correct.

What is strange to me is that the previous implementation is also a softmax of the decision function, so other than the fact that the new version is much clearer, I don't understand where the fix is.

(to be honest this isn't really my specialty either)

           2009.
        """
        if n_classes == 2:
            decision = np.vstack([-decision, decision]).T
Member:


can you explain this? is it in the paper too?

glemaitre (Member Author):


My original thought was that we were keeping one of the 2 columns when computing the decision function. Actually, we do something a bit different:

        if n_classes == 2:
            pred[:, 0] *= -1
            return pred.sum(axis=1)

You originally have symmetry between both classes. However, I should divide decision by 2, since we are summing over both columns.
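As a small illustration of that two-class handling (made-up decision values; scipy's softmax is used only for the check): stacking [-decision, decision] and dividing by 2 keeps the symmetry between the classes while the predicted class still follows the sign of the original decision.

    import numpy as np
    from scipy.special import softmax

    decision = np.array([-2.0, 0.5, 3.0])   # made-up 1-D binary decision values

    # rebuild symmetric two-column scores; divide by 2 because the original
    # decision was the sum over the two (opposite-sign) columns
    sym = np.vstack([-decision, decision]).T / 2
    proba = softmax(sym, axis=1)

    assert np.allclose(proba.sum(axis=1), 1.0)             # rows are valid probabilities
    assert (proba.argmax(axis=1) == (decision > 0)).all()  # argmax follows the sign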

@glemaitre (Member Author)

> What is strange to me is that the previous implementation is also a softmax of the decision function, so other than the fact that the new version is much clearer, I don't understand where the fix is.

The previous implementation uses the predict_proba of the underlying classifiers instead of their predict (which is what the decision function uses). That is the main difference.
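In other words, after this change both predict and predict_proba derive from the same decision function, so they should agree on the predicted class. A quick illustrative check of that consistency (dataset and settings picked arbitrarily here):

    import numpy as np
    from sklearn.datasets import load_iris
    from sklearn.ensemble import AdaBoostClassifier

    X, y = load_iris(return_X_y=True)
    clf = AdaBoostClassifier(n_estimators=50, random_state=0).fit(X, y)

    # the class returned by predict matches the argmax of predict_proba
    assert np.array_equal(
        clf.predict(X),
        clf.classes_[np.argmax(clf.predict_proba(X), axis=1)],
    )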

@glemaitre (Member Author)

@NicolasHug do you have any other comments?

Any second reviewer? @rth @thomasjpfan

@thomasjpfan self-assigned this on Jul 4, 2019
@thomasjpfan (Member) left a comment


Strange how master's implementation used predict_proba instead of predict, which is more in line with the paper's use of the misclassification error rate.

@glemaitre (Member Author)

@thomasjpfan I addressed the comments.

@thomasjpfan (Member) left a comment


LGTM

@agramfort merged commit c0c5313 into scikit-learn:master on Jul 16, 2019
@agramfort (Member)

thx @glemaitre
