DOC document about the divergent factor of the Binomial loss #25383
Conversation
I don't think we should change the description of the hyperparameter that is used to select the loss to optimize for when fitting the model. The original problem was about the content of the train_score_ attribute. Furthermore, I think we might want to store the real full loss value instead of the half-loss value inside that attribute. That should not be expensive to compute because it is not a per-sample operation but a per-epoch operation. WDYT @lorentzenchr and others?
@@ -883,6 +883,7 @@ class GradientBoostingClassifier(ClassifierMixin, BaseGradientBoosting):
     loss : {'log_loss', 'deviance', 'exponential'}, default='log_loss'
         The loss function to be optimized. 'log_loss' refers to binomial and
         multinomial deviance, the same as used in logistic regression.
+        It contains a factor x2 that has to be neglected to reflect the actual loss.
I think this is only true for the log_loss and deviance options (which are the same loss).
The way to check is to look at how the loss functions are defined in sklearn/_loss/loss.py: the loss function classes that have "Half" in their name only compute half of the usual value (for performance reasons when computing the per-sample gradients).
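For readers following along, here is a small numpy sketch (not part of this PR; variable names like raw and proba are illustrative) of the relationship being discussed: the per-sample "half" binomial loss log(1 + exp(raw)) - y * raw matches sklearn.metrics.log_loss, while the full binomial deviance historically used by the gradient boosting code carries the extra factor of 2.

```python
# Sketch only: spells out the formulas with numpy instead of calling the
# private classes in sklearn/_loss/loss.py.
import numpy as np
from sklearn.metrics import log_loss

rng = np.random.default_rng(0)
y = rng.integers(0, 2, size=1000).astype(float)  # binary targets in {0, 1}
raw = rng.normal(size=1000)                      # raw predictions (log-odds)
proba = 1.0 / (1.0 + np.exp(-raw))               # sigmoid(raw)

# Per-sample "half" binomial loss: log(1 + exp(raw)) - y * raw
half = np.logaddexp(0.0, raw) - y * raw
# Full binomial deviance, written out as in the historical BinomialDeviance loss
deviance = -2.0 * np.mean(y * raw - np.logaddexp(0.0, raw))

print(np.isclose(half.mean(), log_loss(y, proba)))     # True: half loss == log loss
print(np.isclose(deviance, 2.0 * log_loss(y, proba)))  # True: deviance == 2 * log loss
```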
Actually, not for GradientBoosting: I think that we still import from _gb_losses.py.
It is as @glemaitre says. The change to the loss function module was blocked by a deprecation period for the loss_ attribute of GradientBoostingRegressor. This is now over (1.3).
This would be fine. With version 1.3., …
@ogrisel @glemaitre @lorentzenchr what's the status here now? It's been stale since January.
Right now, I'd like to first finish #25964.
Looks like the solution to the issue is quite different from this proposal. Therefore closing, and happy to have a fresh PR with the suggested changes here.
Reference Issues/PRs
Fixes #25206
What does this implement/fix? Explain your changes.
I made a documentation change: the deviance (= loss) reported by the train_score_ attribute is double the actual log_loss, which gave hard-to-interpret results when comparing train_score_ on the training data with a classic log_loss computation on the test data.
The fix I proposed is at the level of the documentation of the loss parameter of GradientBoostingClassifier, where I mentioned that the loss (= deviance) carries a factor of two and that, to get the actual log_loss, we need to take only half of the reported value.
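To make the reported behaviour concrete, here is a minimal reproduction sketch (toy data from make_classification; the exact ratio depends on the scikit-learn version, since later releases reworked the internal loss module):

```python
# Sketch of the reported factor-of-two gap between train_score_ and the
# standard log loss (assumes a release that still uses _gb_losses.py).
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import log_loss

X, y = make_classification(n_samples=500, random_state=0)
clf = GradientBoostingClassifier(loss="log_loss", random_state=0).fit(X, y)

train_deviance = clf.train_score_[-1]               # deviance at the last iteration
train_log_loss = log_loss(y, clf.predict_proba(X))  # standard log loss, same data

# On affected versions the stored "deviance" is roughly twice the log loss.
print(train_deviance / train_log_loss)  # ~2
```

Comparing train_score_ directly against a log_loss computed on held-out data is therefore off by a factor of two, which is the confusion the linked issue describes.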
Any other comments?