[MRG] FEA Lift metric and curve #21320

nawarhalabi · 2021-10-13T15:11:18Z

Reference Issues/PRs

implements partially what is in two stale PRs in a significantly more complete manner:

What does this implement/fix? Explain your changes.

Implemented the lift_score metric function, with the lift_curve function and the LiftCurveDisplay class.

Lift is a commonly used metric in evaluating response to ad campaigns. please read:

This implementation includes:

implementation of lift_score in sklearn/metrics/_classification.py for calculating a single value
Implementation of lift_curve in sklearn/metrics/_ranking.py for calculating an array of lifts based on different positive classification rates
Implementation of LiftCurveDisplay in sklearn/metrics/_plot/lift_curve.py for plotting the lift curve/chart
Added documentation in the model_evaluation section for lift and referenced the function page which contains examples

…splay classes

nawarhalabi · 2021-11-11T11:41:15Z

Any recommendations/comments. The lift metric seems to be a desired feature in previous PRs.
@jnothman, @rth, @GuillemGSubies I am ating you as you had a look at the previous PRs related #18479 and #10003
Thanks!

jnothman · 2021-11-21T10:27:28Z

Thanks @nawarhalabi this looks quite nice. However, given that the major bottleneck in scikit-learn is in review time, I'd suggest you break it into smaller contributions, starting with the lift_curve function and its tests. For lift_score I'd be interested in seeing references to its use in practice for classification evaluation or diagnostics.

nawarhalabi · 2021-11-24T16:35:00Z

Thanks @jnothman

Regarding splitting the contribution: Sure! I will limit this PR to lift_curve and its tests. I will create a new branch off this branch for the lift_score and create a separate PR for that later. Does this sound good?
Regarding references: I will add references to the docstring of the lift_score function in the future PR. Here are the ones I would add:

jnothman · 2021-11-24T20:08:05Z

I think that will be helpful for reviewers, yes, although I am not as available to review personally as once upon a time so I can't be too sure!

lorentzenchr · 2021-11-24T21:58:18Z

Just to clarify expectations: While splitting this PR will help in the review process, it is not yet decided to include the proposed functionality. In #21718, we are trying to figure out some principles for model evaluation tools.

adrinjalali · 2024-03-07T10:01:15Z

@lorentzenchr do you think these days we'd be including this in the codebase?

Or is this something we'd be happier to have in skrub @GaelVaroquaux @ogrisel

or maybe scikit-lego? (@koaning)

koaning · 2024-03-07T16:19:06Z

I'd be open to adding it to scikit-lego. But at first glance it does feel general enough that it could also live here. If the conclusion is that it's not a great fit for sklearn or skrub then I'll gladly consider it for sklego.

lorentzenchr · 2024-03-07T16:38:26Z

I think an "ML Gini index" as well as an accompanying graph "Cumulative Accuracy Profile" (CAP)¹ for non-negative regression (of which binary classification is an example) would complete our tools for measuring ranking (discriminative) power of models and generalize the existing AUC and ROC.

Note that there is a great confusion about terms. In Gini Index and Friends, my coauthors and me summarized existing literature and gave tutorial like examples. This is now my main reference if I look up terms like "Gini index" (which one?!?!?).

¹ aka gain curve and (cumulative) lift curve, it is almost the inverted Lorenz curve of the empirical distribution generated by the model predictions.

adrinjalali · 2024-03-07T17:23:41Z

@nawarhalabi would you be able to give this PR an update?

glemaitre · 2024-03-11T19:43:16Z

For sure we should prioritize this action when it comes to the inspection.

lorentzenchr · 2024-03-11T21:27:55Z

For sure we should prioritize this action when it comes to the inspection.

What do you mean by action?

glemaitre · 2024-03-12T09:50:44Z

Wrong word, my brain did not work properly. By "action" I meant that we should have a display for this type of curve.

nawarhalabi added 19 commits October 11, 2021 23:18

added unit tests

6e5bacb

added first version of lift curve metric function.

cd4853d

removed old code after return statement of lift curve function

7e1cdcc

fixed outdated example in docs of lift curve

e0061e9

added testing pos_label of lift curve

b04181d

added see also section of the lift curve function

0e171ae

added lift_score tests

425f15c

added lift_score function

72423a6

bug and doc fixes

5fbc53c

added first documentation of lift_score and lift_curve

3d78136

Added LiftCurveDisplay to plot lift curves. Same struture as other Di…

942dbd4

…splay classes

added lift display function for plotting with the tests

209ccf5

added lift display functions to the documentation

26c3ff3

Merge branch 'main' into lift_metric

effdfe6

extended documentation and added reference to the lift curve

bb7febf

ran black for code formatting and committing changes

aeecef1

Fixed bug after running flake8

88f11c7

Merge remote-tracking branch 'upstream/main' into lift_metric

01e2c63

correct example in lift_curve function docs

d3db8e7

github-actions bot added the module:metrics label Oct 13, 2021

nawarhalabi added 4 commits October 13, 2021 17:28

added changelog

b018fbd

added changelog refernce to pr

ceb51b3

corrected bug in whats news

68121b5

fixed bug in whats new

6568468

nawarhalabi changed the title ~~[MRG] Lift metric and curve~~ [MRG] FEA Lift metric and curve Oct 13, 2021

nawarhalabi added 5 commits October 13, 2021 18:19

fixed example of lift_curve not matching output value in doctest

f31bbf0

Merge remote-tracking branch 'upstream/main' into lift_metric

09f83e9

fixed documentation according to new standards

eb12357

fixed doc bug added in last commit by accendent in presison_recall_curve

4760796

adjusted doc of lift_curve function to match standards

597cc76

nawarhalabi added 6 commits October 25, 2021 14:45

Merge remote-tracking branch 'upstream/main' into lift_metric

a735182

Merge remote-tracking branch 'upstream/main' into lift_metric

c8f54da

Merge remote-tracking branch 'upstream/main' into lift_metric

f987a5d

Merge remote-tracking branch 'upstream/main' into lift_metric

0a89221

Merge remote-tracking branch 'upstream/main' into lift_metric

55f42b9

fixed documentation missing new line in v1.1

cac14e6

lorentzenchr mentioned this pull request Nov 19, 2021

RFC Principled metrics for scoring and calibration of supervised learning #21718

Open

nawarhalabi added 2 commits November 25, 2021 16:39

Merge remote-tracking branch 'upstream/main' into lift_metric

e5ecfb2

added online references for the lift_score function

a0508c3

cmarmo added Waiting for Reviewer Needs Decision Requires decision and removed Waiting for Reviewer labels Dec 22, 2021

lorentzenchr removed the Needs Decision Requires decision label Mar 7, 2024

lorentzenchr mentioned this pull request Mar 8, 2024

Add metrics.gini_index_score() #28535

Open

lorentzenchr linked an issue Mar 8, 2024 that may be closed by this pull request

Add metrics.gini_index_score() #28535

Open

lorentzenchr mentioned this pull request Mar 8, 2024

Two different versions for weighted lorenz curve calculation in the examples #28534

Open

adrinjalali added Stalled help wanted labels Apr 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MRG] FEA Lift metric and curve #21320

[MRG] FEA Lift metric and curve #21320

nawarhalabi commented Oct 13, 2021

nawarhalabi commented Nov 11, 2021 •

edited

Loading

jnothman commented Nov 21, 2021

nawarhalabi commented Nov 24, 2021 •

edited

Loading

jnothman commented Nov 24, 2021 via email

lorentzenchr commented Nov 24, 2021

adrinjalali commented Mar 7, 2024

koaning commented Mar 7, 2024

lorentzenchr commented Mar 7, 2024

adrinjalali commented Mar 7, 2024

glemaitre commented Mar 11, 2024

lorentzenchr commented Mar 11, 2024

glemaitre commented Mar 12, 2024

[MRG] FEA Lift metric and curve #21320

Are you sure you want to change the base?

[MRG] FEA Lift metric and curve #21320

Conversation

nawarhalabi commented Oct 13, 2021

Reference Issues/PRs

What does this implement/fix? Explain your changes.

nawarhalabi commented Nov 11, 2021 • edited Loading

jnothman commented Nov 21, 2021

nawarhalabi commented Nov 24, 2021 • edited Loading

jnothman commented Nov 24, 2021 via email

lorentzenchr commented Nov 24, 2021

adrinjalali commented Mar 7, 2024

koaning commented Mar 7, 2024

lorentzenchr commented Mar 7, 2024

adrinjalali commented Mar 7, 2024

glemaitre commented Mar 11, 2024

lorentzenchr commented Mar 11, 2024

glemaitre commented Mar 12, 2024

nawarhalabi commented Nov 11, 2021 •

edited

Loading

nawarhalabi commented Nov 24, 2021 •

edited

Loading