DOC Rework ROC example with cross-validation #29611

ArturoAmorQ · 2024-08-02T08:03:57Z

Reference Issues/PRs

Somewhat related to #25856.

What does this implement/fix? Explain your changes.

Using quantiles to demonstrate the variance in ROC curves during cross-validation can be more appropriate than standard deviation because it does not assume a Gaussian distribution of the true positive rates (TPR) across different thresholds.

This PR also prefers using StratifiedShuffleSplit instead of a simple 5-fold cross-validation to better show the variability across splits. For that purpose I had to use a dataset with more points than iris and changed the svm classifier to a hgbt for faster predictions.

Any other comments?

This PR also takes the opportunity to improve the wording of the example's abstract.

github-actions · 2024-08-02T08:05:14Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 15efa03. Link to the linter CI: here}

ogrisel

Thanks for the PR.

I find the overlapping quantile regions hard to interprete. I think it would be simpler to plot a single 90% percentile region (the one computed by the 0.45 offset).

examples/model_selection/plot_roc_crossval.py

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

betatim · 2024-08-06T15:42:21Z

I think the point of the example is to illustrate that "your one ROC curve is not the truth, it will change and fluctuate because it is only an estimate of the true, unknowable ROC curve". Similar to how the mean of a set of observations is an estimate of the mean, not the true, unknowable value of the mean. I think we can use the median and a set of quantiles to illustrate the spread/variability. But I don't understand how that is better. Can you explain what the problem is with using the standard deviation in this example?

Naively I'd assume that the mean of the true positive rates at a given value of false positive rate is a quantity you can treat like the sample mean of any other set of observations sampled from a random distribution (normal or not). And that the error on that sample mean is std/sqrt(n). In our case n would be n_folds. The standard deviation (std) is the square root of the variance, which you can compute for (almost?) any distribution.

The thing we can't do is interpret the band drawn using the standard deviation as some form of confidence interval.

But then we are drawing the standard deviation (in the original example) and not the standard error on the mean. So I assume it is anyway only there to illustrate the spread (in a cartoon kind of way, not a precise statistical statement about confidence intervals or some such).

ArturoAmorQ · 2024-08-08T09:33:29Z

The thing we can't do is interpret the band drawn using the standard deviation as some form of confidence interval.

The main motivation is: now that we support tuning the decision threshold, confidence intervals are actually important, as they can be directly translated to confidence in a business metric and therefore decision making.

I find the overlapping quantile regions hard to interprete.

Visualizing different quantiles, also implies different risk acceptance in terms of the business metric.

ArturoAmorQ · 2025-01-13T13:55:10Z

Now that #29727 has been merged, I think this PR is good for a second pass of reviews.

ogrisel · 2025-01-20T13:32:14Z

For information, I started to review this PR but I need to read a bit on the literature about ROC averaging before finalizing it.

ArturoAmorQ added 2 commits August 2, 2024 09:46

DOC Use quantiles instead of std in ROC example with cross-validation

d6e864a

Improve wording

e37a089

github-actions bot added the Documentation label Aug 2, 2024

ogrisel reviewed Aug 2, 2024

View reviewed changes

examples/model_selection/plot_roc_crossval.py Outdated Show resolved Hide resolved

examples/model_selection/plot_roc_crossval.py Outdated Show resolved Hide resolved

Apply suggestions from code review

9b1d924

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

Fix linter

00ada90

ArturoAmorQ and others added 7 commits August 26, 2024 16:58

Plot a single 90 percentile region as per Olivier's suggestion

d977d56

Iter

8bfebd2

Use ShuffleSplit, hgbt and make_classification

beda9e9

Iter

edf7299

Prefer f-string format for legend

c19e993

Merge branch 'main' into quantile_roc

9e93c79

Merge branch 'main' into quantile_roc

1e1d45f

ArturoAmorQ mentioned this pull request Aug 27, 2024

FIX Avoid setting legend when labels are None in RocCurveDisplay kwargs #29727

Merged

ArturoAmorQ changed the title ~~DOC Use quantiles instead of std in ROC example with cross-validation~~ DOC Rework ROC example with cross-validation Sep 2, 2024

ArturoAmorQ and others added 5 commits November 28, 2024 17:00

Merge branch 'main' into quantile_roc

626693d

Set chance level label to None

c0a5518

Merge branch 'main' into quantile_roc

9bb912d

Merge branch 'main' into quantile_roc

823c829

Merge branch 'main' into quantile_roc

f975ff9

lucyleeow mentioned this pull request Jan 16, 2025

ENH add from_cv_results in RocCurveDisplay (single RocCurveDisplay) #30399

Merged

ArturoAmorQ requested a review from ogrisel January 20, 2025 10:46

lucyleeow mentioned this pull request May 30, 2025

DOC Use from_cv_results in plot_roc_crossval.py #31455

Merged

Fix conflicts

15efa03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

DOC Rework ROC example with cross-validation #29611

DOC Rework ROC example with cross-validation #29611

Uh oh!

ArturoAmorQ commented Aug 2, 2024 •

edited

Loading

Uh oh!

github-actions bot commented Aug 2, 2024 •

edited

Loading

Uh oh!

ogrisel left a comment

Uh oh!

Uh oh!

Uh oh!

betatim commented Aug 6, 2024

Uh oh!

ArturoAmorQ commented Aug 8, 2024

Uh oh!

ArturoAmorQ commented Jan 13, 2025

Uh oh!

ogrisel commented Jan 20, 2025

Uh oh!

Uh oh!

Uh oh!

DOC Rework ROC example with cross-validation #29611

Are you sure you want to change the base?

DOC Rework ROC example with cross-validation #29611

Uh oh!

Conversation

ArturoAmorQ commented Aug 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

github-actions bot commented Aug 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✔️ Linting Passed

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

betatim commented Aug 6, 2024

Uh oh!

ArturoAmorQ commented Aug 8, 2024

Uh oh!

ArturoAmorQ commented Jan 13, 2025

Uh oh!

ogrisel commented Jan 20, 2025

Uh oh!

Uh oh!

ArturoAmorQ commented Aug 2, 2024 •

edited

Loading

github-actions bot commented Aug 2, 2024 •

edited

Loading