[WIP] Performance comparison (ROC) plots for anomaly detection methods #16378

MaiRajborirug · 2020-02-03T17:26:08Z

Reference Issues/PRs
PRs : [MRG] Comparison plot for anomaly detection methods. #10004

What does this implement/fix? Explain your changes.

Add a plot for anomaly detection methods on multi-D dataset (3 information dimension and >1 noise dimension)
Include algorithm performance comparisons: accuracy_score, roc_auc_score, and roc_curve

The plot:

ogrisel · 2020-02-04T10:00:50Z

I like the new column with the ROC curve plots but for the other columns, I preferred the 2D plots instead of the 3D plots.

albertcthomas · 2020-02-04T12:16:02Z

Thanks @MaiRajborirug, this is a nice visualization but I agree with @ogrisel: with the 3D it's harder to see the specificity of each of the estimators.

ogrisel · 2020-02-04T13:15:39Z

I particular, on the 2D plots it was easier to see the shape of the decision boundary with the black contour line.

MaiRajborirug · 2020-02-04T18:10:15Z

Thank you for your reviews! I will create an ROC curve and accuracy score in the 2D-plot so that we have the performance comparison measurement.

MaiRajborirug · 2020-02-05T06:27:03Z

According to your advice, I add ROC curves (last column), AUC, and prediction accuracy to the 2-D plots

The plot:

MaiRajborirug · 2020-02-05T20:01:23Z

The last update is to make this PR a bit shorter.

The plot:

albertcthomas · 2020-02-13T22:18:11Z

This is nice plot but I am a bit ambivalent about the usefulness of ROC curves for such toy examples. If we want to make an example with ROC curves @ogrisel suggested (more than 2 years ago) to change 2 of the benchmarks to an example. This would maybe be a better thing to do.

glemaitre · 2020-02-14T10:02:08Z

TBH, I had exactly the same reaction as @albertcthomas. I don't think that the quantitative analysis on such toy datasets is a must-have (maybe only the accuracy because it does not clutter the example so much). I think that the main point of the example is indeed a qualitative analysis. It provides highlights and intuition regarding the implemented algorithms, linked to assumptions made regarding the methods.

However, I agree with you that we miss an example where we should show an end-to-end pipeline where anomaly detection is beneficial in classification and this should be rigorously analyzed with such classification metrics/plots.

Another limitation of the ROC is that we only have 3 of the 4 methods as well.

MaiRajborirug · 2020-03-02T04:27:15Z

@albertcthomas, I created a new PR #16606 corresponding to @ogrisel 's and your suggestion. Could you take a look at them?

MaiRajborirug added 4 commits February 3, 2020 11:45

Add files via upload

becb589

Delete plot_anomaly_comparison-3D.py

9037891

Delete plot_anomaly_comparison_3D.ipynb

3d1338a

Add files via upload

c69d134

MaiRajborirug requested review from agramfort, amueller and jnothman February 3, 2020 20:24

MaiRajborirug added 7 commits February 5, 2020 01:17

Delete plot_anomaly_comparison-3D.py

4ffec76

Delete plot_anomaly_comparison_3D.ipynb

1f1696b

Add files via upload

c2f624b

Delete plot_anomaly_comparison_ROC.ipynb

b612a14

Delete plot_anomaly_comparison_ROC.py

f8eb182

Update plot_anomaly_comparison.py

d359f5e

Update plot_anomaly_comparison.py

c281b0e

Update plot_anomaly_comparison.py

70c464a

MaiRajborirug requested review from ogrisel and removed request for jnothman and amueller February 5, 2020 06:40

MaiRajborirug added 2 commits February 5, 2020 01:47

Update plot_anomaly_comparison.py

37d96f1

Update plot_anomaly_comparison.py

d1526c2

MaiRajborirug changed the title ~~[WIP] Performance comparison 3-D plot for anomaly detection methods~~ [WIP] Performance comparison plots for anomaly detection methods Feb 5, 2020

MaiRajborirug requested a review from amueller February 7, 2020 05:44

MaiRajborirug changed the title ~~[WIP] Performance comparison plots for anomaly detection methods~~ [WIP] Performance comparison (ROC) plots for anomaly detection methods Feb 7, 2020

MaiRajborirug mentioned this pull request Feb 10, 2020

Lack quantitative comparison for anomaly detection algorithms example #16420

Closed

MaiRajborirug mentioned this pull request Feb 17, 2020

Example of LOF and IF benchmarks NeuroDataDesign/SPORF#9

Open

Add files via upload

81a6d23

MaiRajborirug closed this Mar 2, 2020

lorentzenchr mentioned this pull request Nov 7, 2021

[MRG] Combining LOF and Isolation benchmarks #16606

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Performance comparison (ROC) plots for anomaly detection methods #16378

[WIP] Performance comparison (ROC) plots for anomaly detection methods #16378

MaiRajborirug commented Feb 3, 2020 •

edited

Loading

ogrisel commented Feb 4, 2020 •

edited

Loading

albertcthomas commented Feb 4, 2020

ogrisel commented Feb 4, 2020

MaiRajborirug commented Feb 4, 2020

MaiRajborirug commented Feb 5, 2020 •

edited

Loading

MaiRajborirug commented Feb 5, 2020 •

edited

Loading

albertcthomas commented Feb 13, 2020 •

edited

Loading

glemaitre commented Feb 14, 2020

MaiRajborirug commented Mar 2, 2020 •

edited

Loading

[WIP] Performance comparison (ROC) plots for anomaly detection methods #16378

[WIP] Performance comparison (ROC) plots for anomaly detection methods #16378

Conversation

MaiRajborirug commented Feb 3, 2020 • edited Loading

ogrisel commented Feb 4, 2020 • edited Loading

albertcthomas commented Feb 4, 2020

ogrisel commented Feb 4, 2020

MaiRajborirug commented Feb 4, 2020

MaiRajborirug commented Feb 5, 2020 • edited Loading

MaiRajborirug commented Feb 5, 2020 • edited Loading

albertcthomas commented Feb 13, 2020 • edited Loading

glemaitre commented Feb 14, 2020

MaiRajborirug commented Mar 2, 2020 • edited Loading

MaiRajborirug commented Feb 3, 2020 •

edited

Loading

ogrisel commented Feb 4, 2020 •

edited

Loading

MaiRajborirug commented Feb 5, 2020 •

edited

Loading

MaiRajborirug commented Feb 5, 2020 •

edited

Loading

albertcthomas commented Feb 13, 2020 •

edited

Loading

MaiRajborirug commented Mar 2, 2020 •

edited

Loading