[MRG] Add specificity score as a metric #10831
Conversation
@jnothman FYI.
I think we need to either not support multiclass or document the multiclass behaviour better. Is this a standard definition for the multiclass case? The binary case can also be calculated with recall_score, though.
How would we calculate it with recall_score? Perhaps you are referring to sensitivity, not the specificity I aim to compute here? In multiclass cases it is often necessary to compute the false positive rate for each class to evaluate the model. By computing and returning the specificity for each class, we allow the user to refer to individual values or to compute the macro/micro average from the returned array. If you would like, I can document the multiclass behaviour along these lines, or with more examples?
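A minimal sketch of the per-class idea described above (the function name `per_class_specificity` is illustrative only, not the API proposed in this PR): for each class, treat it as the positive class, derive TN and FP from the multiclass confusion matrix, and return one specificity value per class.

```python
import numpy as np
from sklearn.metrics import confusion_matrix

def per_class_specificity(y_true, y_pred):
    cm = confusion_matrix(y_true, y_pred)
    fp = cm.sum(axis=0) - np.diag(cm)  # predicted as the class but actually another class
    tn = cm.sum() - (cm.sum(axis=0) + cm.sum(axis=1) - np.diag(cm))  # neither true nor predicted as the class
    return tn / (tn + fp)

y_true = [0, 1, 2, 2, 1, 0]
y_pred = [0, 2, 2, 2, 1, 1]
print(per_class_specificity(y_true, y_pred))  # [1.0, 0.75, 0.75]; average for macro specificity
```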
I meant recall_score(y, y', pos_label=0) for instance.
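For clarity, this is what that call computes in the binary case: recall of the negative class, i.e. TN / (TN + FP), which is exactly the specificity.

```python
from sklearn.metrics import recall_score

y_true = [0, 0, 1, 1, 0]
y_pred = [0, 1, 1, 1, 0]
# Treat the negative class (label 0) as the "positive" class for recall:
# of the 3 true negatives, 2 were predicted as negative, so specificity = 2/3.
print(recall_score(y_true, y_pred, pos_label=0))  # 0.666...
```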
Please also add specificity_score to sklearn/metrics/tests/test_common.py. You might also want to see #10628, which may make the implementation of specificity_score less necessary, or at least simplify it, by providing multilabel_confusion_matrix.
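A sketch of how #10628 could simplify things, assuming its multilabel_confusion_matrix returns one [[tn, fp], [fn, tp]] matrix per class (as the function added in later scikit-learn versions does):

```python
import numpy as np
from sklearn.metrics import multilabel_confusion_matrix

y_true = [0, 1, 2, 2, 1, 0]
y_pred = [0, 2, 2, 2, 1, 1]
mcm = multilabel_confusion_matrix(y_true, y_pred)  # shape (n_classes, 2, 2)
tn, fp = mcm[:, 0, 0], mcm[:, 0, 1]
specificity = tn / (tn + fp)                        # one value per class
print(specificity)
```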
And your tests are currently failing.
Well, the … Let me know.
Well, I'm not yet sure about the implementation of multilabel_confusion_matrix. It hasn't been benchmarked, for instance, and some of the code has become a bit hacky and could be neater. If you'd like to take on benchmarking it and bringing it to completion, I'd be interested in having it off my hands!

I had also thought we should consider a 'specificity' scorer for the binary case in sklearn/metrics/scorer.py.
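One possible shape for such a binary scorer, using only existing public helpers (make_scorer and recall_score); the name specificity_scorer and its registration as a named string are hypothetical, not part of scikit-learn:

```python
from sklearn.metrics import make_scorer, recall_score

# Specificity in the binary case is recall of the negative class,
# so a scorer can be built without any new metric function.
specificity_scorer = make_scorer(recall_score, pos_label=0)

# Usage sketch:
# from sklearn.model_selection import cross_val_score
# cross_val_score(clf, X, y, scoring=specificity_scorer)
```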
Reference Issues/PRs
Please see Issue #10391
What does this implement/fix? Explain your changes.
As per the discussion, instead of adding a false positive rate metric, this adds a specificity score metric.
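For context, specificity and the false positive rate are complements (specificity = 1 - FPR), so either carries the same information; a small binary example:

```python
from sklearn.metrics import confusion_matrix

y_true = [0, 0, 1, 1, 0]
y_pred = [0, 1, 1, 1, 0]
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
specificity = tn / (tn + fp)   # 2 / 3
fpr = fp / (fp + tn)           # 1 / 3
assert abs(specificity - (1 - fpr)) < 1e-12
```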
Any other comments?