Confusion matrix derived metrics #15522

Conversation

samskruthireddy

Reference Issues/PRs

Adding Fall-out, Miss rate, specificity as metrics #5516

What does this implement/fix? Explain your changes.

Implemented a function that returns fpr, tpr, fnr, and tnr.

Any other comments?

Implementation of separate functions for each metric, each calling this function, is still pending.
Co-authored-by: @ddhar1 @samskruthireddy
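For readers following the discussion, here is a minimal sketch of what such a function might compute, under one-vs-rest assumptions and with no averaging. This is an illustration only, not the PR's actual implementation, and `true_false_rates` is a hypothetical name:

```python
import numpy as np

def true_false_rates(y_true, y_pred):
    """Hypothetical sketch: per-class (one-vs-rest) tpr, fpr, tnr, fnr."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    labels = np.unique(np.concatenate([y_true, y_pred]))
    # One-vs-rest counts for each label.
    tp = np.array([np.sum((y_true == c) & (y_pred == c)) for c in labels])
    fn = np.array([np.sum((y_true == c) & (y_pred != c)) for c in labels])
    fp = np.array([np.sum((y_true != c) & (y_pred == c)) for c in labels])
    tn = np.array([np.sum((y_true != c) & (y_pred != c)) for c in labels])
    tpr = tp / (tp + fn)  # sensitivity / recall
    fpr = fp / (fp + tn)  # fall-out
    tnr = tn / (tn + fp)  # specificity
    fnr = fn / (fn + tp)  # miss rate
    return tpr, fpr, tnr, fnr
```

Note this sketch does not handle zero denominators or the `average` and `zero_division` parameters that the review below focuses on.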

@jnothman
Member

jnothman commented Nov 4, 2019

This requires adding the new function to test_common.py, with its specific behaviours tested in test_classification.py.

Member

@glemaitre glemaitre left a comment


I am unsure about the naming. I think that we need to use the full name (e.g. true_positive_rate) instead of the acronym (e.g. tpr).

@@ -2173,7 +2376,8 @@ def log_loss(y_true, y_pred, eps=1e-15, normalize=True, sample_weight=None,
y_true : array-like or label indicator matrix
Ground truth (correct) labels for n_samples samples.

y_pred : array-like of float, shape = (n_samples, n_classes) or (n_samples,)
y_pred : array-like of float, shape = (n_samples, n_classes)

Suggested change
y_pred : array-like of float, shape = (n_samples, n_classes)
y_pred : array-like of float, shape = (n_samples, n_classes) \

@@ -2173,7 +2376,8 @@ def log_loss(y_true, y_pred, eps=1e-15, normalize=True, sample_weight=None,
y_true : array-like or label indicator matrix
Ground truth (correct) labels for n_samples samples.

y_pred : array-like of float, shape = (n_samples, n_classes) or (n_samples,)
y_pred : array-like of float, shape = (n_samples, n_classes)
or (n_samples,)

Suggested change
or (n_samples,)
or (n_samples,)

Contributor

@haochunchang haochunchang left a comment


Good work, @samskruthireddy!
Is this PR a work in progress?
I would like to work on this if this PR is stalled.

tpr : float (if average is not None) or array of float, shape =\
[n_unique_labels]

fpr : float (if average is not None) or array of float, , shape =\

Just a small extra comma :)

Suggested change
fpr : float (if average is not None) or array of float, , shape =\
fpr : float (if average is not None) or array of float, shape =\

Examples
--------
>>> import numpy as np
>>> from sklearn.metrics import precision_recall_fscore_support

Is this import a dependency of this function? The examples that follow do not use it.

Suggested change
>>> from sklearn.metrics import precision_recall_fscore_support

Comment on lines +1716 to +1726
if average == 'weighted':
weights = pos_sum
if weights.sum() == 0:
zero_division_value = 0.0 if zero_division in ["warn", 0] else 1.0
# precision is zero_division if there are no positive predictions
# recall is zero_division if there are no positive labels
# fscore is zero_division if all labels AND predictions are
# negative
return (zero_division_value if pred_sum.sum() == 0 else 0,
zero_division_value,
zero_division_value if pred_sum.sum() == 0 else 0)

This seems to return only 3 values.
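As an aside, the `zero_division` convention the snippet above implements can be illustrated with a tiny helper. This is a hypothetical sketch, not the PR's code: wherever a denominator is zero, the ratio is replaced by the `zero_division` value instead of producing a warning or NaN.

```python
import numpy as np

def safe_divide(num, den, zero_division=0.0):
    """Hypothetical helper: elementwise num / den, substituting
    zero_division wherever the denominator is zero."""
    num = np.asarray(num, dtype=float)
    den = np.asarray(den, dtype=float)
    out = np.full_like(den, zero_division)  # pre-fill with the fallback value
    np.divide(num, den, out=out, where=den != 0)  # divide only where defined
    return out
```

For example, `safe_divide([1.0, 0.0], [2.0, 0.0], zero_division=1.0)` yields `[0.5, 1.0]`.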

Comment on lines +1600 to +1601
alters 'macro' to account for label imbalance; it can result in an
F-score that is not between precision and recall.

I guess this function does not return F-score.

Suggested change
alters 'macro' to account for label imbalance; it can result in an
F-score that is not between precision and recall.
alters 'macro' to account for label imbalance.

Comment on lines +1556 to +1558
If ``pos_label is None`` and in binary classification, this function
returns the average precision, recall and F-measure if ``average``
is one of ``'micro'``, ``'macro'``, ``'weighted'`` or ``'samples'``.

Suggested change
If ``pos_label is None`` and in binary classification, this function
returns the average precision, recall and F-measure if ``average``
is one of ``'micro'``, ``'macro'``, ``'weighted'`` or ``'samples'``.
If ``pos_label is None`` and in binary classification, this function
returns the true positive rate, false positive rate, true negative rate
and false negative rate if ``average`` is one of ``'micro'``, ``'macro'``,
``'weighted'`` or ``'samples'``.

@lucyleeow
Member

@haochunchang it's been marked as 'stalled', so I think you can take it over.

haochunchang added a commit to haochunchang/scikit-learn that referenced this pull request May 18, 2020
Modify documentation and add deprecation of position arg.
haochunchang added a commit to haochunchang/scikit-learn that referenced this pull request May 18, 2020
Modify doc and add deprecation to position arg.
haochunchang added a commit to haochunchang/scikit-learn that referenced this pull request May 18, 2020
Modify doc and add deprecation to position arg.
@haochunchang
Contributor

I have opened a new PR #17265 to take over this stalled PR.
Anyone interested is welcome :)

@albertvillanova
Contributor

@cmarmo, maybe the label "help wanted" should be removed from this PR. Thanks.

Base automatically changed from master to main January 22, 2021 10:51
@glemaitre
Member

Closing in favor of #19556.

7 participants