Enhancement to Confusion Matrix Output Representation for improving readability #19012 #19190

shubhamdo · 2021-01-17T18:50:43Z

Reference Issues/PRs

Fixes #19012

What does this implement/fix? Explain your changes.

When you have multiple levels you can have difficulty reading the ndarray, associating the levels with the True and Predicted values. It is an enhancement to the output of confusion matrix function, better representing the true and predicted values for multilevel classes.

Returns a confusion matrix in dict representation with labels as keys ('true', 'pred')
Example:

    >>> y_true = ["cat", "ant", "cat", "cat", "ant", "bird"]
    >>> y_pred = ["ant", "ant", "cat", "cat", "ant", "cat"]
    >>> confusion_matrix(y_true, y_pred, labels=["ant", "bird", "cat"], pprint=True)
    {('ant', 'ant'): 2, ('bird', 'ant'): 0, ('cat', 'ant'): 1,
     ('ant', 'bird'): 0, ('bird', 'bird'): 0, ('cat', 'bird'): 0,
     ('ant', 'cat'): 0, ('bird', 'cat'): 1, ('cat', 'cat'): 2}

Any other comments?

jnothman

Thanks for the PR.

jnothman · 2021-01-17T22:07:44Z

sklearn/metrics/_classification.py

@@ -249,6 +249,10 @@ def confusion_matrix(y_true, y_pred, *, labels=None, sample_weight=None,
        conditions or all the population. If None, confusion matrix will not be
        normalized.

+    pprint : bool, default=False


let's call this as_dict?

jnothman · 2021-01-17T22:08:54Z

sklearn/metrics/_classification.py

@@ -257,6 +261,14 @@ def confusion_matrix(y_true, y_pred, *, labels=None, sample_weight=None,
        samples with true label being i-th class
        and predicted label being j-th class.

+    Or


This isn't valid numpydoc. The types need to be mentioned all on the first line.

Changed parameter name from pprint --> as_dict()

Changed the Testing Function in test_classification.py

Changed the Docstring for the function, added explaination of Series usage

Not sure about the numpydoc, I have changed it please review.

Or should I mention it as --> tuple[ndarry, dict['true_class','pred_class']] ?

jnothman · 2021-01-17T22:09:47Z

sklearn/metrics/_classification.py

@@ -249,6 +249,10 @@ def confusion_matrix(y_true, y_pred, *, labels=None, sample_weight=None,
        conditions or all the population. If None, confusion matrix will not be
        normalized.

+    pprint : bool, default=False
+        Returns a confusion matrix in dict representation with labels as keys
+        ('true', 'pred')


It would be worth briefly noting the usage with pandas and unstack.

1. Changed parameter name from pprint --> as_dict() 2. Changed the Testing Function in test_classification.py 3. Tested 4. Changed the Docstring for the function, added explaination of Series usage

tansaku · 2022-11-06T09:32:40Z

I guess this PR has stalled?

shubhamdo added 6 commits January 4, 2021 17:13

Dictionary Representation of Confusion Matrix

9dcc1fd

Confusion Matrix Flat Dict Representation using tuples as key

c2037f4

Merge branch 'master' into feature19012

cde3916

Test Case for confusion_matrix_pprint Added

ece5663

Documentation for Confusion Matrix Updated

8c21e29

Merge branch 'master' into feature19012

230d6be

github-actions bot added the module:metrics label Jan 17, 2021

jnothman reviewed Jan 17, 2021

View reviewed changes

Added reviewed changes

da38022

1. Changed parameter name from pprint --> as_dict() 2. Changed the Testing Function in test_classification.py 3. Tested 4. Changed the Docstring for the function, added explaination of Series usage

Base automatically changed from master to main January 22, 2021 10:53

shubhamdo added 2 commits March 10, 2021 17:55

Linting Issues Fixed

cd0b082

Merge branch 'master' into feature19012

9f7ee8e

cmarmo added the Enhancement label Mar 10, 2021

shubhamdo added 4 commits March 10, 2021 18:15

Liniting fix _classification.py

7c3ce93

Liniting fix _classification.py

c0a3dcd

Changelog updated

46667a6

Linting Issue Fix

9440438

tansaku mentioned this pull request Nov 6, 2022

Confusion Matrix Representation / Return Value #19012

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Enhancement to Confusion Matrix Output Representation for improving readability #19012 #19190

Enhancement to Confusion Matrix Output Representation for improving readability #19012 #19190

Uh oh!

shubhamdo commented Jan 17, 2021

Uh oh!

jnothman left a comment

Uh oh!

jnothman Jan 17, 2021

Uh oh!

jnothman Jan 17, 2021

Uh oh!

shubhamdo Jan 18, 2021 •

edited

Loading

Uh oh!

jnothman Jan 17, 2021

Uh oh!

tansaku commented Nov 6, 2022

Uh oh!

Uh oh!

Uh oh!

Enhancement to Confusion Matrix Output Representation for improving readability #19012 #19190

Are you sure you want to change the base?

Enhancement to Confusion Matrix Output Representation for improving readability #19012 #19190

Uh oh!

Conversation

shubhamdo commented Jan 17, 2021

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

jnothman Jan 17, 2021

Choose a reason for hiding this comment

Uh oh!

jnothman Jan 17, 2021

Choose a reason for hiding this comment

Uh oh!

shubhamdo Jan 18, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jnothman Jan 17, 2021

Choose a reason for hiding this comment

Uh oh!

tansaku commented Nov 6, 2022

Uh oh!

Uh oh!

shubhamdo Jan 18, 2021 •

edited

Loading