ENH Add replace_undefined_by to accuracy_score #31187

StefanieSenger · 2025-04-12T15:03:15Z

Reference Issues/PRs

towards #29048

What does this implement/fix? Explain your changes.

This PR adds a replace_undefined_by param to accuracy_score to deal with empty y_true and y_pred.
Also adds tests.

Open Question

Note that before this PR accuracy_score returned like this:
accuracy_score(np.array([]), np.array([]))

nan

accuracy_score(np.array([]), np.array([]), normalize=False)

0.0

I would like to consider this inconsistency as a bug and fix this with this PR for the next release without deprecation, so it comes faster. Would this be okay? How would you see that, @adrinjalali?

github-actions · 2025-04-12T15:04:31Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: a62e487. Link to the linter CI: here}

sklearn/metrics/_classification.py

adrinjalali

nits, otherwise LGTM.

adrinjalali · 2025-04-23T13:57:47Z

sklearn/metrics/_classification.py

+        thus ill-defined. Can take the following values:
+
+        - `np.nan` to return `np.nan`
+        - a floating point value in the range of [0.0, 1.0] or int 0


Suggested change

- a floating point value in the range of [0.0, 1.0] or int 0

- a floating point value in the range of $[0.0, 1.0]$ or `int` 0

I think that makes it math like

Guillaume once told me we don't use latex in docstrings. So I will use back ticks instead.

sklearn/metrics/_classification.py

adrinjalali · 2025-04-23T14:00:26Z

sklearn/metrics/_classification.py

+                "defaults to 0 when `normalize=False` is set."
+            )
+            warnings.warn(msg, UndefinedMetricWarning, stacklevel=2)
+            return replace_undefined_by if math.isnan(replace_undefined_by) else 0


Suggested change

return replace_undefined_by if math.isnan(replace_undefined_by) else 0

return replace_undefined_by if np.isnan(replace_undefined_by) else 0

since we're using np.nan?

Both is possible, but we're replacing np.isnan with math.isnan for array api, so we can already do it here, I think.

sklearn/metrics/_classification.py

adrinjalali · 2025-04-23T14:09:02Z

sklearn/metrics/_classification.py

@@ -3081,7 +3114,7 @@ def hamming_loss(y_true, y_pred, *, sample_weight=None):

    Returns
    -------
-    loss : float or int
+    loss : float


there return types are changed. I agree it should be always float for all of them, but it'd be nice to have a test for all the cases to make sure it's actually float.

~~I will do this in a separate PR.~~

These tests had already been added in #30575.

StefanieSenger

Thank you for your review, @adrinjalali.

I have addressed all your comments. For testing if all the classification metrics indeed return floats, I will open a separate PR.

StefanieSenger · 2025-04-29T13:24:31Z

sklearn/metrics/_classification.py

+        thus ill-defined. Can take the following values:
+
+        - `np.nan` to return `np.nan`
+        - a floating point value in the range of [0.0, 1.0] or int 0


Guillaume once told me we don't use latex in docstrings. So I will use back ticks instead.

StefanieSenger · 2025-04-29T13:31:36Z

sklearn/metrics/_classification.py

+                "defaults to 0 when `normalize=False` is set."
+            )
+            warnings.warn(msg, UndefinedMetricWarning, stacklevel=2)
+            return replace_undefined_by if math.isnan(replace_undefined_by) else 0


Both is possible, but we're replacing np.isnan with math.isnan for array api, so we can already do it here, I think.

StefanieSenger · 2025-04-29T13:46:00Z

sklearn/metrics/_classification.py

@@ -3081,7 +3114,7 @@ def hamming_loss(y_true, y_pred, *, sample_weight=None):

    Returns
    -------
-    loss : float or int
+    loss : float


~~I will do this in a separate PR.~~

These tests had already been added in #30575.

…scikit-learn into undefined_accuracy_score

ENH Add replace_undefined_by to accuracy_score

4e48f15

github-actions bot added the module:metrics label Apr 12, 2025

changelog

e53157f

StefanieSenger commented Apr 12, 2025

View reviewed changes

sklearn/metrics/_classification.py Outdated Show resolved Hide resolved

fix typo

09f57bb

StefanieSenger added this to the 1.7 milestone Apr 12, 2025

StefanieSenger marked this pull request as draft April 12, 2025 17:03

StefanieSenger marked this pull request as ready for review April 12, 2025 17:18

StefanieSenger and others added 2 commits April 15, 2025 09:57

fix return types in docs

63d9f8e

Merge branch 'main' into undefined_accuracy_score

8e13b63

adrinjalali reviewed Apr 23, 2025

View reviewed changes

apply suggestions from code review

fcfd339

StefanieSenger commented Apr 29, 2025

View reviewed changes

Merge branch 'undefined_accuracy_score' of github.com:StefanieSenger/…

a62e487

…scikit-learn into undefined_accuracy_score

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH Add replace_undefined_by to accuracy_score #31187

ENH Add replace_undefined_by to accuracy_score #31187

StefanieSenger commented Apr 12, 2025 •

edited

Loading

github-actions bot commented Apr 12, 2025 •

edited

Loading

adrinjalali left a comment

adrinjalali Apr 23, 2025

StefanieSenger Apr 29, 2025

adrinjalali Apr 23, 2025

StefanieSenger Apr 29, 2025

adrinjalali Apr 23, 2025

StefanieSenger Apr 29, 2025 •

edited

Loading

StefanieSenger left a comment

StefanieSenger Apr 29, 2025

StefanieSenger Apr 29, 2025

StefanieSenger Apr 29, 2025 •

edited

Loading

	- a floating point value in the range of [0.0, 1.0] or int 0
	- a floating point value in the range of $[0.0, 1.0]$ or `int` 0

	return replace_undefined_by if math.isnan(replace_undefined_by) else 0
	return replace_undefined_by if np.isnan(replace_undefined_by) else 0

ENH Add replace_undefined_by to accuracy_score #31187

Are you sure you want to change the base?

ENH Add replace_undefined_by to accuracy_score #31187

Conversation

StefanieSenger commented Apr 12, 2025 • edited Loading

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Open Question

github-actions bot commented Apr 12, 2025 • edited Loading

✔️ Linting Passed

adrinjalali left a comment

Choose a reason for hiding this comment

adrinjalali Apr 23, 2025

Choose a reason for hiding this comment

StefanieSenger Apr 29, 2025

Choose a reason for hiding this comment

adrinjalali Apr 23, 2025

Choose a reason for hiding this comment

StefanieSenger Apr 29, 2025

Choose a reason for hiding this comment

adrinjalali Apr 23, 2025

Choose a reason for hiding this comment

StefanieSenger Apr 29, 2025 • edited Loading

Choose a reason for hiding this comment

StefanieSenger left a comment

Choose a reason for hiding this comment

StefanieSenger Apr 29, 2025

Choose a reason for hiding this comment

StefanieSenger Apr 29, 2025

Choose a reason for hiding this comment

StefanieSenger Apr 29, 2025 • edited Loading

Choose a reason for hiding this comment

StefanieSenger commented Apr 12, 2025 •

edited

Loading

github-actions bot commented Apr 12, 2025 •

edited

Loading

StefanieSenger Apr 29, 2025 •

edited

Loading

StefanieSenger Apr 29, 2025 •

edited

Loading