Mention possibility of regression targets in warning about unique classes >50% of n_samples #31689

lucyleeow · 2025-07-02T12:37:29Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

In #26335 we added a warning when unique classes >50% of n_samples. This adds a note that it could also be due to the data being from a regression problem.

Any other comments?

github-actions · 2025-07-02T12:38:25Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 428ef4d. Link to the linter CI: here}

jeremiedbb · 2025-07-02T13:44:37Z

sklearn/utils/multiclass.py

+            "of samples. Note this may be because the data belongs to a regression "
+            "problem, not a classification problem.",


That would only be triggered by a regression target containing only integers and with an integer dtype. Is it worth it ?

Well technically integer float values are "multiclass", but I get your point...

type_of_target([1.0, 0.0, 3.0]) 'multiclass'

Maybe it is too rare. Happy to close

Right, I misread the previous code block, integer float values end-up as multiclass.

The original motivation for the warning was not about confusion with regression problems according to #16399 (comment).
However this comment #26335 (comment) goes in your direction. So I think your addition is worth.

Hmm yes. Christian also mentions it later: #26335 (comment)

thomasjpfan · 2025-07-11T16:19:24Z

sklearn/utils/multiclass.py

+            "of samples. Note this may be because the data belongs to a regression "
+            "problem, not a classification problem.",


Small nit:

Suggested change

"of samples. Note this may be because the data belongs to a regression "

"problem, not a classification problem.",

"of samples. The target data could represent a regression "

"problem, not a classification problem.",

This is nicer thank you. What do you think about:

"y could represent a regression problem, not a classification problem." ..?

(I am happy either way)

Your suggestion works for me to.

…sses >50% of n_samples (scikit-learn#31689) Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

update warn

300d828

github-actions bot added the module:utils label Jul 2, 2025

jeremiedbb reviewed Jul 2, 2025

View reviewed changes

jeremiedbb approved these changes Jul 11, 2025

View reviewed changes

jeremiedbb added the No Changelog Needed label Jul 11, 2025

Merge branch 'main' into type_target_warn

ece5861

thomasjpfan reviewed Jul 11, 2025

View reviewed changes

review

428ef4d

StefanieSenger mentioned this pull request Jul 14, 2025

type_of_target misclassifies count/ordinal regression targets as multiclass #31752

Closed

thomasjpfan approved these changes Jul 14, 2025

View reviewed changes

thomasjpfan merged commit 6848353 into scikit-learn:main Jul 14, 2025
36 checks passed

lucyleeow deleted the type_target_warn branch July 14, 2025 21:58

jeremiedbb added a commit to jeremiedbb/scikit-learn that referenced this pull request Jul 15, 2025

Mention possibility of regression targets in warning about unique cla…

78ff82d

…sses >50% of n_samples (scikit-learn#31689) Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

jeremiedbb mentioned this pull request Jul 15, 2025

Release 1.7.1 #31762

Merged

13 tasks

jeremiedbb added a commit to jeremiedbb/scikit-learn that referenced this pull request Jul 16, 2025

Mention possibility of regression targets in warning about unique cla…

b7a2d1c

…sses >50% of n_samples (scikit-learn#31689) Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

jeremiedbb added a commit to jeremiedbb/scikit-learn that referenced this pull request Jul 16, 2025

Mention possibility of regression targets in warning about unique cla…

9b3c90d

…sses >50% of n_samples (scikit-learn#31689) Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

jeremiedbb added a commit to jeremiedbb/scikit-learn that referenced this pull request Jul 16, 2025

Mention possibility of regression targets in warning about unique cla…

63ca649

…sses >50% of n_samples (scikit-learn#31689) Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

jeremiedbb added a commit to jeremiedbb/scikit-learn that referenced this pull request Jul 16, 2025

Mention possibility of regression targets in warning about unique cla…

b2575a2

…sses >50% of n_samples (scikit-learn#31689) Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

jeremiedbb added a commit to jeremiedbb/scikit-learn that referenced this pull request Jul 16, 2025

Mention possibility of regression targets in warning about unique cla…

c6cdfde

…sses >50% of n_samples (scikit-learn#31689) Co-authored-by: Jérémie du Boisberranger <jeremie@probabl.ai>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Mention possibility of regression targets in warning about unique classes >50% of n_samples #31689

Mention possibility of regression targets in warning about unique classes >50% of n_samples #31689

Uh oh!

lucyleeow commented Jul 2, 2025

Uh oh!

github-actions bot commented Jul 2, 2025 •

edited

Loading

Uh oh!

jeremiedbb Jul 2, 2025

Uh oh!

lucyleeow Jul 3, 2025

Uh oh!

lucyleeow Jul 3, 2025

Uh oh!

jeremiedbb Jul 11, 2025

Uh oh!

lucyleeow Jul 11, 2025

Uh oh!

thomasjpfan Jul 11, 2025

Uh oh!

lucyleeow Jul 12, 2025 •

edited

Loading

Uh oh!

thomasjpfan Jul 12, 2025

Uh oh!

lucyleeow Jul 14, 2025

Uh oh!

Uh oh!

Uh oh!

		"of samples. Note this may be because the data belongs to a regression "
		"problem, not a classification problem.",

Uh oh!

Mention possibility of regression targets in warning about unique classes >50% of n_samples #31689

Mention possibility of regression targets in warning about unique classes >50% of n_samples #31689

Uh oh!

Conversation

lucyleeow commented Jul 2, 2025

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

github-actions bot commented Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✔️ Linting Passed

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lucyleeow Jul 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Jul 2, 2025 •

edited

Loading

lucyleeow Jul 12, 2025 •

edited

Loading