Enhance ROC Curve Display Tests for Improved Clarity and Maintainability #31264

Closed
wants to merge 6 commits into from

Conversation

NEREUScode
Contributor

PR Description:

Summary of Changes:

This PR refactors the `data_binary` fixture in `test_roc_curve_display.py`. The previous fixture filtered the multiclass Iris dataset down to two classes to create a binary classification task. Because the resulting data was trivially separable, AUC values consistently reached 1.0, which does not reflect real-world difficulty.

The new fixture uses `make_classification` from `sklearn.datasets` to generate a synthetic binary classification dataset with the following characteristics:

  • 200 samples and 20 features.
  • 5 informative features and 2 redundant features.
  • 10% label noise (flip_y=0.1) to simulate real-world imperfections in the data.
  • Class separation (class_sep=0.8) set to avoid perfect separation.

These changes provide a more complex and representative dataset for testing `RocCurveDisplay` and related metrics, thereby improving the robustness of the tests.

Reference Issues/PRs:


For Reviewers:

  • This change ensures that the dataset used for testing is more reflective of real-world data, particularly in classification tasks that may involve noise and less clear separation between classes.

Replaced the `data_binary` fixture that filtered classes from a multiclass dataset 
with a new fixture generating a synthetic binary classification dataset using 
`make_classification`. This ensures consistent data characteristics, introduces 
label noise, and better simulates real-world classification challenges.

github-actions bot commented Apr 28, 2025

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: 57bc822. Link to the linter CI: here

@NEREUScode
Contributor Author

@lucyleeow I removed `data()` as suggested, but I'm still unsure why the Linux check is failing.

```diff
@@ -1,11 +1,11 @@
 import numpy as np
 import pytest
 from numpy.testing import assert_allclose
-from scipy.integrate import trapezoid
+from scipy.integrate import trapz as trapezoid
```
Member


Why this change?
The CI failure seems to be due to this:

```
E   ImportError: cannot import name 'trapz' from 'scipy.integrate' (/usr/share/miniconda/envs/testvenv/lib/python3.13/site-packages/scipy/integrate/__init__.py)
```

you can see the test failure details by clicking through 'details' eg. https://dev.azure.com/scikit-learn/scikit-learn/_build/results?buildId=76020&view=logs&j=dde5042c-7464-5d47-9507-31bdd2ee0a3a&t=4bd2dad8-62b3-5bf9-08a5-a9880c530c94&l=918
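
For context, `scipy.integrate.trapz` was removed in SciPy 1.14, while `scipy.integrate.trapezoid` has been available since SciPy 1.6, so the diff above points the import in the wrong direction. A backward-compatible import (an illustration of the general pattern, not the fix adopted in this PR) would prefer the new name and only fall back to the old one:

```python
# Prefer the modern name; fall back for very old SciPy versions where
# only `trapz` exists. On SciPy >= 1.14 the fallback branch would fail,
# which is exactly why the try/except is ordered this way.
try:
    from scipy.integrate import trapezoid
except ImportError:  # SciPy < 1.6
    from scipy.integrate import trapz as trapezoid

# Trapezoidal rule over y = [0, 1, 2] with unit spacing: area = 2.0
area = trapezoid([0, 1, 2])
```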

Contributor Author


Thanks a lot, I'll fix it.

@NEREUScode NEREUScode closed this Apr 28, 2025

Successfully merging this pull request may close these issues.

Use more complex data in test_roc_curve_display.py