FEA Regression error characteristic curve and plotting #31380

Open · wants to merge 28 commits into main

Conversation

alexshtf

@alexshtf alexshtf commented May 18, 2025

Description

Computes the regression error characteristic (REC) curve [1], which is essentially the CDF of the regression errors. It plays a role analogous to that of ROC curves: it allows comparing the performance profiles of regressors beyond a single summary statistic such as RMSE or MAE.
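For intuition, the REC curve is just the empirical CDF of the absolute residuals. A minimal NumPy sketch (the helper name `rec_curve` is hypothetical, not part of this PR's API):

```python
import numpy as np

def rec_curve(y_true, y_pred):
    # Empirical CDF of absolute errors: for each error tolerance on the
    # x-axis, the fraction of samples whose absolute error is within it.
    errors = np.sort(np.abs(np.asarray(y_true) - np.asarray(y_pred)))
    accuracy = np.arange(1, len(errors) + 1) / len(errors)
    return errors, accuracy

y_true = np.array([1.0, 2.0, 3.0, 4.0])
y_pred = np.array([1.1, 2.0, 2.5, 5.0])
tol, acc = rec_curve(y_true, y_pred)
# tol = [0.0, 0.1, 0.5, 1.0], acc = [0.25, 0.5, 0.75, 1.0]
```

Plotting `acc` against `tol` as a step function gives the REC curve: a curve that rises faster (toward the top left) indicates a regressor whose errors concentrate near zero.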

Examples

Used like this:

from sklearn.linear_model import LinearRegression
from sklearn.metrics import RecCurveDisplay

lr_estimator = LinearRegression()
lr_estimator.fit(X_train, y_train)

RecCurveDisplay.from_estimator(lr_estimator, X_test, y_test, name="Linear regression")

The result looks like this:

[Figure: REC curve of the linear regression model alongside the constant-prediction baseline]

It allows comparing a regressor to a constant-prediction baseline (by default, the median of the test targets).
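The median is a natural choice for the baseline because it is the constant that minimizes the mean absolute error on the test targets, so its REC curve is the profile of the best possible constant predictor. A small illustration (variable names are hypothetical):

```python
import numpy as np

y_test = np.array([3.0, 1.0, 4.0, 1.0, 5.0])

# Constant predictor: the median of the test targets.
baseline_pred = np.median(y_test)           # 3.0 for this sample
baseline_errors = np.abs(y_test - baseline_pred)
# Sorted absolute errors [0, 1, 2, 2, 2] trace the baseline REC curve.
```

Any regressor whose REC curve does not dominate this baseline's curve is, at the corresponding tolerances, doing no better than always predicting the median.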

We can also compare several regressors:

import matplotlib.pyplot as plt
from sklearn.metrics import RecCurveDisplay, mean_absolute_error

fig, ax = plt.subplots()

RecCurveDisplay.from_predictions(
    y_test,
    pred_lr,
    ax=ax,
    name=f"Linear regression (MAE={mean_absolute_error(y_test, pred_lr):.2f})",
    plot_const_predictor=False,
)

RecCurveDisplay.from_predictions(
    y_test,
    pred_knn,
    ax=ax,
    name=f"KNN (MAE={mean_absolute_error(y_test, pred_knn):.2f})",
    plot_const_predictor=False,
)

# This call also plots the constant predictor for reference - note that
# plot_const_predictor is left at its default (True).
RecCurveDisplay.from_predictions(
    y_test,
    pred_hgbr,
    ax=ax,
    name=f"Gradient Boosting Regressor (MAE={mean_absolute_error(y_test, pred_hgbr):.2f})",
)

fig.show()

This will plot something like this:

[Figure: REC curves of the three regressors and the constant baseline]

Here one regressor clearly dominates the others: its curve lies above theirs at every error tolerance.

Sometimes the curves cross. In that case there is no clear domination, and the performance profiles of the regressors differ: one may be better at small error tolerances, while another may be better at large ones.
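When the curves cross, a scalar summary can still help: Bi and Bennett [1] note that the area over the REC curve (AOC) estimates the expected error, so a smaller AOC is better. A hedged sketch of that summary (the helper `rec_aoc` is hypothetical, and it uses a trapezoidal approximation rather than the exact step-function area):

```python
import numpy as np

def rec_aoc(y_true, y_pred, max_tol=None):
    # Area over the REC curve up to max_tol (defaults to the largest
    # observed error). Approximates the expected error; smaller is better.
    errors = np.sort(np.abs(np.asarray(y_true) - np.asarray(y_pred)))
    acc = np.arange(1, len(errors) + 1) / len(errors)
    if max_tol is None:
        max_tol = errors[-1]
    # Trapezoidal area under the accuracy curve, then subtract from the
    # full rectangle [0, max_tol] x [0, 1] to get the area over it.
    area_under = np.sum((errors[1:] - errors[:-1]) * (acc[1:] + acc[:-1]) / 2)
    return max_tol - area_under

aoc = rec_aoc([0.0, 0.0, 0.0, 0.0], [1.0, 2.0, 3.0, 4.0])
# With errors [1, 2, 3, 4] this evaluates to 2.125, close to the MAE of 2.5.
```

Comparing AOC values gives a single ranking even when neither curve dominates, at the cost of hiding where along the tolerance axis each regressor wins.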

P.S. This is my first ever contribution to this phenomenal library. I hope I haven't missed anything important.


References

[1] Bi, J. and Bennett, K. P., 2003. Regression error characteristic curves. In Proceedings of the 20th International Conference on Machine Learning (ICML-03), pp. 43-50.


github-actions bot commented May 18, 2025

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: bd3649b.

@alexshtf alexshtf marked this pull request as draft May 18, 2025 19:08
@alexshtf alexshtf marked this pull request as ready for review May 19, 2025 08:01