DOC Clarify usage of d2_pinball_score with model selection tools #31239


Open

wants to merge 3 commits into main from doc/update-d2-pinball-score

Conversation

@MaddyRizvi commented Apr 22, 2025

Reference Issues/PRs

Towards #28671
This PR addresses confusion among users who try to pass d2_pinball_score directly as a string value for the scoring parameter in model selection APIs such as GridSearchCV and RandomizedSearchCV.

Although d2_pinball_score is a valid scoring function, it is not registered as a string scorer and must be wrapped using make_scorer. This was not clearly explained in the docstring.

What does this implement/fix? Explain your changes.

This improves the documentation of sklearn.metrics.d2_pinball_score by:

Adding a note to clarify that this metric is not a valid string identifier for the scoring parameter in model selection tools.

Providing a code example showing how to use make_scorer to wrap the function correctly for use with GridSearchCV or RandomizedSearchCV.

Adding a usage snippet under the Examples section for easy discoverability.

This change is intended to make usage of d2_pinball_score more transparent and reduce common user errors and confusion.

Any other comments?

This PR does not affect the behavior of the function or its API — it is documentation-only.

The motivation arose from real-world usage in probabilistic forecasting, where d2_pinball_score is useful but hard to integrate into model selection workflows due to missing documentation around make_scorer.

github-actions bot commented Apr 22, 2025

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: fe85a06.

Comment on lines 1753 to 1761
This metric is not a built-in scoring string for use in model selection
tools such as `GridSearchCV` or `RandomizedSearchCV`.

To use it as a custom scoring function, wrap it using
:func:`~sklearn.metrics.make_scorer`:

>>> from sklearn.metrics import make_scorer, d2_pinball_score
>>> scorer = make_scorer(d2_pinball_score, alpha=0.95)
>>> # Then use it as `scoring=scorer` in RandomizedSearchCV or GridSearchCV
Contributor:
Suggested change (drop the inline doctest so the note points to the Examples section instead):

    This metric is not a built-in scoring string for use in model selection
    tools such as `GridSearchCV` or `RandomizedSearchCV`.
    To use it as a custom scoring function, wrap it using
    :func:`~sklearn.metrics.make_scorer`. See Examples for details.

We can keep the code snippet in the examples section.

Author:

Moved the scorer usage example to the Examples section and cleaned up the Notes.
Thanks for the feedback — ready for your next review whenever you have time!

Comment on lines 1786 to 1788
>>> # Using with make_scorer
>>> from sklearn.metrics import make_scorer
>>> scorer = make_scorer(d2_pinball_score, alpha=0.95)
Contributor:
Suggested change (expand the snippet):

    Using with :func:`~sklearn.metrics.make_scorer`:

    >>> from sklearn.metrics import make_scorer, d2_pinball_score
    >>> pinball_95_scorer = make_scorer(d2_pinball_score, alpha=0.95)
    >>> from sklearn.model_selection import GridSearchCV
    >>> from sklearn.svm import LinearSVC
    >>> grid = GridSearchCV(
    ...     LinearSVC(),
    ...     param_grid={"C": [1, 10]},
    ...     scoring=pinball_95_scorer,
    ...     cv=5,
    ... )

Expand the example a bit.

Author:

Expanded the Examples section to show how to use d2_pinball_score with
make_scorer and GridSearchCV, and clarified in the Notes that
d2_pinball_score is not a built-in scorer string.

Member:

@yuanx749 note that LinearSVC is a classifier (suited to model discrete class observations in the target variable), while d2_pinball_score is a metric for quantile regression problems. Better use a (quantile) regression model in the example.

@MaddyRizvi force-pushed the doc/update-d2-pinball-score branch from e4f733b to fe85a06 on April 26, 2025, 15:04
@ogrisel (Member) left a comment:

Thanks for the PR. Overall this looks good to me, but the failures reported by the continuous integration need to be addressed. Please find details and further suggestions below:

>>> y_true = [3, -0.5, 2, 7]
>>> y_pred = [2.5, 0.0, 2, 8]
>>> d2_pinball_score(y_true, y_pred, alpha=0.95)
0.968...
Member:

This should help fix the broken tests.

Suggested change: replace the expected doctest output `0.968...` with `0.578...`.
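The corrected value can be double-checked without sklearn. Below is a minimal pure-Python sketch, assuming the score is one minus the ratio of the mean pinball loss of the predictions to that of a null model predicting the empirical alpha-quantile of y_true (linear interpolation, NumPy's default method):

```python
# Hand-rolled check of the corrected doctest value (no sklearn needed).
# Assumption: the null model predicts the empirical alpha-quantile of
# y_true, computed with linear interpolation.

def pinball_loss(y_true, y_pred, alpha):
    # Mean pinball (quantile) loss.
    return sum(
        alpha * max(t - p, 0.0) + (1 - alpha) * max(p - t, 0.0)
        for t, p in zip(y_true, y_pred)
    ) / len(y_true)

def quantile_linear(values, q):
    # q-quantile with linear interpolation between order statistics.
    s = sorted(values)
    pos = q * (len(s) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(s) - 1)
    return s[lo] + (pos - lo) * (s[hi] - s[lo])

def d2_pinball(y_true, y_pred, alpha):
    null_pred = [quantile_linear(y_true, alpha)] * len(y_true)
    return 1.0 - pinball_loss(y_true, y_pred, alpha) / pinball_loss(
        y_true, null_pred, alpha
    )

y_true = [3, -0.5, 2, 7]
y_pred = [2.5, 0.0, 2, 8]
print(d2_pinball(y_true, y_pred, 0.95))  # ~0.5785, matching 0.578...
```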

Member:

Next time, please run pytest --doctest-modules path/to/the/code/you/change.py when editing doctests.

Alternatively, read the logs of the failing continuous integration reports linked from the PR to find out what caused the failures.

... scoring=pinball_95_scorer,
... cv=2,
... )
>>> _ = grid.fit(X, y)
Member:

Maybe you could display the value of grid.best_params_ to make the example more complete. E.g. something like the following:

    >>> grid.fit(X, y).best_params_
    {'fit_intercept': True}

Run the doctest locally to check that this is actually the best param:

$ pytest -v --doctest-modules sklearn/metrics/_regression.py


>>> X = np.array([[1], [2], [3], [4]])
>>> y = np.array([2.5, 0.0, 2, 8])
>>> grid = GridSearchCV(
... LinearRegression(),
Member:

It would make more sense to tune the fit_intercept parameter of QuantileRegressor(quantile=0.95) instead of LinearRegression. LinearRegression predicts an estimate of E[y|X] instead of an estimate of Q_{0.95}(y|X).
