ENH Add mean_pinball_loss metric for quantile regression #19415

sdpython · 2021-02-09T16:10:09Z

Reference Issues/PRs

Example: Fixes #18911.

What does this implement/fix?

Add function pinball_error as a new regression scoring function used to estimate quantile regression with quantile != 0.5.

sklearn/metrics/__init__.py

ogrisel

Thanks @sdpython! Here is a first pass of review comments:

doc/modules/classes.rst

doc/modules/model_evaluation.rst

sklearn/metrics/_regression.py

sklearn/metrics/tests/test_regression.py

doc/modules/model_evaluation.rst

ogrisel

Thanks, I find the updated example very interesting.

sklearn/ensemble/tests/test_gradient_boosting_loss_functions.py

examples/ensemble/plot_gradient_boosting_quantile.py

doc/modules/model_evaluation.rst

ogrisel · 2021-02-15T17:13:57Z

@sdpython @lorentzenchr I pushed the renaming in my last commits. I think I also addressed most of the pending review comments. Let me know what you think.

lorentzenchr

@ogrisel Accompanying you improving examples is a pleasure and a lot of fun. Now I learned how to highlight cells in displayed pandas tables:smiley:

sklearn/metrics/tests/test_regression.py

examples/ensemble/plot_gradient_boosting_quantile.py

ogrisel · 2021-02-15T21:08:06Z

Now I learned how to highlight cells in displayed pandas tables😃

I googled after your comment and found this solution on stackoverflow :)

lorentzenchr

LGTM

sklearn/metrics/tests/test_regression.py

Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>

lorentzenchr · 2021-02-18T18:39:33Z

The CI failure of test_imports_strategies in macOS pylatest_conda_forge_mkl seems unrelated. Therefore, I dare to merge...

ogrisel · 2021-02-19T11:15:42Z

Thanks for the PR @sdpython! and @lorentzenchr for the reviews :)

ghost · 2021-05-26T13:58:24Z

Should this be median pinball loss for alpha=0.5/tau=0.5 (as per https://www.tensorflow.org/addons/api_docs/python/tfa/losses/pinball_loss)?

lorentzenchr · 2021-05-26T14:12:16Z

This PR implemented the mean of the pinball loss as a metric, which equals 1/2 * absolute loss for alpha=0.5 (which elicits the median). I think the tensorflow fomula has a typo (i.e. is incorrect).

airmilesabdullah · 2021-12-22T22:10:56Z

hi there, I noticed that when my pandas dataframe has more than 10,000 rows, I get an out-of-memory error. Why does this occur and is there a way around it?

add pinball_error

468c156

sdpython changed the title ~~Add pinball_error metrics, fixes issue #18911~~ [WIP] Add pinball_error metrics, fixes issue #18911 Feb 9, 2021

glemaitre reviewed Feb 9, 2021

View reviewed changes

sklearn/metrics/__init__.py Outdated Show resolved Hide resolved

sdpython added 2 commits February 9, 2021 17:20

fix \r issue

05dca44

add \r

367b265

github-actions bot added module:ensemble module:metrics labels Feb 9, 2021

sdpython added 4 commits February 9, 2021 18:11

add \r

d710a6f

remove \r

72d2d51

fix lint issue

5bc2bc5

lint

621abdd

sdpython changed the title ~~[WIP] Add pinball_error metrics, fixes issue #18911~~ Add pinball_error metrics, fixes issue #18911 Feb 9, 2021

lint

67f3c02

ogrisel reviewed Feb 9, 2021

View reviewed changes

sdpython added 6 commits February 9, 2021 19:27

rename pinball_error into pinball_loss

789442c

check exception is raised

076452d

Fix failing unit test

359538e

refactor example on gradient boosting

1c2d6d2

lint

ef64dd0

lint

9e56a19

ogrisel reviewed Feb 10, 2021

View reviewed changes

doc/modules/model_evaluation.rst Outdated Show resolved Hide resolved

sdpython added 2 commits February 10, 2021 14:38

add dependency on tqdm for examples, fix documentation

f07ad17

add new unit test

c41f2e3

ogrisel reviewed Feb 10, 2021

View reviewed changes

examples/ensemble/plot_gradient_boosting_quantile.py Outdated Show resolved Hide resolved

sdpython added 4 commits February 10, 2021 18:37

improve example, add another test on pinball_error with sample_weights

1bdc94e

lint

0aa34df

fix failing test due to very small discrepencies

31d721c

documentation

f8988c7

ogrisel changed the title ~~Add pinball_error metrics, fixes issue #18911~~ Add pinball_loss metric for quantile regression, fixes issue #18911 Feb 11, 2021

ogrisel added 2 commits February 15, 2021 16:35

Minimize pinball loss with Nelder-Mead

1b2d0ed

Small fixes and improvements in model_evaluation.rst

f6482e6

ogrisel reviewed Feb 15, 2021

View reviewed changes

doc/modules/model_evaluation.rst Outdated Show resolved Hide resolved

ogrisel added 3 commits February 15, 2021 17:41

Rename variable in test

bc1882a

Rename pinball_loss to mean_pinball_loss

53e0230

Fix linter

6694b1f

ogrisel added 3 commits February 15, 2021 18:39

Fix missing indent in math formula

d139dc2

Fix phrasing

728d632

Add integration test

eb22059

ogrisel changed the title ~~Add pinball_loss metric for quantile regression, fixes issue #18911~~ Add mean_pinball_loss metric for quantile regression, fixes issue #18911 Feb 15, 2021

lorentzenchr reviewed Feb 15, 2021

View reviewed changes

ogrisel added 4 commits February 15, 2021 22:17

Change optimization test to make it run faster

8edbed1

Missing cell marker

1b36537

Missing comas

85ab9d3

DOC small improvements

70d9323

ogrisel requested a review from lorentzenchr February 16, 2021 17:48

ogrisel added the Waiting for Reviewer label Feb 16, 2021

lorentzenchr approved these changes Feb 16, 2021

View reviewed changes

sklearn/metrics/tests/test_regression.py Show resolved Hide resolved

sklearn/metrics/tests/test_regression.py Outdated Show resolved Hide resolved

Update sklearn/metrics/tests/test_regression.py

7820618

Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>

lorentzenchr changed the title ~~Add mean_pinball_loss metric for quantile regression, fixes issue #18911~~ ENH Add mean_pinball_loss metric for quantile regression Feb 18, 2021

lorentzenchr merged commit 6a6217f into scikit-learn:main Feb 18, 2021

lorentzenchr mentioned this pull request Feb 18, 2021

META Quantile Regression #18997

Open

4 tasks

glemaitre mentioned this pull request Apr 22, 2021

Release 0.24.2 #19954

Merged

12 tasks

Uh oh!

ENH Add mean_pinball_loss metric for quantile regression #19415

ENH Add mean_pinball_loss metric for quantile regression #19415

Uh oh!

Conversation

sdpython commented Feb 9, 2021

Reference Issues/PRs

What does this implement/fix?

Uh oh!

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ogrisel commented Feb 15, 2021

Uh oh!

lorentzenchr left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ogrisel commented Feb 15, 2021

Uh oh!

lorentzenchr left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

lorentzenchr commented Feb 18, 2021

Uh oh!

ogrisel commented Feb 19, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ghost commented May 26, 2021

Uh oh!

lorentzenchr commented May 26, 2021

Uh oh!

airmilesabdullah commented Dec 22, 2021

Uh oh!

Uh oh!

ogrisel commented Feb 19, 2021 •

edited

Loading