
[FEAT] Implement quantile SVR #23153


Open · wants to merge 21 commits into main

Conversation


@atrettin commented Apr 18, 2022

What is implemented?

This PR implements quantile regression using support-vector machines.

Mathematically, this applies the "kernel trick" to an L2-regularized linear regression that minimizes the pinball loss. For linear kernels and without regularization, this would give the same result as the existing QuantileRegressor (see #9978). The dual problem is derived in Hwang and Shim (2005).
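For concreteness, the pinball loss for a target quantile q weights over- and under-predictions asymmetrically. A minimal sketch (the `pinball_loss` helper is illustrative; `mean_pinball_loss` is the existing scikit-learn metric):

```python
import numpy as np
from sklearn.metrics import mean_pinball_loss

def pinball_loss(y_true, y_pred, q):
    # q * (y - y_hat) for under-predictions, (q - 1) * (y - y_hat) otherwise;
    # its expectation is minimized by the q-th conditional quantile of y.
    diff = y_true - y_pred
    return np.mean(np.maximum(q * diff, (q - 1) * diff))

y_true = np.array([1.0, 2.0, 3.0])
y_pred = np.array([1.5, 1.5, 1.5])
assert np.isclose(
    pinball_loss(y_true, y_pred, q=0.9),
    mean_pinball_loss(y_true, y_pred, alpha=0.9),
)
```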

Implementation-wise, the algorithm is only a slight modification of epsilon-SVR. In fact, when the quantile is set to 0.5, the regression is exactly equivalent to an epsilon-SVR with epsilon set to zero! Thanks to this close similarity, only a few changes relative to epsilon-SVR are needed to make it work, and the efficiency is the same as that of epsilon-SVR (for better or worse).
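A quick sanity check of this equivalence could look as follows (a sketch only: `QuantileSVR` and its `quantile` parameter are the API proposed in this PR):

```python
import numpy as np
from sklearn.svm import SVR, QuantileSVR  # QuantileSVR: proposed in this PR

rng = np.random.RandomState(0)
X = rng.uniform(-1, 1, size=(200, 1))
y = np.sin(3 * X.ravel()) + 0.1 * rng.standard_normal(200)

# At quantile=0.5 the dual problem coincides with epsilon-SVR at epsilon=0,
# so both models should learn (nearly) the same function.
qsvr = QuantileSVR(quantile=0.5, kernel="rbf", C=1.0).fit(X, y)
svr = SVR(epsilon=0.0, kernel="rbf", C=1.0).fit(X, y)
assert np.allclose(qsvr.predict(X), svr.predict(X), atol=1e-6)
```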

Why is this useful?

Scikit-learn already contains a quantile regressor, but it is restricted to linear problems. Although this restriction can be partially alleviated by transforming the input into polynomial features or B-splines, it would be far more useful, especially in more than one dimension, to be able to apply the kernel trick. In addition, L2 regularization is probably more desirable than L1 regularization for most regression problems.

The QuantileSVR regressor can be used to estimate prediction intervals for non-linear functions, as shown in the example (see example code and the sketch below).
[Figure quantile_svr: prediction intervals estimated with QuantileSVR]
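A minimal sketch of how such intervals could be computed (again assuming the `QuantileSVR` signature proposed in this PR):

```python
import numpy as np
from sklearn.svm import QuantileSVR  # proposed in this PR

rng = np.random.RandomState(42)
X = np.sort(rng.uniform(0, 10, size=(300, 1)), axis=0)
# Heteroscedastic noise: the interval width should grow with X.
y = np.sin(X).ravel() + (0.1 + 0.1 * X.ravel()) * rng.standard_normal(300)

# One model per quantile yields a 90% prediction interval plus a median fit.
lower = QuantileSVR(quantile=0.05, kernel="rbf").fit(X, y)
median = QuantileSVR(quantile=0.50, kernel="rbf").fit(X, y)
upper = QuantileSVR(quantile=0.95, kernel="rbf").fit(X, y)

X_test = np.linspace(0, 10, 100).reshape(-1, 1)
y_low, y_med, y_high = (m.predict(X_test) for m in (lower, median, upper))
```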

References

Hwang, C., Shim, J. (2005). A Simple Quantile Regression via Support Vector Machine. In: Wang, L., Chen, K., Ong, Y.S. (eds) Advances in Natural Computation. ICNC 2005. Lecture Notes in Computer Science, vol 3610. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539087_66

@atrettin marked this pull request as draft April 18, 2022 10:39
@atrettin changed the title Implement quantile SVR [WIP] [FEAT] Implement quantile SVR Apr 18, 2022
@atrettin (Author)

LGTM analysis fails due to version bump in PR #22674. Should I bump it in lgtm.yml?

@atrettin changed the title [WIP] [FEAT] Implement quantile SVR [FEAT] Implement quantile SVR Apr 19, 2022
@atrettin marked this pull request as ready for review April 19, 2022 11:56
@atrettin (Author)

Rebased onto latest main and fixed unit tests. It looks like the LGTM issue has been fixed in the meantime, yay! But the doc-min-dependencies workflow just timed out for no discernible reason, depriving me of the green check mark. :( Anyway, should anyone be interested in this regressor, I would appreciate feedback!

@ogrisel (Member) left a comment


Thanks for the PR, this is very interesting. Given the fact that this new estimator can be implemented with very little new code, I think it's worth considering inclusion in scikit-learn.

To make the PR easier to review, could you please avoid changing lines of svm.cpp that are unrelated to the topic of the PR (e.g. trailing spaces)?

For the example, it would be interesting to compare several methods:

  • nonlinear feature engineering with SplineTransformer (+Nystroem?) followed by QuantileRegressor
  • quantile SVR with a non-linear kernel
  • tree-based quantile methods (e.g. gradient boosted trees)

and have a concluding paragraph that gives some pros and cons of each method (a rough sketch of such a comparison follows below).

Also, to avoid example proliferation, maybe it would be worth expanding the existing example:

https://scikit-learn.org/dev/auto_examples/ensemble/plot_gradient_boosting_quantile.html#sphx-glr-auto-examples-ensemble-plot-gradient-boosting-quantile-py

by adding new sections at the end.
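A rough sketch of the comparison mentioned above (hyperparameters are purely illustrative; `QuantileSVR` is the estimator proposed in this PR, while the other estimators already exist in scikit-learn):

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.linear_model import QuantileRegressor
from sklearn.metrics import mean_pinball_loss
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import SplineTransformer
from sklearn.svm import QuantileSVR  # proposed in this PR

q = 0.9
models = {
    "splines + QuantileRegressor": make_pipeline(
        SplineTransformer(n_knots=10),
        QuantileRegressor(quantile=q, alpha=1e-3),
    ),
    "QuantileSVR (RBF kernel)": QuantileSVR(quantile=q, kernel="rbf"),
    "gradient boosted trees": GradientBoostingRegressor(loss="quantile", alpha=q),
}

rng = np.random.RandomState(0)
X = rng.uniform(0, 10, size=(500, 1))
y = np.sin(X).ravel() + rng.standard_normal(500)

for name, model in models.items():
    model.fit(X, y)
    print(name, mean_pinball_loss(y, model.predict(X), alpha=q))
```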

About the CI timeout: it's unfortunate and we need to debug it, but it might be transient, so feel free to push a new empty commit to your PR to re-trigger the CI when this happens (and/or merge main so that if/when we fix the problem in main, your PR can benefit from the fix).

/cc @lorentzenchr

assert_allclose(np.linalg.norm(qvr.coef_), np.linalg.norm(svr.coef_), 1, 0.0001)
assert_almost_equal(score1, score2, 2)


@ogrisel (Member) commented Jul 5, 2022


Could you please add tests that check that it converges to the expected results (e.g. by measuring the pinball loss on the training set and checking that it's small) for different values of the quantile parameter, for instance on a synthetic dataset with fixed, repeated X values and a known distribution of Y|X.
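For instance, something along these lines (a sketch assuming the `QuantileSVR` API from this PR):

```python
import numpy as np
import pytest
from scipy.stats import norm
from sklearn.metrics import mean_pinball_loss
from sklearn.svm import QuantileSVR  # proposed in this PR

@pytest.mark.parametrize("quantile", [0.1, 0.5, 0.9])
def test_quantile_svr_training_pinball_loss(quantile):
    # Synthetic data with repeated X values and Gaussian Y|X, so the true
    # conditional quantiles are known in closed form.
    rng = np.random.RandomState(0)
    X = np.repeat(np.arange(10.0), 100).reshape(-1, 1)
    y = X.ravel() + rng.standard_normal(X.shape[0])

    model = QuantileSVR(quantile=quantile, kernel="linear", C=10.0).fit(X, y)
    y_pred = model.predict(X)

    # Empirical coverage on the training set should match the quantile...
    assert np.mean(y <= y_pred) == pytest.approx(quantile, abs=0.05)
    # ...and the training pinball loss should be close to that of an oracle
    # predicting the true conditional quantile.
    oracle = X.ravel() + norm.ppf(quantile)
    assert mean_pinball_loss(y, y_pred, alpha=quantile) <= (
        1.1 * mean_pinball_loss(y, oracle, alpha=quantile)
    )
```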

@atrettin (Author)

Okay, I will get to this soon! Thank you so much for reviewing!

@atrettin (Author)

Alright, I added several additional tests (which I admit to having shamelessly stolen from QuantileRegressor). They test that the quantiles are well calibrated with any kernel, that illegal inputs raise appropriate errors, and, finally, that (linear) QSVR gives approximately the same result as a Nelder-Mead minimization of the pinball loss.
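The Nelder-Mead comparison is roughly in this spirit (a sketch, assuming the `QuantileSVR` API; a large `C` keeps the L2 penalty weak so the two fits are comparable):

```python
import numpy as np
from scipy.optimize import minimize
from sklearn.svm import QuantileSVR  # proposed in this PR

rng = np.random.RandomState(0)
X = rng.standard_normal((100, 2))
y = X @ np.array([1.0, -2.0]) + rng.standard_normal(100)
q = 0.75

def objective(params):
    # Pinball loss of a linear model: params[0] is the intercept.
    residual = y - (params[0] + X @ params[1:])
    return np.mean(np.maximum(q * residual, (q - 1) * residual))

res = minimize(objective, x0=np.zeros(3), method="Nelder-Mead")
qsvr = QuantileSVR(quantile=q, kernel="linear", C=1e4).fit(X, y)

# With weak regularization the two linear fits should roughly agree.
np.testing.assert_allclose(qsvr.coef_.ravel(), res.x[1:], atol=0.1)
```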

@@ -1131,6 +1131,10 @@ Changelog
:mod:`sklearn.svm`
..................

- |Feature| :class:`svm.QuantileSVR` implements quantile regression with support vector
  machines as derived in Hwang, C., Shim, J. (2005), DOI: 10.1007/11539087_66.
  :pr:`23153` by :user:`Alexander Trettin <atrettin>`.
Member

This will need to be moved to v1.2.rst (sorry for the slow feedback...).

@atrettin (Author)

Done!

@atrettin (Author) commented Jul 5, 2022

Thank you for having a look at this, @ogrisel! I will look at the example and perhaps merge it with the example you linked, and add some more unit tests (where I might just shamelessly copy a few things from QuantileRegressor, since it checks some of the same things).

@@ -722,6 +726,7 @@ def __init__(
C=C,
nu=nu,
epsilon=0.0,
quantile=0.0,
@ogrisel (Member) commented Jul 5, 2022


Would it be possible to pass quantile=None in models for which the quantile parameter is unused (and actually not pass it in the call to super().__init__ when left at the default value) in the Python-level API of the classes?

Actually, scratch that. I re-read how the base class BaseLibSVM is organized and it does not use kwargs in its __init__.

It's a bit unfortunate that BaseLibSVM is responsible for setting the public attributes. It would be necessary to refactor it to avoid setting the parameters that are unused in the concrete classes (e.g. SVC.quantile, SVC.nu and so on). However, I have the feeling that this refactoring should be done in another PR.

@atrettin (Author)

Now that you point it out, I'm not sure why I put this into BaseSVC, since that is the base class for classifiers and QSVR is not a classifier. I'll check whether I can get rid of it there. Other than that, the module is just built in this somewhat odd way in which the base classes anticipate the arguments of all possible subclasses; epsilon=0.0 is also passed every time, even though not every SVM uses it.

Member

This is probably because the libsvm C++ wrapper functions (libsvm.fit / libsvm_sparse.libsvm_sparse_train) expect a fixed list of mandatory arguments and only the base class is calling the wrapper internal API.

We could change it to pass getattr(self, "quantile", 0.0) instead of self.quantile and only set the self.quantile attribute in the __init__ of the concrete QuantileSVR subclass. This way this attribute would not be set on unrelated classes such as SVC.

We could fix other subclass-specific attributes (e.g. self.nu) in a similar way, but I would rather keep this PR focused on quantile regression.
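The idea in a nutshell (a sketch of the relevant spot in `BaseLibSVM.fit`, not the actual diff):

```python
# Inside BaseLibSVM.fit: read `quantile` defensively, so that only the
# QuantileSVR subclass has to define the attribute in its __init__.
quantile = getattr(self, "quantile", 0.0)
# ...then pass `quantile` to libsvm.fit / libsvm_sparse.libsvm_sparse_train
# along with the other fixed, mandatory arguments.
```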

@atrettin (Author)

Alright, I did what you suggested and used getattr with a default value when calling the libsvm function, and removed the quantile attribute from the base classes.

@atrettin (Author)

I think this leaves only the better example on the to-do list, but I won't be able to get around to that today. This is plenty of material to review already. :)
