
[MRG] Add quantile regression #9978


Merged: 147 commits, May 25, 2021

Conversation

@avidale (Contributor) commented Oct 22, 2017

This PR fixes issue #3148

This new feature implements quantile regression, an algorithm that directly minimizes the pinball loss of a linear regression model (which, for the median, reduces to the mean absolute error).
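For reference, a minimal NumPy sketch of the pinball loss such a model minimizes; this is the standard textbook definition, not code taken from the PR:

```python
import numpy as np

def pinball_loss(y_true, y_pred, q=0.5):
    """Pinball (quantile) loss; at q = 0.5 it is half the mean absolute error."""
    resid = y_true - y_pred
    # positive residuals are weighted by q, negative ones by (1 - q)
    return np.mean(np.maximum(q * resid, (q - 1) * resid))
```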

The work is still in progress, but I do want to receive some feedback.

@jnothman (Member)

Thanks. You'll need a lot of patience for core devs to review this in detail, I suspect. We have a number of pressing API issues as well as highly requested features that have been in the reviewing queue for some time. But I hope we'll get here soon enough!

@ashimb9 (Contributor) commented Nov 28, 2017

@avidale Thanks a lot for your contribution! @jnothman has already outlined the broader picture, but FWIW, here are a few pointers from a wanderer for when the core devs become available. First, I would suggest that you look into resolving the CI test failures. After that, you might want to develop the PR to the point where you feel comfortable changing its status from [WIP] to [MRG]. (Of course, this is of no use if you actually need some comments/ideas before you can work any further.) In my limited experience, [WIP]s are usually not prioritized for review (but don't quote me on this ;)), so you might want to consider the change. Finally, when you get to that point, you might want to tag some of the core devs who participated in the original discussion, since some of them might have missed the initial post.

@JasonSanchez

This is a great addition to scikit-learn. I would personally really like to see it merged.

@jnothman (Member) left a comment:

  • You have test failures to deal with.
  • Some parameters have not been tested, such as l1_ratio.
  • Please add the class to doc/modules/classes.rst.
  • We no longer use assert_true or assert_greater; just use bare assert.

@avidale (Contributor, Author) commented Mar 18, 2018

Under Travis CI there is another failure: the 'nit' attribute is not found in the result of a scipy.optimize call. That job runs with scipy 0.13.3, which might be too old. What would you recommend?

  • Add a workaround for old versions of scipy?
  • Remove the 'nit' functionality altogether?
  • Mark the class as working only with newer scipy and change my test to respect this restriction?

Current solution: I just changed the solver from BFGS to L-BFGS-B; the latter has reported nit since scipy 0.12.
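For illustration, a minimal sketch of the defensive pattern such a workaround could use; the toy objective is hypothetical, and only the getattr guard is the point:

```python
import numpy as np
from scipy.optimize import minimize

def objective(w):
    # toy quadratic, just to have something to minimize
    return np.sum((w - 1.0) ** 2)

result = minimize(objective, x0=np.zeros(3), method="L-BFGS-B")
# Older scipy versions do not expose 'nit' for every solver,
# so read it defensively instead of assuming it exists.
n_iter = getattr(result, "nit", None)
```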

@ghost commented May 23, 2021

I have had a go with a real training set as follows:

X: pandas DataFrame (61296 rows × 2846 columns)
y: numpy array of float64, shape (61296,)

(There are 2846 variables in the model and I have 61296 measurements of each.)

I run into the following problem in _quantile.py:
"MemoryError: Unable to allocate 28.0 GiB for an array with shape (61296, 61296) and data type float64"

This occurs when np.eye(n_mask) tries to create a 61296 × 61296 identity matrix.

Is a more memory-efficient implementation possible?


File "", line 1, in
z=est.fit(X, y)

File "\sklearn\linear_model_quantile.py", line 217, in fit
-np.eye(n_mask),

File "\site-packages\numpy\lib\twodim_base.py", line 209, in eye
m = zeros((N, M), dtype=dtype, order=order)

MemoryError: Unable to allocate 28.0 GiB for an array with shape (61296, 61296) and data type float64
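As an aside, a minimal illustration of why the dense identity is the bottleneck and how scipy.sparse.eye sidesteps it, assuming the surrounding constraint matrix could also be assembled sparsely; this is a sketch, not a patch to _quantile.py:

```python
from scipy import sparse

n = 61296
# A dense identity would need n * n * 8 bytes, roughly 28 GiB.
print(f"dense: {n * n * 8 / 2**30:.1f} GiB")
# A sparse identity stores only the n diagonal entries.
I = sparse.eye(n, format="csc")
print(f"sparse: {I.data.nbytes / 2**20:.2f} MiB")
```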

@agramfort (Member)

@RPyElec do you know a solver that would handle problems of this size? If so, what optimization method does it use?

@glemaitre (Member)

> I did only list free references, otherwise the book by Koenker (2005) would be THE reference

The book reference could be nice, I think.

@ghost commented May 23, 2021

I'm not an expert on solvers, I'm afraid. I have been running the quantile regression problem above with the implementation here (https://pypi.org/project/asgl/). I have an academic license for MOSEK/Gurobi as the LP solver, but I have also had a go with the free solvers that it comes with (which also work).

(Scikit-learn is better maintained, so it is preferable.)

@glemaitre (Member)

> do you know a solver that would handle problems of this size

Is there some incremental/online solver?

@agramfort (Member) commented May 23, 2021 via email

@lorentzenchr (Member)

@RPyElec It's cool that you already gave this PR a try and provided feedback. Once this PR is merged, there might be room for improvements. In particular, if your problem/feature matrix X is sparse, one could use the linprog methods that support sparse input.

If you urgently need a solution right now, you could try the R package https://cran.r-project.org/package=quantreg. Though I don't know if it will work on your problem.
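To sketch the sparse-linprog idea mentioned above: unregularized quantile regression can be posed as a linear program and passed to scipy.optimize.linprog with sparse constraint matrices via the "highs" method. This is a rough illustration under the standard LP formulation, not the code in this PR:

```python
import numpy as np
from scipy import sparse
from scipy.optimize import linprog

def quantile_lp(X, y, q=0.5):
    """Minimize the pinball loss by splitting residuals into u - v with u, v >= 0."""
    n, p = X.shape
    # objective: q * sum(u) + (1 - q) * sum(v); the coefficients are free
    c = np.concatenate([np.zeros(p), np.full(n, q), np.full(n, 1 - q)])
    # equality constraint: X @ coef + u - v = y
    A_eq = sparse.hstack([sparse.csc_matrix(X), sparse.eye(n), -sparse.eye(n)])
    bounds = [(None, None)] * p + [(0, None)] * (2 * n)
    res = linprog(c, A_eq=A_eq, b_eq=y, bounds=bounds, method="highs")
    return res.x[:p]
```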

@ghost commented May 24, 2021

@lorentzenchr - understood! Looking forward to seeing this get pushed too. I have a more powerful machine I can run the current implementation on (and ASGL too) - so no worries there. I was just flagging the large identity matrices being created as a potential issue.

@agramfort - I'm trying to put something together that I can use as an example and will get back to you.

@ghost commented May 24, 2021

> You managed to make it work with asgl? It seems to use cvxopt/cvxpy, which makes me skeptical that it scales. Can you share more details?
> — @agramfort

Attachment: QR examples.txt

I have attached a simplified QR script which runs scikit-learn's QR and asgl with MOSEK (licence needed) and SCS (free) as LP solvers. It is slow, so I am not sure how well it scales (though it does run within 16 GB of RAM).

If you run with MOSEK, then in asgl.py you will need to add the MOSEK section to _cvxpy_solver_options:

```python
def _cvxpy_solver_options(self, solver):
    if solver == 'ECOS':
        solver_dict = dict(solver=solver,
                           max_iters=self.max_iters)
    elif solver == 'OSQP':
        solver_dict = dict(solver=solver,
                           max_iter=self.max_iters)
    elif solver == "MOSEK":
        import mosek
        solver_dict = dict(solver=solver,
                           warm_start=True,
                           # max_iters=self.max_iters,
                           mosek_params={mosek.iparam.intpnt_solve_form:
                                         mosek.solveform.dual,
                                         # mosek.iparam.num_threads: 1,
                                         })
    else:
        solver_dict = dict(solver=solver)
    return solver_dict
```

(It might be necessary to uncomment the num_threads line depending on your setup).

I would also recommend adding a print statement in the lasso method at the three locations below (again in asgl.py):

```python
# excerpt from the lasso method in asgl.py
            if self.solver == 'default':
                print("Using %s" % self.solver)                          # PRINT 1
                problem.solve(warm_start=True)
            else:
                print("Using %s" % self.solver)                          # PRINT 2
                solver_dict = self._cvxpy_solver_options(solver=self.solver)
                problem.solve(**solver_dict)
        except (ValueError, cvxpy.error.SolverError):
            logging.warning(
                'Default solver failed. Using alternative options. '
                'Check solver and solver_stats for more details')
            solver = ['ECOS', 'OSQP', 'SCS']
            for elt in solver:
                print("Using %s" % elt)                                  # PRINT 3
```

@avidale (Contributor, Author) commented May 24, 2021

@RPyElec @glemaitre I think that for quantile regression it is very easy to implement a naive gradient-descent-based solver that is memory-efficient and relatively fast on large datasets.

The quantile loss is (resid > 0) * resid * q - (resid < 0) * resid * (1 - q), where resid = y - X @ coef, so its antigradient w.r.t. the coefficients is ((resid > 0) * q - (resid < 0) * (1 - q)) @ X, and we can just add this value to the coefficients until the loss converges. The tricky part is choosing the right learning rate, but there are a couple of heuristics that I hope will work well on most datasets.
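A minimal sketch of that scheme; the fixed learning rate and iteration count here are illustrative placeholders, not the heuristics mentioned above:

```python
import numpy as np

def quantile_subgradient_descent(X, y, q=0.5, lr=0.01, n_iter=1000):
    n, p = X.shape
    coef = np.zeros(p)
    for _ in range(n_iter):
        resid = y - X @ coef
        # antigradient of the mean pinball loss w.r.t. the coefficients
        step = ((resid > 0) * q - (resid < 0) * (1 - q)) @ X / n
        coef += lr * step
    return coef
```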

Here is a Colab notebook with my proof-of-concept implementation that trains on a 60,000 x 3,000 dataset in 20 seconds.

If you think that this direction is correct, maybe we can include a solver like this in a future version of QuantileRegressor?

@GaelVaroquaux (Member) commented May 24, 2021 via email

@agramfort (Member)

@RPyElec it would make our lives easier if you shared code snippets, e.g. in https://gist.github.com/, so we have full control if needed, and if you could share a branch from your fork of the asgl project so we don't have to apply the patch manually.

@avidale as @GaelVaroquaux said above, it's a non-smooth problem. What you describe is a "sub-gradient" descent, which can be quite slow (the theoretical rates are known). You could add your solver to https://github.com/benchopt/benchmark_quantile_regression to actually compare the solvers objectively. WDYT?

@glemaitre (Member)

> You could add your solver to https://github.com/benchopt/benchmark_quantile_regression to actually compare the solvers objectively. WDYT?

This seems a good idea.

Now, my question is: do we want to merge the current version, with solvers that work reasonably well for in-memory problems, or do we also want to benchmark solvers optimized for online learning on these offline problems? In short, do we merge now and improve the estimator afterwards, or would doing so put us in trouble?

@lorentzenchr (Member)

+1 for merging now (or else I go crazy - to share my feelings, too:smirk:). linprog is a solid approach. Let's investigate room for improvement (which I'm very interested in, don't get me wrong) after having a solid solution in place => merge.

@glemaitre merged commit c1cc67d into scikit-learn:main on May 25, 2021
@glemaitre (Member)

OK, so LGTM. I will open a subsequent issue to address the points raised in the discussion.

@agramfort (Member)

🍻 🎉

@glemaitre (Member)

Thanks to all contributors

@atrettin

Hi! The QuantileRegressor is already very neat, but it can be made even nicer by applying the kernel trick to support more functional forms (in particular, the 'rbf' kernel). So I did that in #23153! I thought I'd mention it here, since people who are interested in quantile regression might see it. I'd appreciate feedback!

@glemaitre (Member)

Open an issue when you make a feature request; a closed or merged issue/PR will not attract any attention.

On the topic, it might be better to build a pipeline with a Nystroem transformer followed by a QuantileRegressor. It will scale better.
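A minimal sketch of that suggestion; the kernel and component count are illustrative choices, not recommendations from this thread:

```python
from sklearn.kernel_approximation import Nystroem
from sklearn.linear_model import QuantileRegressor
from sklearn.pipeline import make_pipeline

# approximate an RBF feature space, then fit the linear quantile model in it
model = make_pipeline(
    Nystroem(kernel="rbf", n_components=300, random_state=0),
    QuantileRegressor(quantile=0.9, alpha=1e-4),
)
# usage: model.fit(X_train, y_train); model.predict(X_test)
```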

@atrettin

> Open an issue when you make a feature request; a closed or merged issue/PR will not attract any attention.
>
> On the topic, it might be better to build a pipeline with a Nystroem transformer followed by a QuantileRegressor. It will scale better.

Well, it already got more than zero engagement 😅! I was not aware of this, so should I make an issue for the PR as well? Also, thanks for the tip about the transformer, I'll check it out! On the other hand, maybe L2 regularization would be more desirable? The QuantileRegressor only does L1; support vector regression does L2.

Labels: Moderate, module:linear_model, New Feature