
[MRG] Adds FutureWarning changing default solver to 'lbfgs' and multi_class to 'multinomial' in LogisticRegression #10001


Closed

Conversation

thechargedneutron
Contributor

Reference Issues/PRs

Fixes #9997

What does this implement/fix? Explain your changes.

Adds FutureWarning that default of solver and multi_class will be changed to 'lbfgs' and 'multinomial' respectively.

Any other comments?

@thechargedneutron
Contributor Author

@jnothman Is this PR in the right direction? I am not sure if this is what you meant by using 'auto'.

@thechargedneutron thechargedneutron changed the title Adds FutureWarning changing default solver to 'lbfgs' and multi_class to 'multinomial' in LogisticRegression [WIP] Adds FutureWarning changing default solver to 'lbfgs' and multi_class to 'multinomial' in LogisticRegression Oct 31, 2017
@@ -1198,6 +1198,14 @@ def fit(self, X, y, sample_weight=None):
self : object
Returns self.
"""
if self.solver == 'auto':
self.solver = 'liblinear'
warnings.warn("Default solver will be changed to 'lbfgs' in 0.22",
Member

Only for L2 penalty

Member

Maybe instead of 'default', say 'auto'.

But you also now give the user no easy way to silence the warning. We will need a temporary setting like 'default' to manage the warning.
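The "temporary setting" idea can be sketched roughly like this (the class name, the 'warn' sentinel, and the chosen solver strings are illustrative assumptions, not the PR's actual code): a sentinel default lets fit() tell "user left the default" apart from "user explicitly chose a solver", so passing any real solver name silences the warning.

```python
import warnings

# Rough sketch of the sentinel-default deprecation pattern being discussed;
# the class and the resolution rule are placeholders, not scikit-learn code.
class SketchLogisticRegression:
    def __init__(self, solver='warn'):
        self.solver = solver  # 'warn' marks "user did not choose a solver"

    def fit(self):
        solver = self.solver
        if solver == 'warn':
            solver = 'liblinear'  # keep the old behaviour during deprecation
            warnings.warn("Default solver will be changed to 'lbfgs' in 0.22. "
                          "Specify a solver to silence this warning.",
                          FutureWarning)
        return solver
```

Anyone who passes solver='liblinear' explicitly keeps the old behaviour and never sees the warning.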


FutureWarning)
if self.multi_class == 'auto':
self.multi_class = 'ovr'
warnings.warn("Default multi_class will be changed to "
Member

Only when solver is not liblinear

Contributor Author

But when solver is indeed liblinear, multi_class remains 'auto' and thus raises an error. Am I missing something here?

@thechargedneutron
Contributor Author

@jnothman Really sorry, I forgot that I had an open PR here, all because of my semester examinations. I somehow feel that GitHub's search over open PRs does not give the full list :P

@jnothman
Member

jnothman commented Dec 5, 2017

https://github.com/scikit-learn/scikit-learn/pulls/thechargedneutron should help.

You're getting an error in CI:

/home/travis/build/scikit-learn/scikit-learn/sklearn/linear_model/tests/test_logistic.py:245: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
/home/travis/build/scikit-learn/scikit-learn/sklearn/linear_model/logistic.py:1231: in fit
    self.dual)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
solver = 'liblinear', multi_class = 'auto', penalty = 'l2', dual = False
    def _check_solver_option(solver, multi_class, penalty, dual):
        if solver not in ['liblinear', 'newton-cg', 'lbfgs', 'sag', 'saga']:
            raise ValueError("Logistic Regression supports only liblinear, "
                             "newton-cg, lbfgs, sag and saga solvers, got %s"
                             % solver)
    
        if multi_class not in ['multinomial', 'ovr']:
            raise ValueError("multi_class should be either multinomial or "
>                            "ovr, got %s" % multi_class)
E           ValueError: multi_class should be either multinomial or ovr, got auto

@thechargedneutron
Contributor Author

@jnothman Do you think we need to convert 'auto' into the concrete solver inside fit? _check_solver_option fails otherwise. So do I need to copy-paste the same logic there as well, or is there a better way to do it?

@jnothman (Member) left a comment

Again, if you wrote tests, or at least documented the correct behaviour first, with the premise that 'auto' must always work, you would have an easier time getting the logic right, and a green tick to say you had done so.

@@ -1198,6 +1202,14 @@ def fit(self, X, y, sample_weight=None):
self : object
Returns self.
"""
if self.solver == 'auto' and self.penalty == 'l2':
self.solver = 'liblinear'
@jnothman (Member) commented Dec 6, 2017

We don't allow fit to change constructor parameters. You can store the validated parameter as _solver if it should be private, or solver_ if it should be public (assumed useful to users).
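The convention jnothman refers to can be sketched as follows (hypothetical class; the penalty-to-solver mapping is just a placeholder): the constructor argument is stored untouched, and fit() writes its resolved value to a separate attribute.

```python
# Hypothetical sketch of the scikit-learn convention: fit() never overwrites
# self.solver; the resolved choice goes into self._solver (private) or
# self.solver_ (public, trailing underscore) instead.
class SketchEstimator:
    def __init__(self, solver='auto', penalty='l2'):
        self.solver = solver      # constructor params stay as the user set them
        self.penalty = penalty

    def fit(self):
        if self.solver == 'auto':
            # placeholder resolution rule, not the PR's actual logic
            self._solver = 'liblinear' if self.penalty == 'l2' else 'saga'
        else:
            self._solver = self.solver
        return self
```

This keeps get_params/set_params and cloning well-behaved, since constructor parameters round-trip unchanged.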

@thechargedneutron
Contributor Author

@jnothman I guess this is a better implementation and indeed passes tests (except probably a doctest failure).

@thechargedneutron
Contributor Author

Pulled the latest version to resolve conflicts.

@thechargedneutron thechargedneutron changed the title [WIP] Adds FutureWarning changing default solver to 'lbfgs' and multi_class to 'multinomial' in LogisticRegression [MRG] Adds FutureWarning changing default solver to 'lbfgs' and multi_class to 'multinomial' in LogisticRegression Jan 8, 2018
@TomDLT
Member

TomDLT commented Jan 25, 2018

test_logistic_regression_warnings and test_logistic_regression_cv_warnings could also be merged into one test to avoid duplication, using a loop or with pytest:

@pytest.mark.parametrize('model', [LogisticRegression, LogisticRegressionCV])
def test_logistic_regression_warnings(model):
    ...

Same for test_logistic_regression_auto and test_logistic_regression_cv_auto
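Fleshed out with stand-in classes (the real test bodies aren't shown in this thread), the merged parametrized test might look like this — DummyLR and DummyLRCV are placeholders for LogisticRegression and LogisticRegressionCV, and the warning they raise is an assumption about what the duplicated tests were checking:

```python
import warnings

import pytest

# Stand-ins so the parametrization pattern itself is runnable here.
class DummyLR:
    def fit(self):
        warnings.warn("default solver will change", FutureWarning)
        return self

class DummyLRCV(DummyLR):
    pass

@pytest.mark.parametrize('model', [DummyLR, DummyLRCV])
def test_logistic_regression_warnings(model):
    # one body covers both estimators instead of two copy-pasted tests
    with pytest.warns(FutureWarning):
        model().fit()
```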

@thechargedneutron
Contributor Author

@TomDLT Parametrize added.

> Can you refactor it, for instance putting this part in _check_solver_option?

That would require _check_solver_option to return the solver chosen according to the penalty. I think the current implementation is better than returning all those values from the check function. I'll change it if you think returning is the better option.

@TomDLT
Member

TomDLT commented Jan 29, 2018

> I think the current implementation is better than returning all those from the check function.

Why?

But if you prefer you can also merge them into a new function. I just find it redundant and prone to future mistakes.

@jnothman (Member) left a comment

You really need to improve the docs here. I would consider documenting what 'auto' means to be something you'd do before labelling the PR as MRG.

@@ -500,6 +500,7 @@ def logistic_regression_path(X, y, pos_class=None, Cs=10, fit_intercept=True,
number for verbosity.

solver : {'lbfgs', 'newton-cg', 'liblinear', 'sag', 'saga'}
default: 'default'. Will be changed to 'auto' solver in 0.22.
Member

A default of 'default' doesn't mean anything. If you're going to say such a thing, you need to explain what it means. Better to say it is 'lbfgs' by default, even if the string 'default' is used as a placeholder. 'auto' needs to be added to the list of options and described.

@@ -587,6 +589,19 @@ def logistic_regression_path(X, y, pos_class=None, Cs=10, fit_intercept=True,
.. versionchanged:: 0.19
The "copy" parameter was removed.
"""
if solver == 'default':
solver = 'lbfgs'
warnings.warn("Default solver will be changed from 'lbfgs' "
Member

We should probably only warn if auto would choose a solver other than lbfgs.
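That suggestion can be sketched like this (the _auto_solver rule below is an invented placeholder, not the PR's logic): resolve what 'auto' would pick, keep returning the current default during deprecation, and warn only when the two disagree.

```python
import warnings

def _auto_solver(penalty):
    # invented placeholder for what a future 'auto' might pick
    return 'lbfgs' if penalty == 'l2' else 'saga'

def resolve_solver(solver, penalty):
    # during deprecation, 'default' still means the current default;
    # warn only if switching to 'auto' would actually change behaviour
    if solver == 'default':
        if _auto_solver(penalty) != 'lbfgs':
            warnings.warn("Default solver will be changed from 'lbfgs' to "
                          "'auto' in 0.22", FutureWarning)
        return 'lbfgs'
    return solver
```

With this shape, the common l2 case stays silent and only users whose behaviour would change get warned.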

@@ -1606,7 +1660,24 @@ def fit(self, X, y, sample_weight=None):
-------
self : object
"""
_check_solver_option(self.solver, self.multi_class, self.penalty,
if self.solver == 'default':
_solver = 'lbfgs'
Member

I find it strange to have these leading underscores. This is just a plain old local variable.

@jnothman
Member

jnothman commented Mar 5, 2018 via email

@jnothman
Member

I'm marking this as stalled / help wanted. But maybe we need a core dev to take it on, as it really is bad that we have ovr and intercept regularisation as default.

@amueller
Member

amueller commented Jun 2, 2018

Do we really want L-BFGS as the default, or SAG, or should it depend on the data? I think if we change the default we should change to a solver that's automatic, as we did for PCA, one that is picked according to the penalty.

@agramfort
Member

agramfort commented Jun 2, 2018 via email

@jnothman
Member

jnothman commented Jun 3, 2018

So can someone write down the decision tree of settings we're working towards? SAG and L-BFGS should at least reach the same coefficients given reasonable amounts of data?

I think this should be added as a priority for 0.20.

@jnothman jnothman added this to the 0.20 milestone Jun 3, 2018
@jnothman
Member

jnothman commented Jun 7, 2018

Another question related to this: should LogisticRegressionCV(solver='liblinear') be optimising intercept_scaling as well as C?

@jnothman
Member

@TomDLT, given your previous work in this space, would you mind working on defining appropriate defaults? If fit_intercept=False and (the data is binary or multi_class='ovr'), should we still be defaulting to liblinear?

I also wonder if we can say that for something named 'auto' we give no strong assurances of API stability across versions: we make a weak assurance that with sufficient (i.e. infinite) max_iter, 'auto' should result in convergence to the same solution (but not necessarily the same n_iter_) across versions.

@TomDLT
Member

TomDLT commented Jun 15, 2018

I'll do it

@amueller
Member

Hm, SAG is pretty unstable if you don't scale the data. We could always pre-scale the data, but that seems a bit un-scikit-learn.

@amueller
Member

@TomDLT are you still on this? Otherwise @NicolasHug will take it up. Also: are you at SciPy?

@amueller
Member

So do we just want to change the default of solver as a first step, or do we want to do an automagic 'auto'? It's really not clear to me how 'auto' would work, since good values for tol, max_iter and dual depend on the solver, right? If someone sets dual=True, does that mean they'll always get liblinear? Or do we ignore that unless they set the solver to liblinear?

On the one hand it would be annoying to make a backward-incompatible change twice; on the other hand, getting 'auto' right seems very hairy and probably not doable for 0.20, even without benchmarks.

If we want to run benchmarks, we would need to benchmark at least these cases, right?

- tall data (small n_features, large n_samples)
- wide data (small n_samples, large n_features)
- sparse wide data
- multi_class = ovr, multinomial, or binary
- fit_intercept = True/False

That would be 18 possible regimes already, though maybe the decision tree for the optimum solver doesn't involve all combinations.
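The count checks out; the grid above is just a Cartesian product:

```python
from itertools import product

# the benchmark dimensions listed above
shapes = ['tall', 'wide', 'sparse wide']
multi_class = ['binary', 'ovr', 'multinomial']
fit_intercept = [True, False]

regimes = list(product(shapes, multi_class, fit_intercept))
print(len(regimes))  # 3 * 3 * 2 = 18
```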

> SAG and L-BFGS should at least reach the same coefficients given reasonable amounts of data?

If the data is scaled badly that's probably not gonna happen, I think?

@jnothman
Member

jnothman commented Jul 11, 2018 via email


Successfully merging this pull request may close these issues.

change LogisticRegression default solver to lbfgs and multiclass to multinomial?
5 participants