
Conversation

@siftikha (Contributor) commented Nov 1, 2017

Fixes #10051

Added an optional argument for_partial_fit=False to _validate_params, which bypasses the warnings about max_iter and tol.

Added for_partial_fit=True to the _validate_params calls in the partial_fit methods of both SGDClassifier and SGDRegressor.

I appreciate that this could have been done through the existing set_max_iter=False flag, but it seemed clearer to me to have a dedicated one.

This is my first contribution to this project, so I apologize if I've done something horrifically wrong.
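For illustration, a minimal sketch of the shape of the change (names simplified and the class hypothetical; this is not the actual scikit-learn code):

```python
import warnings


class SGDEstimatorSketch:
    """Hypothetical stand-in for the shared SGD base class."""

    def __init__(self, max_iter=None, tol=None):
        self.max_iter = max_iter
        self.tol = tol

    def _validate_params(self, for_partial_fit=False):
        # partial_fit runs one epoch per call, so the max_iter/tol
        # deprecation warning is irrelevant there and is skipped.
        if self.max_iter is None and self.tol is None and not for_partial_fit:
            warnings.warn("max_iter and tol will change their defaults; "
                          "set them explicitly to silence this warning.",
                          FutureWarning)
        # ... remaining validation unchanged ...

    def fit(self, X, y):
        self._validate_params()                      # may warn
        return self

    def partial_fit(self, X, y):
        self._validate_params(for_partial_fit=True)  # never warns
        return self
```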

@jnothman (Member) commented Nov 1, 2017

You have flake8 errors

@siftikha (Contributor, Author) commented Nov 1, 2017

flake8 issues have been fixed

@@ -538,7 +539,7 @@ def partial_fit(self, X, y, classes=None, sample_weight=None):
         -------
         self : returns an instance of self.
         """
-        self._validate_params()
+        self._validate_params(for_partial_fit=True)
Member (review comment):

I think using set_max_iter=False should suffice

Contributor (Author):

Ok, I will make that change.

Member:

This hasn't been addressed and it seems cleaner not to add another parameter if it's possible.

Member:

It was attempted, but was not trivial. We could have a different value for the parameter, but I figure it's hardly worth the bother for a deprecation.


@@ -1211,6 +1211,9 @@ def init(max_iter=None, tol=None, n_iter=None):
     assert_no_warnings(init, None, 1e-3, None)
     assert_no_warnings(init, 100, 1e-3, None)
 
+    # Test that for_partial_fit will not throw warnings for max_iter or tol
+    assert_no_warnings(init, None, None, None, True)

Member:

Why not test whether partial_fit itself raises a warning? Testing the helper is a bit strange, even if it seems that's already what we're doing...
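An end-to-end version of that test might look roughly like this (a sketch against the 0.19-era API, assuming the assert_no_warnings helper from sklearn.utils.testing; not the test that was merged):

```python
import numpy as np
from sklearn.linear_model import SGDClassifier
from sklearn.utils.testing import assert_no_warnings

X = np.array([[0.0, 0.0], [1.0, 1.0]])
y = np.array([0, 1])

# max_iter and tol are left unset, which would trigger the deprecation
# warning in fit(); partial_fit should stay silent.
clf = SGDClassifier()
assert_no_warnings(clf.partial_fit, X, y, np.array([0, 1]))  # classes positional
```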

@siftikha (Contributor, Author) commented Nov 2, 2017

@jnothman the set_max_iter approach caused other issues: making it work with partial_fit would require non-trivial modifications to other parts of the library, so I've gone back to the for_partial_fit approach.

@siftikha force-pushed the partial_max_iter_tol branch from e255cb7 to 9b624ea on November 2, 2017 at 02:34
@jnothman (Member) commented Nov 2, 2017

What other issues?

@siftikha (Contributor, Author) commented Nov 2, 2017

https://travis-ci.org/scikit-learn/scikit-learn/jobs/296053432 is the result of using just the set_max_iter=False flag with everything else unchanged. The use case for that flag seems to be different from what we are aiming for here.

The issue is that set_max_iter=False also keeps self._tol from being set, which causes failures when partial_fit is run. You could conceivably ensure self._tol is set even with set_max_iter=False, but that would likely break the other cases that rely on set_max_iter=False.

@jnothman (Member) commented Nov 2, 2017 via email

@qinhanmin2014 mentioned this pull request Nov 6, 2017
@siftikha (Contributor, Author) commented Nov 6, 2017

If there is a sufficiently compelling reason to stick to one parameter for both of these cases, I'm happy to do that, but I think such a parameter would need to be renamed from set_max_iter to something more general that describes the validation required.

@jnothman (Member) commented Nov 6, 2017 via email

@siftikha (Contributor, Author) commented Nov 9, 2017

Do I need to add [MRG] to the title for this to be merged?

@siftikha changed the title from "fix for erroneous max_iter and tol warnings for SGDClassifier when using partial_fit" to "[MRG] fix for erroneous max_iter and tol warnings for SGDClassifier when using partial_fit" Nov 9, 2017
@siftikha (Contributor, Author):

@jnothman anything else you need to get this merged?

@jnothman (Member) left a comment

LGTM. We usually require approval from two core devs.

@jnothman changed the title from "[MRG] fix for erroneous max_iter and tol warnings for SGDClassifier when using partial_fit" to "[MRG+1] fix for erroneous max_iter and tol warnings for SGDClassifier when using partial_fit" Nov 13, 2017
@siftikha (Contributor, Author):

@amueller Figured I'd ping you, since you opened the initial issue.

@amueller (Member):

lgtm.

@amueller merged commit f485a9e into scikit-learn:master Nov 15, 2017
@jnothman (Member):

Thanks for contributing, @siftikha!

@amueller (Member):

Hm, this undid #10050 :-/

jwjohnson314 pushed a commit to jwjohnson314/scikit-learn that referenced this pull request Dec 18, 2017
… when using partial_fit (scikit-learn#10053)

* partial fit warnings disabled

* partial fit warnings disabled for regressor

* style improved

* tests added

* pycodestyle passing

* rejiggered format

* fixed style issues