[MRG+1] BUG: reset internal state of scaler before fitting #5416

giorgiop · 2015-10-16T09:09:55Z

Fixes #5408.

giorgiop · 2015-10-16T09:43:44Z

The example, which is not run by CI, works for me now.

lesteve · 2015-10-16T11:33:21Z

The example, which is not run by CI, works for me now.

I just double-checked and this has fixed examples/svm/plot_rbf_parameters.py indeed.

lesteve · 2015-10-16T11:41:50Z

sklearn/preprocessing/tests/test_data.py

+            print("Cannot fit %s a second time with different shape "
+                  "of input. Error message: %s"
+                  % (scaler.__class__.__name__, str(err)))
+            assert False


Maybe you could leave it as a smoke test.

for scaler in scalers: scaler.fit_transform(X) # with a different shape, this may break the scaler unless the internal # state is reset scaler.fit_transform(X_2d)

I am not sure what your try/except gives you. In particular, stdout is hidden by default in nosetests and even if it was not hidden it would very likely be not very obvious to spot because of the exception stack created by assert False.

OK make sense. I will just leave the ValueError to break the test.

lesteve · 2015-10-16T14:35:52Z

sklearn/preprocessing/data.py

+            del self.n_samples_seen_
+            del self.data_min_
+            del self.data_max_
+            del self.data_range_


Is it worth trying to write slightly more generic code, something like:

attributes = [a for a in dir(self) if a.endswith('_')] for attr in attributes: delattr(self, attr)

could go into BaseEstimator even. but not now ;)

amueller · 2015-10-16T15:13:17Z

This is not the style we ususally use, but I guess it is ok. This is arguably a cleaner way than overwriting in partial_fit.

giorgiop · 2015-10-16T15:16:29Z

Any better idea? We may end up doing something similar into other estimators sooner or later.

ogrisel · 2015-10-16T15:22:03Z

I am fine with the explicit _reset as well. We should investigate why test_common did not catch this and fix it.

amueller · 2015-10-16T15:22:48Z

I think it's good. Well, the common tests would be updated here: #3907 I think that includes the right test, but I'd have to double check.

ogrisel · 2015-10-16T15:24:09Z

Indeed. +1 for merge on my side then.

raghavrv · 2015-10-16T15:30:44Z

We may end up doing something similar into other estimators sooner or later.

From my experiments at #3907 I think most estimators do the reset on fit ! There still might be one or two but not more I think... :)

amueller · 2015-10-16T15:31:22Z

yeah but they don't do it in a consistent and nice way.

raghavrv · 2015-10-16T15:32:17Z

Ah okay :)

amueller · 2015-10-16T15:47:24Z

Merging and retouching the docs.

[MRG+1] BUG: reset internal state of scaler before fitting

giorgiop force-pushed the fix-scaler-refit branch from fda4e58 to 6eb1ddc Compare October 16, 2015 09:38

giorgiop force-pushed the fix-scaler-refit branch 2 times, most recently from 5f7a43e to 5ab76c9 Compare October 16, 2015 10:34

giorgiop mentioned this pull request Oct 16, 2015

[WIP] Adding tests for estimators implementing partial_fit and a few other related fixes / enhancements #3907

Closed

6 tasks

lesteve reviewed Oct 16, 2015
View reviewed changes

giorgiop force-pushed the fix-scaler-refit branch from 5ab76c9 to 49a5dbd Compare October 16, 2015 13:52

lesteve reviewed Oct 16, 2015
View reviewed changes

amueller added the Blocker label Oct 16, 2015

amueller added this to the 0.17 milestone Oct 16, 2015

giorgiop changed the title ~~BUG: reset internal state of scaler before fitting~~ [MRG] BUG: reset internal state of scaler before fitting Oct 16, 2015

ogrisel changed the title ~~[MRG] BUG: reset internal state of scaler before fitting~~ [MRG+1] BUG: reset internal state of scaler before fitting Oct 16, 2015

BUG: reset internal state of scaler before fitting

c7b1a6e

giorgiop force-pushed the fix-scaler-refit branch from 49a5dbd to c7b1a6e Compare October 16, 2015 15:29

amueller added a commit that referenced this pull request Oct 16, 2015

Merge pull request #5416 from giorgiop/fix-scaler-refit

4e64915

[MRG+1] BUG: reset internal state of scaler before fitting

amueller merged commit 4e64915 into scikit-learn:master Oct 16, 2015

giorgiop deleted the fix-scaler-refit branch November 3, 2015 12:29

giorgiop restored the fix-scaler-refit branch February 21, 2016 22:31

Uh oh!

[MRG+1] BUG: reset internal state of scaler before fitting #5416

[MRG+1] BUG: reset internal state of scaler before fitting #5416

Uh oh!

Conversation

giorgiop commented Oct 16, 2015

Uh oh!

giorgiop commented Oct 16, 2015

Uh oh!

lesteve commented Oct 16, 2015

Uh oh!

lesteve Oct 16, 2015

Choose a reason for hiding this comment

Uh oh!

giorgiop Oct 16, 2015

Choose a reason for hiding this comment

Uh oh!

lesteve Oct 16, 2015

Choose a reason for hiding this comment

Uh oh!

amueller Oct 16, 2015

Choose a reason for hiding this comment

Uh oh!

amueller commented Oct 16, 2015

Uh oh!

giorgiop commented Oct 16, 2015

Uh oh!

ogrisel commented Oct 16, 2015

Uh oh!

amueller commented Oct 16, 2015

Uh oh!

ogrisel commented Oct 16, 2015

Uh oh!

raghavrv commented Oct 16, 2015

Uh oh!

amueller commented Oct 16, 2015

Uh oh!

raghavrv commented Oct 16, 2015

Uh oh!

amueller commented Oct 16, 2015

Uh oh!

Uh oh!