[MRG + 2] FIX precision to float64 across the codebase #5375


Merged
1 commit merged into scikit-learn:master from the set_precision branch on Oct 19, 2015

Conversation

raghavrv
Member

@raghavrv raghavrv commented Oct 9, 2015

Implementing #5356 (comment)

@ogrisel @giorgiop Does this look okay?

@raghavrv raghavrv force-pushed the set_precision branch 2 times, most recently from 7a6f80c to 37794ea Compare October 9, 2015 13:22
@amueller
Member

amueller commented Oct 9, 2015

Did you just replace all float32 with float64? Some parts of sklearn work in float32, right? Or not any more? The trees and the SVM do, IIRC. So in some places we want float32.

@raghavrv
Member Author

I've changed:

  • all float to float64;
  • all float32 in some tests to float64 (under the assumption that if a test passes for float64 it should pass for float32 too, right?);
  • all float32 in non-test code is left unchanged.

Are these the proper things to do?

],
'continuous-multioutput': [
np.array([[0, .5], [.5, 0]]),
np.array([[0, .5], [.5, 0]], dtype=np.float32),
Member

Tests that check for np.float32 should be left unchanged.

@ogrisel
Member

ogrisel commented Oct 12, 2015

all float32 in some tests to float64 (under the assumption that if it passes in float64 it should pass for float32 too right?)

No, this is a wrong assumption. 32-bit floats are less precise than 64-bit floats, and some of our functions and class methods might use the dtype of the input to select the precision level of their internal data structures. Please leave the np.float32 tests unchanged.
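
A minimal sketch (not from this PR, and not scikit-learn API) of the kind of dtype-dependent behaviour described above: when an estimator keeps float32 input in float32 for its internal buffers, the float32 tests exercise a genuinely lower-precision code path than the float64 ones.

import numpy as np

def _working_array(X):
    # Hypothetical helper: keep the caller's float32, otherwise fall back to
    # float64 for the internal working array.
    X = np.asarray(X)
    dtype = np.float32 if X.dtype == np.float32 else np.float64
    return X.astype(dtype, copy=False)

X32 = np.array([[0, .5], [.5, 0]], dtype=np.float32)
X64 = np.array([[0, .5], [.5, 0]])

assert _working_array(X32).dtype == np.float32  # lower-precision path preserved
assert _working_array(X64).dtype == np.float64  # default 64-bit path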

@raghavrv
Member Author

Thanks for clarifying that!! I'll revert in a minute :)

@ogrisel
Member

ogrisel commented Oct 12, 2015

Note that as far as I know np.dtype(np.float) always yields np.float64 on all our supported architectures (even 32-bit Python), but it's more explicit to use np.float64 directly in our code instead of relying on potentially platform-specific behavior.
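
An illustrative way to check that claim on a given platform (note that the np.float alias has since been removed from NumPy, so the builtin float stands in for it here):

import numpy as np

# Python's builtin float (which np.float used to alias) maps to the 64-bit dtype.
assert np.dtype(float) == np.dtype(np.float64)
print(np.dtype(float))  # float64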

@raghavrv raghavrv force-pushed the set_precision branch 2 times, most recently from 31d1de0 to 1cc1dee Compare October 12, 2015 11:22
@raghavrv
Member Author

@ogrisel I've reverted all float32 --> float64 changes in the tests... Could you give this a go now?

@@ -1451,7 +1451,7 @@ class OneHotEncoder(BaseEstimator, TransformerMixin):
>>> enc = OneHotEncoder()
>>> enc.fit([[0, 0, 3], [1, 1, 0], [0, 2, 1], \
[1, 0, 2]])  # doctest: +ELLIPSIS
- OneHotEncoder(categorical_features='all', dtype=<... 'float'>,
+ OneHotEncoder(categorical_features='all', dtype=<... 'numpy.float64'>,
Member

Wait, so the type is the same but the string representation is different? That is somewhat confusing, isn't it?

Member Author

As far as my understanding goes, numpy.float is an alias of Python's default float, while numpy.float64 is a scalar type defined by numpy...
(This answer seems to explain it, especially the comment below it!)
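
An illustrative check of that distinction (not part of the PR diff), which is also why the doctest repr changed above; again the builtin float stands in for the removed np.float alias:

import numpy as np

print(float is np.float64)   # False: two different types...
print(repr(float))           # <class 'float'>
print(repr(np.float64))      # <class 'numpy.float64'>
# ...even though both describe the same 64-bit dtype (see the note above).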

@amueller
Member

lgtm

@amueller amueller changed the title [MRG] FIX precision to float64 across the codebase [MRG + 1] FIX precision to float64 across the codebase Oct 12, 2015
@raghavrv
Member Author

@ogrisel could you take a look? :)

@glouppe
Contributor

glouppe commented Oct 19, 2015

Looks good to me too. +1 for merge.

@glouppe glouppe changed the title [MRG + 1] FIX precision to float64 across the codebase [MRG + 2] FIX precision to float64 across the codebase Oct 19, 2015
glouppe added a commit that referenced this pull request Oct 19, 2015
[MRG + 2] FIX precision to float64 across the codebase
@glouppe glouppe merged commit 008adf1 into scikit-learn:master Oct 19, 2015
@raghavrv
Member Author

@glouppe Thanks :)

@raghavrv raghavrv deleted the set_precision branch October 19, 2015 09:20