
ENH: hard-code finfo parameters for known types #8504


Merged: 8 commits merged into numpy:master on Feb 3, 2017

Conversation

matthew-brett
Contributor

Hard-code the MachAr parameters for float64, float32, and 80-bit extended
precision, to save time, and provide a skeleton for more difficult types
such as the double-double on PPC; see
#2669

@matthew-brett
Contributor Author

There's a test error from test_warnings.py in https://travis-ci.org/numpy/numpy/jobs/193586749#L3330. It's telling me I'm not allowed to use warnings.simplefilter('ignore'), which I used here. It seems like a perfectly sensible use to me - why is test_warnings complaining?

@charris
Member

charris commented Jan 20, 2017

Hmm, not sure. @seberg Thoughts?

EDIT: I suspect that there is no guarantee that the warning filter will be properly reset, but exactly why that would be the case here I'm not sure. You could probably use with suppress_warnings() instead of with catch_warnings():, but as both are currently used I'm not clear on when the new version is required.
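
For reference, a minimal sketch of the two patterns being contrasted (illustrative only, not the PR's code):

import warnings
import numpy as np
from numpy.testing import suppress_warnings

# The pattern test_warnings flags: a blanket "ignore" filter inside
# catch_warnings. On Python before 3.4 this can leave state behind that
# hides the warning from later tests (http://bugs.python.org/issue4180).
with warnings.catch_warnings():
    warnings.simplefilter('ignore')
    np.log(np.array(-1.0))        # RuntimeWarning silenced

# The suggested alternative: suppress_warnings filters by category (and
# optionally message) and restores the warning state reliably on exit.
with suppress_warnings() as sup:
    sup.filter(RuntimeWarning)
    np.log(np.array(-1.0))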

@charris
Member

charris commented Jan 20, 2017

    Context manager and decorator doing much the same as
    ``warnings.catch_warnings``.

    However, it also provides a filter mechanism to work around
    http://bugs.python.org/issue4180.

    This bug causes Python before 3.4 to not reliably show warnings again
    after they have been ignored once (even within catch_warnings). It
    means that no "ignore" filter can be used easily, since following
    tests might need to see the warning. Additionally it allows easier
    specificity for testing warnings and can be nested.

So the "ignore" is the problem. Note also that the warning filters are not thread-safe, so they should be avoided outside of tests when possible.

@matthew-brett
Contributor Author

Sure - but how should I do what I'm trying to do if I'm not allowed to ignore the warning inside the context manager block?

@matthew-brett
Contributor Author

OK - found a suppress_warnings incantation that seems to work.

@njsmith
Member

njsmith commented Jan 20, 2017

What's the warning here?

@seberg
Member

seberg commented Jan 20, 2017

Well, to be honest, doing funny warning stuff may make sense for some code, and the check should probably be relaxed to apply only to the tests. On the other hand, I think np.seterr is probably the correct thing here, since this seems to be floating-point stuff (and I think seterr is even thread-safe? frankly, not sure).

@seberg
Member

seberg commented Jan 20, 2017

Or, if the warning makes sense/does not matter, moving the context manager into the tests makes more sense.

@pv
Member

pv commented Jan 20, 2017 via email

@njsmith
Member

njsmith commented Jan 20, 2017

seterr is thread-safe, yeah:

thedict = PyThreadState_GetDict();

@mraspaud

@matthew-brett on ppc64, I get a mantissa (significand) size of 116 bits for long doubles (double-double in this case). But you seem to set it to 105, in (105, 1024, 16) - may I ask why?

@matthew-brett
Contributor Author

OK - I used the errstate context manager instead; that does look better.
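
For context, a minimal sketch of that pattern (illustrative, not the PR's exact code):

import numpy as np

# errstate scopes the floating-point error handling to the block and
# restores the previous state on exit; unlike the warnings machinery,
# this state is kept per thread.
with np.errstate(over='ignore', invalid='ignore'):
    np.float64(1e308) * 10    # overflows to inf, no RuntimeWarning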

@matthew-brett
Contributor Author

@mraspaud - I got the number of digits from the eps value found using np.nextafter(np.longdouble(1), np.longdouble(2)). Actually, the number also makes sense if we take the number of digits to be the sum of the significand digits of the two doubles, minus one for the implicit first bit: 53 + 53 - 1 = 105.
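
A sketch of that derivation (illustrative; the variable names are mine):

import numpy as np

one = np.longdouble(1)
# eps is the spacing between 1.0 and the next representable longdouble.
eps = np.nextafter(one, np.longdouble(2)) - one
# For a binary format with eps == 2**-n, n is the significand size minus
# the implicit leading bit; on PPC double-double this comes out as 105.
nmant = int(round(float(-np.log2(eps))))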

@charris
Member

charris commented Jan 20, 2017

I was thinking of something more along the lines of the build time determination, basically viewing a value as a string and then looking that up. See numpy/core/setup_common.py.
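
Roughly, the idea is the same in both the build-time and run-time variants (a hypothetical sketch, not the code from setup_common.py or this PR):

import numpy as np

def float_signature(ftype):
    # -0.1 has no exact binary representation, so its rounded bit
    # pattern differs between float formats and identifies the type.
    # (A real lookup would also need to normalize byte order.)
    return np.array(-0.1, dtype=ftype).tobytes()

# Hypothetical lookup table keyed on those byte strings.
SIGNATURES = {
    float_signature(np.float64): 'IEEE binary64',
    # ... entries for float32, 80-bit extended, double-double, etc.
}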

@matthew-brett
Contributor Author

Chuck -

a) I think we'd need most of this machinery anyway, to replicate the old finfo behavior. Then the only difference between build-time and run-time detection is the small fragment of code here that detects the eps, huge, and byte-size values and selects accordingly;
b) I can't explain the mechanism, but I believe I have seen instances where the type of longdouble changed at run time - see comment here.

@matthew-brett
Contributor Author

Another somewhat trivial advantage of this approach: finfo is a lot faster. With this PR:

In [2]: time np.finfo(np.float64)
CPU times: user 189 µs, sys: 23 µs, total: 212 µs
Wall time: 218 µs

With 1.12 release wheel:

In [2]: time np.finfo(np.float64)
CPU times: user 12.1 ms, sys: 308 µs, total: 12.4 ms
Wall time: 12.3 ms

Of course, this is a one-time only cost, as the results are cached.
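
The caching is just a per-dtype instance table; a simplified sketch of the pattern (hypothetical class, not finfo's actual code):

_cache = {}

class Info(object):
    def __new__(cls, dtype):
        try:
            return _cache[dtype]
        except KeyError:
            obj = object.__new__(cls)
            # ... one-time parameter discovery would happen here ...
            _cache[dtype] = obj
            return obj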

@matthew-brett
Contributor Author

@mraspaud - also see https://en.wikipedia.org/wiki/Quadruple-precision_floating-point_format#Double-double_arithmetic, which says these have 106 digits in the significand. The numpy figures for significand digits are all N-1, I guess omitting the implicit first bit. Also see: http://mrob.com/pub/math/f161.html (the paper explains that using the sign bit of the second double makes the actual number 107...).

@matthew-brett
Contributor Author

Longdoubles are especially slow in the old implementation:

In [3]: time np.finfo(np.longdouble)
CPU times: user 150 ms, sys: 2.02 ms, total: 152 ms
Wall time: 161 ms
Out[3]: finfo(resolution=1e-18, min=-1.18973149536e+4932, max=1.18973149536e+4932, dtype=float128)

The same command takes 263 µs with the code in this PR.

@mraspaud

@matthew-brett indeed, I was just misled by HDF5 reporting a 116-bit significand...

@matthew-brett
Contributor Author

Chuck - would you consider merging this one, with a hoped-for follow-up to do build-time detection?

@charris
Member

charris commented Jan 24, 2017

@matthew-brett I'll take a look at this later today. I wasn't actually suggesting doing build-time detection, but rather doing the same sort of type detection at runtime. It is/can all be done in Python. But I will need to take a closer look here before deciding whether it is worthwhile.

@matthew-brett
Contributor Author

OK - thanks. If you're unhappy with the float32, float64, float80 detection, I'm happy to drop those, leaving only the PPC longdouble, which does need a fix.

Check that finfo returns somewhat reasonable parameters for all floating
point types.
@matthew-brett
Contributor Author

@mraspaud - can you test the results of:

nosetests path/to/numpy/core/tests/test_getlimits.py

with the current branch?

@matthew-brett
Contributor Author

I think this one is ready now. The detection doesn't cover IEEE float128, hence the lack of a warning when the byte-string detection fails and falls back to the discovery code. If anyone has access to an AIX machine, it would be good to check there - but at least this code falls back to the previous code when it fails to detect a signature, so it shouldn't be worse at detection.

@matthew-brett
Contributor Author

Planning on getting access to a SPARC with actual IEEE 754 128-bit floats to test against.

@mraspaud

mraspaud commented Feb 2, 2017

(h5py)debian@debian8-ppc64el:~$ nosetests numpy/numpy/core/tests/test_getlimits.py 
.............
----------------------------------------------------------------------
Ran 13 tests in 0.062s

OK

if key in _KNOWN_TYPES:
return _KNOWN_TYPES[key]
# Fall back to parameter discovery
return _discovered_machar(ftype)
Member

Might be worth raising a warning here so that we might get reports of new types we should cover.

Member

Maybe include the missing key in the message.
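
A sketch of what that suggestion might look like (hypothetical message text):

import warnings

if key not in _KNOWN_TYPES:
    warnings.warn("Signature %r for %s does not match any known type; "
                  "falling back to parameter discovery" % (key, ftype),
                  UserWarning, stacklevel=2)
    return _discovered_machar(ftype)
return _KNOWN_TYPES[key]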

Warn if we don't have a signature for the float type.
Still needs testing on a platform with IEEE float128, to check that the
MachAr-like parameters are correct.
@pv
Member

pv commented Feb 3, 2017 via email

@charris
Member

charris commented Feb 3, 2017

@pv The C code just writes a structure that is then read and analysed in Python, but is otherwise similar to the current code. nextafter isn't really defined for double-double, so it depends on the library choices, although the libraries are likely to follow the IBM recommendation. If we ever expose the build-time macros, the code here can be modified, but as it stands it should serve as a good check for hardcoding the types. The only thing that worries me a bit is that the -0.1 value is not exactly representable, so the result may depend on the library implementation. OTOH, it is a test of the libraries.

@charris
Member

charris commented Feb 3, 2017

I should say that what gets analysed is the object file compiled from the C code; the Python code looks for the structure in that file.
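
In outline, that approach might look like this (a hypothetical sketch, not the setup_common.py code):

# C side: embed a recognizable sentinel next to the constant of interest:
#
#     static const struct {
#         char sentinel[8];
#         long double value;
#     } probe = {"NPYPROBE", -0.1L};
#
# Python side: find the sentinel in the compiled object file and read
# the bytes that follow it (padding/alignment would need care in a real
# implementation).
with open('probe.o', 'rb') as f:
    data = f.read()
offset = data.index(b'NPYPROBE') + 8
signature = data[offset:offset + 16]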

@matthew-brett
Contributor Author

Chuck - I believe (though I'm happy to be corrected) that all the floating point implementations specify a unique correspondence between bit pattern and input number (here -0.1). I mean, they guarantee the closest representable number, and there is only one bit pattern for any given representable number. I can see how there might be more than one pattern for double-double (1.0 + 0.01 == 1.1 - 0.09), but I haven't looked into it. The patterns we're getting also match those recorded in the perl configure script I pointed to - here - so I guess we can depend on them; in any case, if they are wrong, all that will happen is that the code will fall back to the original behavior.

@matthew-brett
Contributor Author

Tests pass on SPARC, with IEEE float128. Ready to merge from my point of view.

@charris merged commit a611932 into numpy:master on Feb 3, 2017
@charris
Member

charris commented Feb 3, 2017

In it goes, thanks @matthew-brett .

@charris
Member

charris commented Feb 3, 2017

How do the timings compare?

@matthew-brett
Contributor Author

Timings, macOS Intel, numpy 1.12:

In [3]: time np.finfo(np.longdouble)
CPU times: user 139 ms, sys: 2.43 ms, total: 141 ms
Wall time: 149 ms

This branch:

In [2]: time np.finfo(np.longdouble)
CPU times: user 165 µs, sys: 23 µs, total: 188 µs
Wall time: 195 µs

@ahaldane
Member

@matthew-brett This commit is causing me a bit of trouble as described in this comment.

The problem revolves around the fact that you are calling numpy functions here, before numpy's modules are fully loaded - specifically, you are calling array2string. This causes cyclic import issues.

Do you think we might delay the evaluations here until after numpy modules are fully loaded?
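
One possible shape for that deferral (a sketch only; the actual fix is in the PR linked in the next comment):

import numpy as np

class _LazyStr(object):
    # Defer formatting until first use, by which point numpy's import
    # has completed and array2string is safe to call.
    def __init__(self, value):
        self._value = value
        self._cached = None

    def __str__(self):
        if self._cached is None:
            self._cached = np.array2string(np.atleast_1d(self._value))
        return self._cached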

@matthew-brett
Contributor Author

@ahaldane - does this work? #9113

@tacaswell
Contributor

Just to verify: this went in for 1.13 and did not get backported further?

@mattip
Member

mattip commented Oct 24, 2018

@tacaswell git tag --contains a611932bbcb132 shows 1.13.0 is the earliest tag with this PR.
