np.float128 is a confusing name #10288


Open

eric-wieser opened this issue Dec 28, 2017 · 35 comments

Comments

@eric-wieser (Member) commented Dec 28, 2017

It implies the IEEE 754R 128-bit float, but in practice is typically whatever long double is on the platform, which #10281 shows can sometimes be other types.

@seberg (Member) commented Dec 28, 2017

I wouldn't be against adding a DeprecationWarning and then making it a VisibleDeprecationWarning a bit later, without any real plan to remove it, just to educate users that longdouble (and possibly checking its size) is really much more to the point (mostly for education, but at some point it might open up other things, or somewhat free up the name for a drop-in replacement in some sense).

Even then, if we do it, we might want to help out/check upstream a bit before doing it (or at least when they complain/notice it in their test suites).

@njsmith (Member) commented Dec 28, 2017 via email

@ahaldane (Member) commented:

There's even np.float80, which we carefully account for in many places. However, I'm not yet aware of any platform where it can actually exist; Intel has only ever used float96 (i386) or float128 (x86_64) for storage, as I understand.

@charris (Member) commented Dec 28, 2017

> Intel has only ever used float96 (i386) or float128 (x86_64) for storage, as I understand.

It comes down to 4- or 8-byte alignment for the 10-byte number. The first is on 32-bit systems, the second on 64-bit systems. I think that is pretty standard.

EDIT: (3 × 4 bytes) × 8 = 96 bits > 80; (2 × 8 bytes) × 8 = 128 bits > 80.

@mhvk (Contributor) commented Dec 30, 2017

I've been fooled by that. It would seem better if the number of bits in the name reflected the number actually used in the computation rather than the storage size (which one can already get via dtype.itemsize or dtype.alignment). Even better would be to actually have a true float128, even if slow.
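For concreteness, a minimal check of that mismatch (assuming an x86-64 Linux build, where np.float128 is the 80-bit Intel extended type padded to 16 bytes; the exact numbers are platform-dependent):

import numpy as np

ld = np.dtype(np.longdouble)            # the type spelled np.float128 on this platform
print(ld.itemsize)                      # 16 -> the "128" in the name is storage bits
print(ld.alignment)                     # 16 on x86-64 (4 on i386, where itemsize is 12)
print(np.finfo(np.longdouble).nmant)    # 63 -> only an 80-bit format is actually used
print(np.finfo(np.longdouble).eps)      # ~1.08e-19, vs ~1.9e-34 for IEEE binary128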

@aragilar (Contributor) commented Jan 9, 2018

A suggestion would be to explicitly reserve the float(32,64,128,...) names for the corresponding IEEE 754 floating point types (there appears to be work to add this to C, which gcc and glibc support). It would make sense to add a visible generic warning (not deprecation) using float128/96 when it corresponds to a non-IEEE 754 type.

I've got a branch https://github.com/aragilar/numpy/tree/add_FloatN_new which adds the binary128 type (a.k.a. float128 or quad), using the new C types or libquadmath (it includes binary32/64 so I could check whether problems were due to conflicts between binary128 and longdouble or to introducing new types; I plan on dropping them before submitting the PR). Support for non-gcc/glibc systems (or for those who want to avoid libquadmath) will need to be added (it's on my todo list, but unlikely to happen before May), and there are some bugs with complex support which I haven't fixed (I don't need complex support for the project I'm working on, which is why I haven't fixed the bugs). I plan to submit the PR and send out an email on the mailing list about the change in May (as I'm submitting my PhD in April), but if someone wants to finish cleaning up the code they can go ahead. I'll submit some of the helper code that I wrote as separate PRs, but I suspect that because of how much code adding quad touches, it'll need to be one big patch.

@charris (Member) commented Jan 9, 2018

@aragilar Good to hear. The coming of true float128 is going to happen, or rehappen -- VAX Fortran and SPARC had it -- and it would be good to get ahead of the curve. What is missing from the high precision types is BLAS and LAPACK support, but I suspect that will also come along at some point. Hopefully the system libraries will support the usual sin, cos, and so forth.

I would also like to be able to use true float128 for time; it would solve a lot of problems ...

@charris (Member) commented Jan 9, 2018

There is an old discussion (but not the oldest) here. One of the suggestions there is to rename the extended float types float80_96 and float80_128. That omits IBM double double, however. I think we may just want to introduce a new quad type to avoid the hassle of deprecating current float128 uses.

@njsmith (Member) commented Jan 9, 2018

The guarantee for float128 has always been "well, it's something, it definitely takes 128 bits of memory, but who can say what the precision is?". So mayyyyybe we could just swap it to IEEE754 quad?

@matthew-brett (Contributor) commented:

I've hated the name for ages, as y'all may know, but there are surely people using float128 out there, and not suffering a massive performance penalty, because in fact they're getting 80-bit float, and maybe even 64 bit float computation. If we switch them so they are getting full IEEE 128 bit without due warning, that may cause some shocks.

My vote is still to deprecate float128 (if that's possible), and give IEEE 128-bit another name, maybe float128ieee or float128ie3. I worry that quad sounds compiler-dependent - I guess it isn't? I'd be perfectly happy to drop the float96 / float128 names in favor of longdouble.

@seberg (Member) commented Jan 9, 2018

Give it a FutureWarning for at least a release, better a year, and then make the "just switch it because there was no real guarantee" argument, maybe. Just switching right away on the basis of "there was no guarantee" seems unnecessary.
IIRC FutureWarnings are always visible, so it would also serve as a UserWarning for people expecting IEEE 754 quad.
If we implement the dtype earlier (than the finished FutureWarning cycle), we will have the quad name as an alias in any case and can switch over later.

EDIT: I guess float96 should be deprecated, though it can be given more time and is probably much less used anyway.
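A hypothetical sketch of how such an alias-level FutureWarning could be wired up with a module-level __getattr__ (PEP 562); this is not actual NumPy code, and the mapping below is assumed purely for illustration:

import warnings
import numpy as np

_ALIASES = {"float128": np.longdouble}   # hypothetical mapping for the legacy name

def __getattr__(name):
    # Warn whenever the legacy alias is looked up, but keep returning it.
    if name in _ALIASES:
        warnings.warn(
            f"{name} is the platform long double, not IEEE binary128; "
            "consider np.longdouble instead",
            FutureWarning, stacklevel=2)
        return _ALIASES[name]
    raise AttributeError(f"module has no attribute {name!r}")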

@matthew-brett (Contributor) commented:

About the guarantee - float128 / float96 has always been longdouble - so no guarantee what that is across platforms / compilers, but an implicit, never-broken assumption within a platform / compiler. And even across compilers, in practice it's fixed for a given platform (there was a difference between MSVC and certain mingw builds for a while, but those mingw builds are really uncommon and have been for a while).

Following the rule that we are allowed to break old code with suitable warnings, but not change the behavior of old code silently (unless it was a bug) - I think we should not re-use float128 for something different. OK, it's a precision change, but it may also be a huge performance change, which will be surprising.

@pv (Member) commented Jan 9, 2018 via email

@mhvk (Contributor) commented Jan 9, 2018

The numpy names are relatively easy, but what do we do with the corresponding type strings? Ideally, we can use dtype('f16') to always give quad precision.
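For reference, on current NumPy the 'f16' typestring simply resolves to the platform longdouble (a quick check, assuming a build where long double is stored in 16 bytes):

import numpy as np

print(np.dtype('f16'))                              # dtype('float128')
print(np.dtype('f16') == np.dtype(np.longdouble))   # True on such a build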

@ahaldane (Member) commented Jan 9, 2018

The more official names of the IEEE types are "binary32", "binary64", "binary128". I don't see the word "quad" used in the IEEE 754 doc: http://ieeexplore.ieee.org/document/4610935/.

By the way, in my draft code to fix the IBM float128 printing (which I haven't put up yet) I'm currently using suffixes like "IEEE_binary64" for these types. For other types I am using "Intel_extended96", "Intel_extended128", "Motorola_extended96", "IBM_double_double" and so on. I may need to be more precise for the IBM types, since IBM also supports something that might be called "IBM_hexadecimal64", and "IBM_hexadecimal_double_double".

@charris (Member) commented Jan 9, 2018

> what do we do with the corresponding type strings?

I think that the one letter typestrings will need to be replaced at some point.

@eric-wieser (Member, author) commented:

Could perhaps add an np.floats namespace to contain all the various float types, rather than cluttering the main namespace.

@eric-wieser (Member, author) commented:

> what do we do with the corresponding type strings?

f8[ieee] or similar? There's already precedent for parametrized typestrings (datetime64 uses 'M8[ns]'), and having the size in the typestring is useful for inspecting struct offsets.
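To illustrate why keeping the byte size visible in the typestring matters, a small sketch (the field names here are arbitrary):

import numpy as np

# With the size in the typestring, struct offsets are easy to read off:
dt = np.dtype([('a', '<f4'), ('b', '<f16')])
print(dt.itemsize)           # 20 -> 4 + 16 bytes, packed
print(dt.fields['b'][1])     # 4  -> offset of 'b' follows directly from '<f4'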

@njsmith (Member) commented Jan 9, 2018

Type strings may have also ended up in .npy files. What happens right now if you make an .npy file with np.float128 values in it?

@mhvk (Contributor) commented Jan 10, 2018

np.save('a.npy', np.arange(2., dtype=np.float128))
!head -1 a.npy
...{'descr': '<f16', 'fortran_order': False, 'shape': (2,), }
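So the typestring does end up in the .npy header, and '<f16' there means "this platform's 16-byte float", not necessarily IEEE binary128. For completeness, a sketch of reading the header back with NumPy's public format helpers (assuming the version 1.0 format written above):

import numpy as np

with open('a.npy', 'rb') as f:
    version = np.lib.format.read_magic(f)                            # e.g. (1, 0)
    shape, fortran_order, dtype = np.lib.format.read_array_header_1_0(f)
print(dtype)   # dtype('float128') on the machine that wrote the file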

@mhvk (Contributor) commented May 30, 2019

Cross-linking to #7647, which asked explicitly for float128 support.

@aragilar - above you wrote you had an implementation. I'd love to see that used!

The comments above seem to have been mostly about names. My 2¢ is to just replace float128 by the version that fully uses that precision - the main argument against is that some code may become slow, which I think is a small price to pay for ending the confusion we have right now.

@aragilar (Contributor) commented:

I had half of an implementation (in that it supported Linux, not Windows): I wrapped quadmath/glibc's _Float128, which needs to be rebased. I looked at finding support for other compilers (icc is really poorly documented in this respect); there is http://www.jhauser.us/arithmetic/SoftFloat.html which provides different precisions and is BSD licensed. It doesn't provide trig or special functions though, so those will require implementation (there exists code to do this floating around on the internet, e.g. http://collaboration.cmc.ec.gc.ca/science/rpn/biblio/ddj/Website/articles/CUJ/1996/9602/prince/prince.htm, but the bigger challenge will be finding implementations licensed in such a way that numpy can use them).

My PhD is still ongoing, so I won't have time in the foreseeable future to get this merged. But if people want to make it easier to get quad precision (or other floating-point types, such as double-double or quad-double) working in numpy, a good first step would be moving/rewriting/rearranging the type code so that there's a clear divide between native types (float, double, long double) and size/format-based types (float32, float64), and between where each is used (they're currently mixed everywhere); that mixing made conversion and type checking more difficult than it needed to be.

@seberg (Member) commented May 31, 2019

Frankly, if this is tricky to get working on most platforms, or to get fairly feature complete, it might be a better option to start developing such a dtype outside of numpy proper. This should already be reasonably possible with the currently available framework, which will hopefully get more powerful in the foreseeable future.

@mhvk (Contributor) commented May 31, 2019

@seberg - I fear you may well be right, but sadly that probably will mean we won't have it for another decade...

This is partially why I was suggesting replacing float128 outright - i.e., stick with the inconsistent state we're in, but, for one compiler and processor/OS at a time, move towards treating them as proper quad precision. This is much easier than working outside numpy, as one can use the infrastructure already in place. It also doesn't break anybody's code, as it just increases precision, though quite possibly it does make code slower.

And obviously if done as carelessly as I would do it, this would end up getting rid of the long doubles stored in 16 bytes, which one may or may not consider a loss (I guess possibly it could be a compile-time option).

@charris (Member) commented May 31, 2019

I think I started complaining about this around 2008 :) With modern hardware and compiler support coming online, it seems the time is ripe to figure out a solution. My own preference is a quad precision type for the IEEE standard, something like quad128, that is only available when there is support. Theoretically we only support IEEE floats, but I think practicality also requires double double, so maybe add ibm128 as well. Then float128 can simply be considered indeterminate, but will probably settle down to IEEE extended precision over the long term. I think SPARC quad precision is compatible with the IEEE standard, but am not sure about that.

@charris (Member) commented May 31, 2019

One problem will be with our type numbering and single-letter form ("q"), so if we retain that, the type will need more information attached. This case might be worth exploring as part of the new type system design.

EDIT: The problem is not when using float128 on a single machine, it is using it across platforms and in pickled data.

@seberg (Member) commented May 31, 2019

I think the main question right now should be what we do with the current float128 name (or if we do anything with it). We could also just put a DeprecationWarning around everything that spells float128 pointing to the alternative spellings and pointing out that quad128 may be what the user wants.

Developing things outside numpy does not seem too bad to me. It might be slower, but there is no real issue with it (yes the kind char is annoying, also because it only makes much sense for our own types right now in any case). And even if it has quirks, all the ufunc/casting code, the biggest chunk of work, could be merged into numpy at any time.

Of course in either case the question is about priorities and time...

@matthew-brett (Contributor) commented:

My vote would be:

  • float128 with a deprecation warning
  • float80_96 and float80_128 for 80-bit Intel floats, as permanent names
  • quad128 for 128 bit IEEE type
  • float128 removed from namespace at some distant time.

@mhvk (Contributor) commented May 31, 2019

I like the naming suggestions, especially as one goes from np.single to np.double to np.quad (where we could introduce the latter only when almost all systems support it).

That still leaves the string versions to be decided: 'f4' and 'f8' are clear, but what should 'f16' point to? Eventually, or immediately, to quad?

@charris (Member) commented May 31, 2019

This is sounding like the start of an NEP where some of the details can be ironed out.

@seberg (Member) commented May 31, 2019

It also leaves open how tricky it will be to deprecate the actual name ;). I think we could possibly pull off simply not having any f16, at least until true quad becomes standard. These shorthands are nice, but they are not strictly necessary?

Yeah, Chuck is right, we should make this an NEP if we want to continue down the line...

@aarchiba (Contributor) commented:

I have looked into this in the past but have not had the resources to figure out numpy's type system. But the quaternions and the initial 16-bit float implementations demonstrate that you can have new dtypes in importable modules. So a quick start would be to implement IEEE binary128 in a separate package, which would then be available for use immediately.

Such a package could also implement double-double precision based on one of the adequately-licensed libraries that are out there: these are usually faster than software binary128. Some of the libraries also offer a quad-double (that is, the implicit sum of four doubles) for even higher precision at modest additional cost. I suspect double-doubles (or maybe even quad-singles) would be faster on GPU hardware too.
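For a flavor of what double-double means, here is a minimal pure-Python sketch of the error-free "two-sum" building block (the value is kept as an unevaluated sum of two float64s; real libraries build full arithmetic on top of this):

def two_sum(a, b):
    # Knuth's two-sum: s is the rounded float64 sum, err the exact rounding error,
    # so s + err represents a + b exactly.
    s = a + b
    bb = s - a
    err = (a - (s - bb)) + (b - bb)
    return s, err

hi, lo = two_sum(1.0, 1e-20)
print(hi, lo)   # 1.0 1e-20 -- the low word keeps the bits float64 drops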

The stumbling block is really to understand how to implement a new dtype in numpy so that it can work without too many surprises. (For example, losing precision on using np.cos, or on printing.)

I can also add, having tested it on a Raspberry Pi (armhf), that when C long doubles are the same as doubles, np.double==np.longdouble and np.float96 and np.float128 simply don't exist. (I can't guarantee that this is also true on exotic platforms like MSVC, as I have no access to them.)
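A quick way to check which situation a given build is in (a sketch; the printed values depend on the platform ABI):

import numpy as np

print(np.dtype(np.longdouble).itemsize)   # 8 where long double == double, 12 or 16 elsewhere
print(np.finfo(np.longdouble).nmant)      # 52 there, 63 for Intel extended, 112 for IEEE quad
print(hasattr(np, 'float128'))            # False on builds where long double is 8 bytes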

@charris (Member) commented Sep 15, 2019

AArch64 (ARM64) has quad precision, so I expect it will become more common in the not too distant future.

@aarchiba (Contributor) commented:

> AArch64 (ARM64) has quad precision, so I expect it will become more common in the not too distant future.

If I understand correctly, on aarch64 (which apparently the Raspberry Pi has the hardware to run so there are going to be users out there) long double is 128 bits, so (again if I understand correctly) np.longdouble will actually be quad precision without any action on numpy's part. I'm not clear on whether there is hardware support.

@charris (Member) commented Sep 15, 2019

AArch64 has 16 quad precision registers, but I don't know the details of the implementation. It also has support for half precision floats (float16). Looks like Power9 also has support for quad precision.
