Numpy mean fails/gives huge precision issues with large arrays and axis selection #11331

Closed
shachar-i opened this issue Jun 14, 2018 · 5 comments · Fixed by #13737

Comments

@shachar-i

shachar-i commented Jun 14, 2018

On Numpy 1.14.2 I get the following:

import numpy as np

A = np.random.rand(1024, 256, 256, 3) * 255  # similar to a tensor of 1024 256x256 RGB images
print(np.mean(A, axis=(0, 1, 2)))                     # 64 bit works fine
print(np.mean(A.astype(np.float32), axis=(0, 1, 2)))  # 32 bit fails
print(np.mean(A.astype(np.float32)))                  # 32 bit works fine without axis selection

results in:
[127.50656009 127.49165182 127.51390158]
[64. 64. 64.]
127.50413

Even considering float32 precision, this type of failure seems odd, especially given that the entire array's mean can be calculated successfully.

@seberg
Member

seberg commented Jun 14, 2018

Well, numpy is not overly fancy about how it calculates means; there is a mechanism that gives you better-than-naive precision in some cases (which kicks in for the full array here). See also, for example, gh-8116.
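
For illustration, a minimal sketch of that difference (the size is made up, and np.cumsum stands in here for a naive float32 running sum, since it accumulates sequentially):

import numpy as np

x = np.full(40_000_000, 127.5, dtype=np.float32)  # exact mean is 127.5

# Naive sequential accumulation in float32: the running sum eventually grows so
# large that adding 127.5 no longer changes it.
print('naive   :', np.cumsum(x)[-1] / x.size)

# Reducing the whole contiguous buffer uses (mostly) pairwise summation:
print('pairwise:', x.mean())

# float64 accumulator for reference:
print('float64 :', x.mean(dtype=np.float64))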

@shachar-i
Author

Ok. So I get why a naive summation with float32 would do that, but then it was quite confusing to see it succeed without axis selection.
As I understand from your comment, this is due to pairwise summation being used in some of the cases (?)
Is this inconsistent behavior something one should expect?
Why not implement pairwise summation for all situations?
And (maybe this is naive) - is there a way to warn about such drastic rounding errors without a huge performance hit?
(these kinds of bugs are usually hard to pin down)

@seberg
Member

seberg commented Jun 17, 2018

The reason is memory layout, and thus speed. Doing the (mostly) pairwise summation with numpy's typical reduction machinery only works reasonably along a single axis (where it comes with no performance loss at all). It is only feasible when summing along the fast axis, because otherwise others would be complaining about massive performance drops.

I agree that there should be more documentation on this; heck, I was even hesitant when we first put this in... It would also be nice to have more stable summations in general...
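
To make the layout point concrete, a rough sketch using the shape from the original report (constant data, so the exact mean of 127.5 is known):

import numpy as np

A = np.full((1024, 256, 256, 3), 127.5, dtype=np.float32)

# axis=(0, 1, 2) reduces across large strides, so each output element is built
# by naive float32 accumulation and the running sum saturates well below 127.5:
print(A.mean(axis=(0, 1, 2)))

# Copying so each channel's values are contiguous lets the pairwise inner loop
# do the work, at the cost of an extra copy of the array:
B = np.ascontiguousarray(np.moveaxis(A, -1, 0)).reshape(3, -1)
print(B.mean(axis=1))  # ~[127.5 127.5 127.5]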

@omasoud

omasoud commented Dec 8, 2018

I'm glad to see #9393, which will add something to the documentation. But this can be a serious issue that goes unnoticed. A warning would be even more welcome. A fix is of course the ideal situation.

My recommended workaround for people running into this issue is to add dtype=np.float64 to the np.mean() and np.std() calls. Doing that in the code above yields the following result:

[127.48983901 127.50801956 127.49455946]
[127.48983901 127.50801956 127.49455946]
127.4976
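
Concretely, the workaround applied to the original snippet looks like this (a sketch; the data stays float32, only the accumulator dtype changes):

import numpy as np

A32 = (np.random.rand(1024, 256, 256, 3) * 255).astype(np.float32)

# float32 storage, float64 accumulation:
print(np.mean(A32, axis=(0, 1, 2), dtype=np.float64))
print(np.std(A32, axis=(0, 1, 2), dtype=np.float64))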

Another minimalist example that illustrates this issue:

import numpy as np

a = np.random.rand(30 * 1000 * 1000, 2).astype(np.float32) * .01 + 3.9  # 30 million pairs
print('Expected:')
print(' mean: ', np.mean(a, axis=0, dtype=np.float64), '  (analytical = 3.905)')
print('  std: ', np.std(a, axis=0, dtype=np.float64), '  (analytical = .01/sqrt(12) = 0.00288...)')
print('Instead, you get:')
print(' mean: ', np.mean(a, axis=0))
print('  std: ', np.std(a, axis=0))

Outputs:

Expected:
 mean:  [3.9050007  3.90499964]   (analytical = 3.905)
  std:  [0.00288722 0.00288692]   (analytical = .01/sqrt(12) = 0.00288...)
Instead, you get:
 mean:  [2.236962 2.236962]
  std:  [1.4956477 1.4956477]

I expect this situation to be encountered a lot, without being noticed, in neural network data normalization code where statistics over large training sets are computed. People tend to go for single (or half) precision rather than double because of the performance and memory savings, and because successful neural network training rarely requires double precision.
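
For the normalization use case, that looks roughly like this (the shapes and names are made up):

import numpy as np

# Hypothetical float32 training batch: N images, HxW pixels, 3 channels.
images = np.random.rand(2_000, 64, 64, 3).astype(np.float32)

# Per-channel statistics with a float64 accumulator, cast back for training:
mean = images.mean(axis=(0, 1, 2), dtype=np.float64).astype(np.float32)
std = images.std(axis=(0, 1, 2), dtype=np.float64).astype(np.float32)

normalized = (images - mean) / std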

@charris
Member

charris commented Dec 8, 2018

I suppose the warning could be made length dependent.
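
As a user-level sketch of what a length-dependent warning could look like (the wrapper and the threshold below are made up, not anything NumPy actually does):

import warnings
import numpy as np

def checked_mean(a, axis=None, **kwargs):
    # Hypothetical helper: warn when a low-precision reduction is long enough
    # that naive float16/float32 accumulation error may become noticeable.
    a = np.asarray(a)
    if axis is None:
        n = a.size
    else:
        n = int(np.prod([a.shape[ax] for ax in np.atleast_1d(axis)]))
    if a.dtype in (np.float16, np.float32) and n > 1_000_000 and 'dtype' not in kwargs:
        warnings.warn(f'mean over {n} {a.dtype} elements; consider dtype=np.float64')
    return np.mean(a, axis=axis, **kwargs)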

seberg added a commit to seberg/numpy that referenced this issue Jun 7, 2019
Note that this behaviour is of course inherited into `np.add.reduce` and
many other reductions such as `mean` or users of this reduction, such
as `cov`. This is ignored here.

Closes numpygh-11331, numpygh-9393, numpygh-13734