PERF: Fast check on equivalent arrays in PyArray_EQUIVALENTLY_ITERABLE_OVERLAP_OK #21464

eendebakpt · 2022-05-06T11:45:50Z

PyArray_EQUIVALENTLY_ITERABLE_OVERLAP_OK currently runs through all of its checks even when the two arrays are identical, which is a common case for in-place operations. This PR adds a fast check for whether the two arrays being compared are identical.

Benchmark

import time

import numpy as np

x = np.random.rand(10, 20)
niter = 200_000

# Time repeated in-place calls, where the input and the output are the same array
t0 = time.perf_counter()
for ii in range(niter):
    x = np.cos(x, out=x)
dt = time.perf_counter() - t0
print(dt)

Results (elapsed time in seconds):

main: 0.51
PR: 0.39

Addresses one of the items in #21455

eendebakpt · 2022-05-07T17:34:32Z

The stride1 != 0 part of the condition is there to catch the following case:

    import numpy as np
    x = np.array(2)
    np.add(x, [1], x)
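
For reference, a small sketch of what this call does at the Python level. With a 0-D `x` as the output and a one-element list as the second input, the inputs broadcast to shape `(1,)`, which cannot be written into a 0-D output, so NumPy is expected to raise `ValueError`. The exact error message is not checked here, since its wording may differ between versions.

```python
import numpy as np

x = np.array(2)          # 0-D array, shape ()
try:
    np.add(x, [1], x)    # inputs broadcast to shape (1,), but out has shape ()
    raised = False
except ValueError:
    raised = True

print(x.shape, raised)
```

The fast path has to leave this case to the regular checks, otherwise the output broadcasting would not be rejected.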

seberg · 2022-05-07T18:14:13Z

I am now wondering if it may also influence the case of (I admit, I hope nobody does this!):

arr = np.array([5])
arr = np.broadcast_to(arr, 10)
arr.flags.writeable = True

np.add(arr, 0, out=arr)

But probably only interesting for a potential test...

I think I would remove the assert here again; it will lead to hard crashes anyway ;). I am also wondering whether just repeating that "macro" call might read more easily, but that's just styling now.

eendebakpt · 2022-05-07T19:04:59Z

> I am now wondering if it may also influence the case of (I admit, I hope nobody does this!):
>
>     arr = np.array([5])
>     arr = np.broadcast_to(arr, 10)
>     arr.flags.writeable = True
>
>     np.add(arr, 0, out=arr)
>
> But probably only interesting for a potential test...

Output of your example is the same for main as for this PR: an array [5 5 5 5 5 5 5 5 5 5].
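
The example could be turned into a regression test roughly along these lines. This is only a sketch: the test name is hypothetical, and it assumes NumPy still allows the `writeable` flag to be set on the result of `np.broadcast_to`.

```python
import numpy as np

def test_inplace_add_on_broadcast_output():
    # Hypothetical test name; mirrors the example quoted above.
    arr = np.broadcast_to(np.array([5]), 10)   # read-only view with stride 0
    assert arr.strides == (0,)
    arr.flags.writeable = True                 # deliberately unsafe

    np.add(arr, 0, out=arr)                    # same object as input and output
    assert arr.tolist() == [5] * 10

test_inplace_add_on_broadcast_output()
```

Adding 0 keeps the expected result independent of how often the single underlying element is written.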

seberg · 2022-05-09T09:10:34Z

numpy/core/src/common/lowlevel_strided_loops.h

+
+    if (arr1 == arr2 && stride1 !=0) {
+        // case common for inplace operations
+        return 1;


PyArray_TRIVIAL_PAIR_ITERATION_STRIDE returns 0 for 1-sized arrays (e.g. 0-D), so we miss those cases here. OTOH, 0-D arrays are (currently) pretty rare.

Maybe we should just add that logic and move it into the return? When (arr1 == arr2), we can return size <= 1 || stride != 0; (solve_may_share_memory really does nothing in the whole arr1 == arr2 branch.)
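
To compare the condition in the diff with the variant suggested here, a small pure-Python model may help. It is an illustration only, not NumPy's C code: `trivial_stride`, `merged_fast_path` and `suggested_fast_path` are hypothetical helpers, and the stride rule is reduced to the single property stated in this thread (1-sized arrays report stride 0).

```python
def trivial_stride(size, real_stride):
    # Model of the property described above: 1-sized arrays report stride 0.
    return 0 if size == 1 else real_stride

def merged_fast_path(same_array, size, real_stride):
    # Condition as written in the diff: arr1 == arr2 && stride1 != 0
    return same_array and trivial_stride(size, real_stride) != 0

def suggested_fast_path(same_array, size, real_stride):
    # Suggested variant for the arr1 == arr2 branch: size <= 1 || stride != 0
    stride = trivial_stride(size, real_stride)
    return same_array and (size <= 1 or stride != 0)

cases = {
    "in-place, 200 elements": (True, 200, 8),
    "in-place, 1 element": (True, 1, 8),
    "in-place, broadcast (stride 0)": (True, 10, 0),
    "different arrays": (False, 200, 8),
}
for name, args in cases.items():
    merged = merged_fast_path(*args)
    suggested = suggested_fast_path(*args)
    print(f"{name}: merged={merged}, suggested={suggested}")
```

In this model `False` only means "fall through to the slower checks". The two conditions differ for the 1-element in-place case, which the suggested variant would also short-circuit.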

Another thought is to use PyArray_DATA(arr1) == PyArray_DATA(arr2): we need the data anyway, and it would also allow catching views (it may also help subclasses, for example).
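
The difference between the two comparisons can be seen from Python: a view is a different object than its base but starts at the same address. The sketch below uses `__array_interface__["data"][0]` as a Python-level stand-in for `PyArray_DATA`; `data_ptr` is a hypothetical helper.

```python
import numpy as np

def data_ptr(a):
    # Address of the first element, as exposed by the array interface.
    return a.__array_interface__["data"][0]

x = np.random.rand(10, 20)
v = x.view()                     # new array object on the same buffer

same_object = v is x             # the identity check misses this pair
same_data = data_ptr(v) == data_ptr(x)
print(same_object, same_data)

expected = np.cos(x)             # reference result computed out-of-place
np.cos(x, out=v)                 # still an in-place operation on x's data
```

Here `same_object` is `False` while `same_data` is `True`, so only the pointer comparison would recognize this call as in-place.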

I agree with the change! I am just still a bit curious if it would not make more sense to push it into solve_may_share_memory so that it is used everywhere.

Another point is that the stride1 != 0 check here doesn't really seem right unless we care about my weird example; the check here really protects us from incorrect shape checking earlier...

Anyway, if nothing moves soon, I may just add a brief comment and merge then. But if you have an idea or think moving it is a good idea please do. Otherwise, the only thing would be style nitpicks (blank line and space after !=).

I guess it depends on the use cases of interest (how often they occur and whether they are time critical). E.g. PyArray_DATA(arr1) == PyArray_DATA(arr2) will catch more cases, but perhaps it does require additional checks (e.g. on the stride or other properties?).
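
One concrete case where equal data pointers alone would not be enough: two views can start at the same address yet step through memory differently. A minimal sketch, again using `__array_interface__` as a Python-level stand-in for `PyArray_DATA`:

```python
import numpy as np

x = np.arange(10.0)
a = x[:5]        # elements 0, 1, 2, 3, 4
b = x[::2]       # elements 0, 2, 4, 6, 8

same_start = (a.__array_interface__["data"][0]
              == b.__array_interface__["data"][0])
print(same_start)              # both views start at x[0]
print(a.shape == b.shape)      # same shape
print(a.strides, b.strides)    # but different strides
```

A pointer-based fast path would therefore presumably also have to compare strides (and shapes) before returning early.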

Maybe open an issue with label 'good first issue' to further investigate ideas like moving the check into solve_may_share_memory or using return size <= 1 || stride != 0?


seberg · 2022-05-11T08:46:27Z

The code clutter is minimal now, so I am OK with this. But can you confirm that you think it is still worthwhile?

seberg · 2022-05-11T09:03:51Z

Wait, I posted this on the wrong PR... I meant to ask for the int change one.


seberg · 2022-05-14T09:27:42Z

@eendebakpt thanks! Let's put this in. I added a (longish) comment on the weird subtlety that the stride check seems to implicitly reject output broadcasting.

eendebakpt marked this pull request as draft May 6, 2022 11:56

eendebakpt marked this pull request as ready for review May 7, 2022 17:34

seberg reviewed May 9, 2022


eendebakpt added 4 commits May 11, 2022 10:15

fast check on equivalent arrays in PyArray_EQUIVALENTLY_ITERABLE_OVERLAP_OK

c6b3b25

fix for case stride1==0

fc8f77e

remove assert statements

dcc242c

whitespace

70e63d0

eendebakpt force-pushed the fast_check_PyArray_EQUIVALENTLY_ITERABLE_OVERLAP_OK branch from dc95b3b to 70e63d0 Compare May 11, 2022 08:15

DOC: Add comment that output broadcasting is rejected by overlap detection

b39e332

That is a bit of a weird dynamic here, and it would probably be nice to clean up. However, it is also not a reason to not add the fast-path right now.

seberg merged commit cfbbde8 into numpy:main May 14, 2022
