ENH: use caching memory allocator in more places #8920


Merged (2 commits, Jul 20, 2017)

Conversation

juliantaylor
Contributor

In particular use it in PyArray_IntpConverter which is used in many
function entrypoints.

@eric-wieser
Member

Does this now mean that PyDimMem_NEW and PyDimMem_FREE are not used anywhere?

@eric-wieser
Member

eric-wieser commented Apr 10, 2017

Also, npy_alloc_cache_dim contains:

/* dims + strides */
    if (NPY_UNLIKELY(sz < 2)) {
        sz = 2;
    }

This no longer strikes me as NPY_UNLIKELY now that we're not passing nd * 2, and it's not all that clear to me why this is here in the first place - is there a problem with allocating 0?

@juliantaylor
Contributor Author

Yes, they should all be gone now.

That (undocumented ...) code exists so that any dimension allocation and free creates a cache entry the array-metadata allocation can use; that allocation always needs one block for the dimensions and one for the strides.
With the cache now being used everywhere, the unlikely is indeed questionable; it should just be a minimum.
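A minimal sketch of the behavior under discussion (this is not NumPy's actual implementation; the names, the one-pointer-per-bucket layout, and `ptrdiff_t` standing in for `npy_intp` are all simplifications):

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical simplified dimension cache: one parked pointer per size
 * bucket, sizes counted in elements (ptrdiff_t standing in for npy_intp). */
#define CACHE_BUCKETS 16

static void *dim_cache[CACHE_BUCKETS];

static void *alloc_cache_dim(size_t sz)
{
    /* Bump to 2 so every cached block can later serve the common
     * "dims + strides" allocation, which needs at least 2 elements. */
    if (sz < 2) {
        sz = 2;
    }
    if (sz < CACHE_BUCKETS && dim_cache[sz] != NULL) {
        void *p = dim_cache[sz];   /* cache hit: pop the parked pointer */
        dim_cache[sz] = NULL;
        return p;
    }
    return malloc(sz * sizeof(ptrdiff_t));   /* miss: plain malloc */
}

static void free_cache_dim(void *p, size_t sz)
{
    if (sz < 2) {
        sz = 2;   /* must mirror the bump in alloc so buckets agree */
    }
    if (sz < CACHE_BUCKETS && dim_cache[sz] == NULL) {
        dim_cache[sz] = p;   /* park the pointer for the next request */
        return;
    }
    free(p);   /* bucket occupied or size too large */
}
```

With the bump in place, freeing a 1-element dims block parks a pointer that the next 2-element dims+strides request can reuse, which is the cache-filling effect described above.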

@eric-wieser
Member

eric-wieser commented Apr 10, 2017

Seems risky to me, as now someone could try and change the strides with:

PyDimMem_FREE(arr->strides);
arr->strides = PyDimMem_NEW(nd);

Which would presumably leave the cache in an invalid state of some kind?

I think we need to change the implementation of PyDimMem_NEW and PyDimMem_FREE to use the cache.

As a result, the cache pointers now need to know their own size, as PyDimMem_FREE does not take that argument.
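One way a size-less PyDimMem_FREE could learn the size is a hidden header in front of the block. This is only a sketch of that option (hypothetical names; not what the PR actually does):

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Sketch: prefix each dimension block with its element count so a
 * size-less free could still return it to the right cache bucket. */
static void *dim_new(size_t nd)
{
    size_t *p = malloc(sizeof(size_t) + nd * sizeof(ptrdiff_t));
    if (p == NULL) {
        return NULL;
    }
    *p = nd;         /* stash the size in the hidden header */
    return p + 1;    /* hand out the payload after the header */
}

static size_t dim_size(const void *dims)
{
    return *((const size_t *)dims - 1);   /* read the header back */
}

static void dim_free(void *dims)
{
    /* a cache-aware version would use dim_size() to pick the bucket */
    free((size_t *)dims - 1);
}
```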

/*
* make sure any temporary allocation can be used for array metadata which
* uses one memory block for both dimensions and strides
*/
Member


Still don't follow here - how do we end up on this code path? To me, it seems like we always passed an even number in the past.

So the only situation is when sz == 0. Why do we allocate any memory at all? Why would you ask for 0 bytes then proceed to write two of them?

Contributor Author

juliantaylor commented Apr 10, 2017


The most important reason for the existence of the cache is the npy_alloc_cache_dim(2 * nd) allocation in PyArray_NewFromDescr_int. In the important cases (small arrays) nd is often one.
So that the cache fills with more values for arrays to use, all other cache paths also allocate at least two entries.
It is probably not a particularly important optimization and could be removed with hardly any impact.

Member


Right, but before this PR that was the only cache path.

So is this really a performance optimization so that 0d arrays preallocate data for 1d arrays? Or is this just working around calling malloc(0), which might return NULL?

Member


On a similar note, we'll be missing the existing cache a lot more here, since dims alone can be odd. Is it worth always looking up in the cache with sz | 1 or something, so that the lookup is always odd?
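The normalization being suggested could look like this (a sketch of the idea only; `cache_bucket` is a made-up name). Every size rounds up to the next odd number before indexing, so an odd dims-only size and the even size just below it share a bucket; the allocator would then have to allocate `sz | 1` elements so the block is large enough for either user.

```c
#include <assert.h>
#include <stddef.h>

/* Map a requested element count to a cache bucket that is always odd,
 * so e.g. sizes 2 and 3 land in the same bucket: 1->1, 2->3, 3->3, 4->5. */
static size_t cache_bucket(size_t sz)
{
    return sz | 1;
}
```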

Contributor Author


Hm, right; when there was only one path, it was premature optimization. I probably planned to do this change a lot sooner ;)
For the cache to miss, one already has to be dealing with 3d arrays, and those are pretty unlikely to be so small that the cache is relevant.
The cache really does not need to be perfect. It just needs to cover the extremely common alloc + immediate dealloc loop, which it does.

@juliantaylor
Contributor Author

The cache is designed so it cannot be put into an invalid state. It caches malloc/free pointers without any metadata, so you can continue to use free on something that comes from the cache.
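A minimal illustration of that property (hypothetical names; one slot rather than NumPy's per-size free lists): because the cache parks plain malloc() results with no header or metadata, a pointer taken from it is always a valid argument to free().

```c
#include <assert.h>
#include <stdlib.h>

static void *slot = NULL;   /* one-slot cache of a raw malloc() pointer */

static void *cache_alloc(size_t n)
{
    if (slot != NULL) {
        void *p = slot;   /* reuse the parked allocation */
        slot = NULL;
        return p;
    }
    return malloc(n);
}

static void cache_free(void *p)
{
    if (slot == NULL) {
        slot = p;     /* park it for the next cache_alloc */
    }
    else {
        free(p);      /* slot taken; the block is ordinary malloc memory */
    }
}
```

Calling plain free() on a cache_alloc() result is harmless; the only cost is that the slot is not refilled, which lowers the hit rate.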

@eric-wieser
Member

eric-wieser commented Apr 10, 2017

so you can continue to use free on something that comes from the cache.

Doesn't this leave it in the cache though, which uses up a slot forever? I guess you're right though, it's not strictly invalid

@juliantaylor
Contributor Author

juliantaylor commented Apr 10, 2017

alloc_cache removes the pointer from the cache; free_cache adds it to the cache.
Using free on a cacheable pointer just means you are not filling the cache back up again. That reduces the cache hit rate: you can starve the cache by taking pointers from it but never putting them back once you no longer need them.
Thus there are a couple of places where cache allocation is not used, because the pointer goes out to the user and we never get it back; using cache allocation in those cases could starve the cache.

@juliantaylor
Contributor Author

PyArray_IntpConverter passes a pointer to the user, so external users can starve the cache with it. But it is used heavily enough in numpy's important functions that this shouldn't be an issue in practice.

@homu
Contributor

homu commented Apr 30, 2017

☔ The latest upstream changes (presumably #8885) made this pull request unmergeable. Please resolve the merge conflicts.

@homu
Contributor

homu commented Jun 1, 2017

☔ The latest upstream changes (presumably #9202) made this pull request unmergeable. Please resolve the merge conflicts.

In particular use it in PyArray_IntpConverter which is used in many
function entrypoints.
Remove unlikely; now that this path is used in more places, it may not be
correct anymore.
@juliantaylor
Contributor Author

Any comments?
If not, I'll merge in the next few days, as this branch is so prone to conflicts.
