Add a new memleak script that does everything #5360

mdboom · 2015-10-30T15:16:20Z

This replaces our 4 memleak scripts with one that is able to test any
backend, with or without plot content, and with or without interactive
mode.

The calculation of average increase per iteration has been fixed.
Before, it assumed the increase was monotonically increasing, when in
fact it flucuates quite a bit. Therefore, it now calculates the
difference between each pair of results and averages that.

Also, the results are stored in pre-allocated Numpy arrays rather than
Python lists to avoid including the increasing size of the Python lists
in the results.

mdboom · 2015-10-30T15:16:59Z

Not sure how to milestone this. Since it's just a dev utility it can probably go just about anywhere.

Also, I should add since this uses the new tracemalloc module, it is Python 3.x only.

efiring · 2015-10-30T18:10:48Z

unit/memleak.py

+        garbage_arr[i] = garbage
+
+    print('Average memory consumed per loop: %1.4f bytes\n' %
+          (np.sum(rss_arr[starti+1:] - rss_arr[starti:-1]) / float(endi - starti)))


This looks like just the sum of the differences, which is the end value minus the start value.

((rss_arr[-1] - rss_arr[starti]) / float(endi - starti))

I suppose that's true. We need a different mechanism, then -- something that will take into account the spikiness of the data. If you select the start and end points incorrectly here you get wildly different results.

efiring · 2015-10-30T19:13:11Z

Mike, in the top subplot it looks like pymalloc is showing small fluctuations near 13M, and rss is showing small fluctuations around 150k--maybe rss units here are 512b or 1k blocks.

mdboom · 2015-10-30T19:20:03Z

Yes -- rss is in different units. I just haven't gone and implemented that (because the units depend on platform, version of platform etc.) In any case, they should be on different scales, because pymalloc (including only allocations from Python interpreter itself) will always be significantly smaller than rss. The important thing is not their relative sizes but the first derivative anyway.

This replaces our 4 memleak scripts with one that is able to test any backend, with or without plot content, and with or without interactive mode. The calculation of average increase per iteration has been fixed. Before, it assumed the increase was monotonically increasing, when in fact it flucuates quite a bit. Therefore, it now calculates the difference between each pair of results and averages that. Also, the results are stored in pre-allocated Numpy arrays rather than Python lists to avoid including the increasing size of the Python lists in the results.

mdboom · 2015-11-02T16:16:17Z

I've updated this so that the average memory increase is calculated based on the peak memory usage rather than an instantaneous reading. This should get around the problem where the reading seems artificially high if it happens to pick a valley as the start end point.

This has also been updated to use the psutil package rather than our home-grown report_memory function. This has a couple of advantages: The units are all in bytes, regardless of the version of the OS being used (the definition of rss has changed over time). It also allows use to track the number of open file handles, as file handle leaking is also something we have trouble with from time to time.

TST: Add a new memleak script that does everything

tacaswell · 2015-11-05T04:09:02Z

I don't think we have to back-port this to any other branch unless we want to go memory hunting on them.

mdboom added the status: needs review label Oct 30, 2015

mdboom mentioned this pull request Oct 30, 2015

Fix memory leaks found by memleak_hawaii3.py #5359

Merged

efiring reviewed Oct 30, 2015
View reviewed changes

QuLogic mentioned this pull request Oct 31, 2015

Fix Cairo memleak #5372

Merged

mdboom added 5 commits November 2, 2015 11:14

Fix average increase calculation by tracking peaks

6572456

Better message if running on Python < 3.5

7444a98

Fix units as reported by report_memory

457fd94

Use psutil instead of our home-grown utilities

b59627b

mdboom force-pushed the new-memleak-script branch from 6f932a1 to b59627b Compare November 2, 2015 16:16

tacaswell added a commit that referenced this pull request Nov 5, 2015

Merge pull request #5360 from mdboom/new-memleak-script

d063dee

TST: Add a new memleak script that does everything

tacaswell merged commit d063dee into matplotlib:master Nov 5, 2015

tacaswell removed the status: needs review label Nov 5, 2015

QuLogic added this to the proposed next point release (2.1) milestone Nov 5, 2015

mdboom deleted the new-memleak-script branch November 10, 2015 02:46

efiring mentioned this pull request Aug 12, 2016

large memory leak in new contour routine #6940

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add a new memleak script that does everything #5360

Add a new memleak script that does everything #5360

Uh oh!

mdboom commented Oct 30, 2015

Uh oh!

mdboom commented Oct 30, 2015

Uh oh!

efiring Oct 30, 2015

Uh oh!

mdboom Oct 30, 2015

Uh oh!

efiring commented Oct 30, 2015

Uh oh!

mdboom commented Oct 30, 2015

Uh oh!

mdboom commented Nov 2, 2015

Uh oh!

tacaswell commented Nov 5, 2015

Uh oh!

Uh oh!

Uh oh!

Add a new memleak script that does everything #5360

Add a new memleak script that does everything #5360

Uh oh!

Conversation

mdboom commented Oct 30, 2015

Uh oh!

mdboom commented Oct 30, 2015

Uh oh!

efiring Oct 30, 2015

Choose a reason for hiding this comment

Uh oh!

mdboom Oct 30, 2015

Choose a reason for hiding this comment

Uh oh!

efiring commented Oct 30, 2015

Uh oh!

mdboom commented Oct 30, 2015

Uh oh!

mdboom commented Nov 2, 2015

Uh oh!

tacaswell commented Nov 5, 2015

Uh oh!

Uh oh!