
BENCH: Add a benchmark comparing block to copy in the 3D case #11965


Merged: 1 commit merged into numpy:master from the bench_concatenate branch on Oct 19, 2018

Conversation

hmaarrfk
Contributor

@hmaarrfk commented Sep 16, 2018

Enhances the existing block3D benchmark to show the performance difference between block and a direct memory copy of each of the individual blocks.

The way the test is written, a secondary series on the same graph plots copy alongside block, allowing us to easily compare the two.

This should help motivate the design of an improved blocking algorithm for the case where users are blocking 3D cubes.

=====  ============  =============
  n        block          copy
=====  ============  =============
    1  39.9±0.1μs    3.44±0.02μs
   10  209±200μs     38.6±0.6μs
  100  1.06±0.03s    390±6ms
=====  ============  =============
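
A minimal sketch of how such a parameterized asv benchmark could be structured; the class name, attribute names (arr_list, block), and array shapes here are illustrative assumptions rather than the exact code in this PR:

import numpy as np


class Block3D:
    # 'block' and 'copy' are values of a single 'mode' parameter, so asv
    # plots them as two series on the same graph instead of as two
    # separate benchmarks.
    params = [[1, 10, 100], ['block', 'copy']]
    param_names = ['n', 'mode']

    def setup(self, n, mode):
        # Eight n**3 sub-cubes arranged in the 2x2x2 nested-list layout
        # that np.block expects for the 3D case.
        self.arr_list = [np.full((n, n, n), i, dtype=np.int64)
                         for i in range(8)]
        a = self.arr_list
        self.block = [[[a[0], a[1]], [a[2], a[3]]],
                      [[a[4], a[5]], [a[6], a[7]]]]

    def time_3d(self, n, mode):
        if mode == 'block':
            np.block(self.block)
        else:
            # Copying every sub-array once is roughly the minimum memory
            # traffic any blocking implementation has to perform.
            [arr.copy() for arr in self.arr_list]

Because asv keys results on the benchmark name plus its parameters, both modes end up as lines on a single plot, which is what makes a comparison like the table above easy to read.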

@eric-wieser
Member

There are already benchmarks for block in another file; can you extend those instead of making a new suite?

@hmaarrfk
Contributor Author

Sorry, I'm really bad at navigating this codebase. Thanks!

@eric-wieser
Member

I'd recommend that before adding a new benchmark / test, you search the existing benchmarks / tests folders for places that already call the function you want to profile/test.

@hmaarrfk
Contributor Author

I had found the ones for masked arrays, but not the ones for dense arrays. :/

if mode == 'block':
    np.block(self.block)
else:
    np.concatenate(self.arr_list)
Member

What's the goal here of testing concatenate within an np.block test?

Member

And why use a mode parameter rather than a separate benchmark function? Does this give a more useful asv result if the asv name is the same?

Some comments answering those questions in the code might be a good thing

Contributor Author

This is mostly so that we can directly compare the performance of block and concatenate on the same graph; asv plots different parameter values of the same benchmark together. That way, when creating the new algorithm, we should be able to stop optimizing once we are close to the performance of concatenate.

Member

Sounds like a good reason to me

# allows us to directly compare the benchmark of block
# to that of `concatenate` with the ASV framework.
# block and concatenate will be plotted on the same graph
# as opposed to being displayed as seperate benchmarks
Member

typo: separate

@hmaarrfk
Contributor Author

Thanks for having me extend this benchmark, @eric-wieser. I think we can essentially use this benchmark's algorithm with a reshape to get block done in one shot.

@eric-wieser
Member

I think we can essentially use this benchmark's algorithm with a reshape

I find it very unlikely that that would work. You're right that we could do it in one allocation, but it will require some careful juggling of slices.
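
For illustration only, a rough sketch of the one-allocation idea for the simplest 2x2x2 case with equal-shaped cubes; the real np.block has to handle sub-arrays of unequal shapes, which is where the careful juggling of slices comes in:

import numpy as np


def block_2x2x2(blocks):
    # blocks[i][j][k] is an (n, n, n) array; the result is (2n, 2n, 2n).
    n = blocks[0][0][0].shape[0]
    out = np.empty((2 * n,) * 3, dtype=blocks[0][0][0].dtype)
    for i in range(2):
        for j in range(2):
            for k in range(2):
                out[i * n:(i + 1) * n,
                    j * n:(j + 1) * n,
                    k * n:(k + 1) * n] = blocks[i][j][k]
    return out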

@eric-wieser
Member

@pv: Is there any easy way to make this continue the existing benchmark graphs, rather than resulting in a benchmark with a new name?

@pv
Member

pv commented Sep 16, 2018

I don't think there is.

@hmaarrfk
Contributor Author

Apparently there is a way to build history for a new benchmark, but I'm not too sure how.

@charris changed the title from "Add a benchmark for concatenate and block" to "ENH: Add a benchmarks for concatenate and block" on Sep 17, 2018
@hmaarrfk
Contributor Author

@eric-wieser I would almost like to change this from calling concatenate to calling arr.copy() on all the arrays provided.

The goal is to show that blocking is a memory bandwidth limited algorithm.

Anyway, PR #11971 gets pretty close to the copy limit.

===== ============= ========== ========== ========== ==========
--                                     dtype                   
------------------- -------------------------------------------
n        mode       uint8      uint16     uint32     uint64  
===== ============= ========== ========== ========== ==========
1       block      180±0μs    172±0μs    183±0μs    171±0μs  
1    concatenate   47.7±0μs   50.1±0μs   47.2±0μs   45.9±0μs 
1        copy      30.4±0μs   34.6±0μs   31.5±0μs   36.1±0μs 

10      block      253±0μs    277±0μs    390±0μs    582±0μs  
10   concatenate   88.9±0μs   128±0μs    217±0μs    404±0μs  
10       copy      65.7±0μs   96.0±0μs   150±0μs    287±0μs  

50      block      6.77±0ms   14.6±0ms   27.6±0ms   52.3±0ms 
50   concatenate   6.31±0ms   12.7±0ms   26.8±0ms   53.7±0ms 
50       copy      6.50±0ms   13.2±0ms   27.1±0ms   54.6±0ms 

75      block      23.5±0ms   45.4±0ms   91.1±0ms   181±0ms  
75   concatenate   22.8±0ms   46.5±0ms   91.8±0ms   181±0ms  
75       copy      29.7±0ms   52.2±0ms   95.7±0ms   188±0ms  

100      block      57.8±0ms   121±0ms    208±0ms    394±0ms  
100   concatenate   55.8±0ms   110±0ms    236±0ms    420±0ms  
100       copy      63.2±0ms   115±0ms    212±0ms    426±0ms  

150      block      181±0ms    352±0ms    700±0ms    1.44±0s  
150   concatenate   181±0ms    350±0ms    707±0ms    1.41±0s  
150       copy      183±0ms    355±0ms    706±0ms    1.47±0s  

200      block      435±0ms    851±0ms    1.62±0s    3.45±0s  
200   concatenate   439±0ms    881±0ms    1.73±0s    3.50±0s  
200       copy      442±0ms    890±0ms    1.75±0s    3.54±0s  
===== ============= ========== ========== ========== ==========
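
Outside of asv, the same block-versus-copy comparison can be reproduced with a quick standalone timing; a sketch with arbitrarily chosen sizes, unrelated to the numbers above:

import timeit

import numpy as np

n = 100
a = [np.zeros((n, n, n), dtype=np.uint64) for _ in range(8)]
block = [[[a[0], a[1]], [a[2], a[3]]],
         [[a[4], a[5]], [a[6], a[7]]]]

# Copying each sub-cube once is the memory-bandwidth floor: every element
# has to be written to the output at least once, however block is implemented.
t_copy = timeit.timeit(lambda: [x.copy() for x in a], number=10) / 10
t_block = timeit.timeit(lambda: np.block(block), number=10) / 10
print(f'copy : {t_copy:.4f} s per call')
print(f'block: {t_block:.4f} s per call')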

@mattip
Member

mattip commented Sep 21, 2018

Is this ready to be merged?

@hmaarrfk changed the title from "ENH: Add a benchmarks for concatenate and block" to "ENH: Add a benchmark comparing block to copy in the 3D case" on Sep 21, 2018
@eric-wieser
Member

@mattip: My worry here is that we break continuity in our benchmark history by doing this, something we deliberately considered when we introduced the Block3D test.

@hmaarrfk
Contributor Author

@eric-wieser why is there going to be a new name for the benchmark? Do additional parameters trigger a new name?

@eric-wieser
Member

@hmaarrfk: Yes, I believe they do

@hmaarrfk
Contributor Author

:(

@eric-wieser
Member

An easy workaround would be to just expose the test twice, once under the old name and once under the new name. Failing that, we might want to look into patching asv to allow a params -> name function to be provided.
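
One way the first workaround might look, assuming asv identifies a benchmark by its method name; the method names below are hypothetical, not what this PR ended up doing:

import numpy as np


class Block:
    def setup(self):
        a = np.ones((3, 3, 3))
        self.block = [[[a, a], [a, a]],
                      [[a, a], [a, a]]]

    def _run(self):
        np.block(self.block)

    # The same body exposed under both the historical name and the new one:
    # the old graph keeps accumulating points while history also builds up
    # under the new benchmark.
    def time_3d(self):          # hypothetical old name
        self._run()

    def time_block_copy(self):  # hypothetical new name
        self._run()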

@hmaarrfk
Contributor Author

What happens when we add a new benchmark? Is history created automatically, or will it start from "this new commit"?

@hmaarrfk
Contributor Author

@eric-wieser, if we don't want to create a new benchmark, we would need to run the command:

asv run -E conda:3.6 -b Block.time_3d EXISTING 

to rebuild the history for all the commits that currently have benchmarks. I'm not sure if that is a possibility.

@charris changed the title from "ENH: Add a benchmark comparing block to copy in the 3D case" to "BENCH: Add a benchmark comparing block to copy in the 3D case" on Oct 10, 2018
@mattip force-pushed the bench_concatenate branch from c2086e7 to f37b0c6 on October 19, 2018 08:12
@mattip
Member

mattip commented Oct 19, 2018

rebased off master

@mattip
Member

mattip commented Oct 19, 2018

As discussed at the Oct 17 weekly status meeting, the general opinion was that breaking continuity is not a PR blocker. It is relatively easy to rerun newer benchmarks on older code to determine a baseline and compare across the gap, especially since this PR only touches the benchmark itself.

@mattip merged commit 8d7b7b5 into numpy:master on Oct 19, 2018
@mattip
Member

mattip commented Oct 19, 2018

Thanks @hmaarrfk

@hmaarrfk deleted the bench_concatenate branch on November 5, 2018 03:20