
MAINT: Large overhead in some random functions #15511


Merged
3 commits merged from przemb:random_speedup_15460 into numpy:master on Feb 6, 2020

Conversation

@przemb przemb (Contributor) commented Feb 4, 2020

Fixes #15460

  • slow calls to np.dtype.name replaced with np.dtype (see the sketch below),
  • mtrand.pyx and _generator.pyx updated,
  • test test_warns_byteorder updated
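
As an illustrative sketch of the first bullet (plain Python here; the actual change is in the Cython sources mtrand.pyx and _generator.pyx, and these helper names are invented): the old pattern computes the dtype's name string on every call, while the new pattern compares dtype objects directly.

import numpy as np

def dispatch_old(dtype):
    key = np.dtype(dtype).name        # relatively slow: the name string is computed on access
    if key == 'float64':
        return 'double-precision path'
    elif key == 'float32':
        return 'single-precision path'
    raise TypeError(f'Unsupported dtype "{dtype}"')

def dispatch_new(dtype):
    _dtype = np.dtype(dtype)          # cheap object comparison, no string construction
    if _dtype == np.float64:
        return 'double-precision path'
    elif _dtype == np.float32:
        return 'single-precision path'
    raise TypeError(f'Unsupported dtype "{dtype}"')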

@charris charris added this to the 1.18.2 release milestone on Feb 4, 2020
@charris charris changed the title from "BUG: Large overhead in some random functions #15460" to "MAINT: Large overhead in some random functions #15460" on Feb 4, 2020
@charris charris (Member) commented Feb 4, 2020

Do these fixes change the byte stream?

@przemb przemb changed the title from "MAINT: Large overhead in some random functions #15460" to "MAINT: Large overhead in some random functions" on Feb 4, 2020
@seberg seberg (Member) commented Feb 4, 2020

I am not sure this is a regression, since I think the legacy version did not normally have the dtype kwarg. But we might as well backport.

This should not change the byte streams; it should be purely maintenance in that sense. It would be nice to see whether at least some of these changes are picked up by the benchmarks (if there are benchmarks drawing even a moderate number of values from some of these distributions, I would expect they are).

@przemb przemb (Contributor, author) commented:

slow calls to np.dtype.name replaced with np.dtype,
mtrand.pyx and _generator.pyx updated,
test test_warns_byteorder updated

before:
%timeit rs.random(): 520 ns ± 33.1 ns per loop
%timeit rg.random(): 6.36 µs ± 222 ns per loop

after:
%timeit rs.random(): 453 ns ± 6.95 ns per loop
%timeit rg.random(): 594 ns ± 9.66 ns per loop
@mattip mattip (Member) commented Feb 5, 2020

I set up my laptop for consistent benchmark runs (configured the system so I could do sudo cpupower frequency-set --governor userspace, then set sudo cpupower frequency-set -f 2.1GHz, which is much less than the top frequency). Then running

python runtests.py --bench-compare HEAD

resulted in no significant changes.
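
For reference, the kind of benchmark that would exercise the affected path is a bare scalar draw. A hypothetical ASV-style benchmark along those lines (illustrative only; the class and method names are invented, and this is not claimed to be part of the NumPy benchmark suite at the time) might look like:

import numpy as np

class TimeScalarDraw:
    # Hypothetical ASV-style benchmark: asv times each time_* method.
    def setup(self):
        self.rs = np.random.RandomState()
        self.rg = np.random.default_rng()

    def time_random_state_scalar(self):
        # legacy RandomState path
        self.rs.random()

    def time_generator_scalar(self):
        # Generator path, which goes through the dtype dispatch on every call
        self.rg.random()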

@eric-wieser eric-wieser (Member) commented Feb 5, 2020

Likely means we don't have useful benchmarks. Here's a micro benchmark showing a big improvement:

# factor of 2 savings to be gained with pre-constructed dtypes.
In [10]: %timeit dt == dt2
84.2 ns ± 4.24 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

In [11]: %timeit dt == np.float32
191 ns ± 30.5 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

In [12]: %timeit dt.name == 'float32'
3.87 µs ± 688 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
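
For completeness, a minimal script that reproduces the comparison above; the comment does not show how dt and dt2 were constructed, so np.dtype(np.float32) and np.dtype('float32') are assumed here:

import timeit

import numpy as np

# Assumed setup: two pre-constructed dtype objects.
dt = np.dtype(np.float32)
dt2 = np.dtype('float32')

for stmt in ("dt == dt2", "dt == np.float32", "dt.name == 'float32'"):
    per_call_s = timeit.timeit(stmt, globals=globals(), number=200_000) / 200_000
    print(f"{stmt:25s} {per_call_s * 1e9:8.1f} ns per call")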

@mattip mattip (Member) commented Feb 5, 2020

Fixes #15460. I can see a big improvement on a microbenchmark:

$ python runtests.py --ipython
...
In [1]: rs = np.random.RandomState()
   ...: rg = np.random.default_rng()
In [2]: %timeit rs.random()
191 ns ± 1.11 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

In [3]: %timeit rg.random()
2.54 µs ± 36.6 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

After this changeset (a15dc30):

In [2]: %timeit rs.random()
188 ns ± 2.07 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

In [3]: %timeit rg.random()
239 ns ± 0.968 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

@przemb przemb requested review from eric-wieser and mattip February 5, 2020 14:05
@@ -312,13 +311,13 @@ cdef class Generator:

         """
         cdef double temp
-        key = np.dtype(dtype).name
-        if key == 'float64':
+        _dtype = np.dtype(dtype)
@eric-wieser eric-wieser (Member) commented Feb 5, 2020

Would normally spell this:

Suggested change:
-_dtype = np.dtype(dtype)
+dtype = np.dtype(dtype)

Does Cython not like that?

(same throughout)

@przemb przemb (Contributor, author) commented:

If I change it, the tests fail... it seems that it is not valid.

@eric-wieser eric-wieser (Member) commented Feb 5, 2020

Do you have a link to the CI failure?

Edit: cause diagnosed here: #15511 (comment)

A member commented:

We need the original value so we can raise an error with it later. What about descr or concrete_dtype?
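
A minimal Python sketch of the point being made here (random_like is an invented name; the real methods are Cython code in _generator.pyx and mtrand.pyx): the caller's dtype argument is kept untouched so it can be echoed verbatim in the error message, while the comparisons use a concrete np.dtype object bound to a separate name.

import numpy as np

def random_like(dtype=np.float64):
    _dtype = np.dtype(dtype)      # concrete dtype used for the comparisons
    if _dtype == np.float64:
        return np.float64(0.5)    # stand-in for the double-precision path
    elif _dtype == np.float32:
        return np.float32(0.5)    # stand-in for the single-precision path
    # `dtype` still holds exactly what the caller passed in, so the error
    # message can report it unchanged.
    raise TypeError(f'Unsupported dtype "{dtype}" for random_like')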

@przemb przemb force-pushed the random_speedup_15460 branch from 484ffce to df27b88 on February 5, 2020 16:47
@przemb przemb force-pushed the random_speedup_15460 branch from 69f6edd to 7d181f3 on February 5, 2020 18:27
@eric-wieser eric-wieser (Member) left a comment

Docs still need updating as mentioned in https://github.com/numpy/numpy/pull/15511/files#r375407384

@przemb przemb force-pushed the random_speedup_15460 branch 2 times, most recently from 256a017 to 7821fcc on February 5, 2020 20:17
@przemb przemb requested a review from eric-wieser February 5, 2020 20:20
@przemb przemb force-pushed the random_speedup_15460 branch 2 times, most recently from 5c3c057 to b0fa44a on February 6, 2020 10:12
@przemb przemb force-pushed the random_speedup_15460 branch from b0fa44a to bf6ba07 on February 6, 2020 10:35
@przemb przemb force-pushed the random_speedup_15460 branch from bf6ba07 to 29fb22c on February 6, 2020 10:59
@eric-wieser eric-wieser (Member) left a comment

Thanks for working through the doc fixes!

@przemb przemb force-pushed the random_speedup_15460 branch from 29fb22c to deb8571 on February 6, 2020 14:06
@eric-wieser eric-wieser (Member) left a comment

LGTM

@mattip mattip merged commit dae4f67 into numpy:master Feb 6, 2020
@mattip mattip (Member) commented Feb 6, 2020

Thanks @przemb
