MNT: more benchmark cleanup #26639

ngoldbaum · 2024-06-07T03:43:04Z

In the vein of #26638, this makes more of the benchmarks reproducible. It also deletes some dead code.

mattip · 2024-06-07T05:00:54Z

benchmarks/benchmarks/bench_creation.py

@@ -13,7 +13,8 @@ class MeshGrid(Benchmark):
    timeout = 10

    def setup(self, size, ndims, ind, ndtype):
-        self.grid_dims = [(np.random.ranf(size)).astype(ndtype) for
+        rnd = np.random.RandomState(1864768776)


If we are already touching these, should we use the recommended

    rng = np.random.default_rng(1864768776)
    rng.random(size, dtype)

My reading of NEP 19 is that we're supposed to use np.random.RandomState for stable, reproducible RNG sequences across numpy versions, which is why I used it explicitly in this PR. I guess it's been quite a while since NEP 19 - is that no longer the best recommendation?

@ngoldbaum you're correct here. The testing use case (which applies to benchmarking too) for RandomState hasn't changed.
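For readers following this thread, here is a minimal sketch of a seeded benchmark setup along the lines of the diff above. The class name `MeshGridSketch`, the `time_meshgrid` body, and the parameter values are hypothetical stand-ins, not the code merged in this PR; only the seed and the use of np.random.RandomState come from the diff.

```python
import numpy as np


class MeshGridSketch:
    """Hypothetical stand-in for the MeshGrid benchmark in the diff."""

    def setup(self, size, ndims, ind, ndtype):
        # A fixed seed means every run builds identical input arrays.
        rnd = np.random.RandomState(1864768776)
        self.grid_dims = [
            rnd.random_sample(size).astype(ndtype) for _ in range(ndims)
        ]

    def time_meshgrid(self, size, ndims, ind, ndtype):
        # Illustrative timed body; the real benchmark may differ.
        np.meshgrid(*self.grid_dims, indexing=ind)


# Two independent setups with the same parameters see the same inputs.
a = MeshGridSketch()
b = MeshGridSketch()
a.setup(16, 2, "ij", np.float64)
b.setup(16, 2, "ij", np.float64)
same_inputs = all(
    np.array_equal(x, y) for x, y in zip(a.grid_dims, b.grid_dims)
)
a.time_meshgrid(16, 2, "ij", np.float64)
```

Creating the RandomState inside `setup` keeps the benchmark independent of whatever has already consumed the global np.random state.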

charris · 2024-06-07T16:12:59Z

RandomState has the advantage that the bit stream is guaranteed to stay the same; we don't make that promise for the new generators, so I think it is still the right choice for testing. Hmm, I wonder if that stability guarantee has implications for the threading work?

mattip · 2024-06-09T09:14:26Z

Hmm. Not this PR, but maybe we should document the use-cases explicitly:

For stable benchmarking, use np.random.RandomState since it is guaranteed to be stable across versions. When the goal is to obtain a close-to-theoretical random distribution of values, np.random.Generator should be used.
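As a rough illustration of the two use cases described above, the sketch below seeds both interfaces. The variable names and array sizes are assumptions made for the example; the seed is the one from the diff. The code only demonstrates reproducibility within a single NumPy installation. The cross-version guarantee for RandomState is a policy statement and cannot be shown by running code.

```python
import numpy as np

SEED = 1864768776  # seed value taken from the diff in this PR

# Benchmark or test inputs: RandomState, whose stream is kept stable
# across NumPy versions.
legacy = np.random.RandomState(SEED)
bench_input = legacy.random_sample(1000)

# Statistical work: Generator, obtained through default_rng.
rng = np.random.default_rng(SEED)
samples = rng.random(1000, dtype=np.float64)

# Reseeding reproduces the same values for both interfaces
# within one NumPy installation.
repeat_legacy = np.random.RandomState(SEED).random_sample(1000)
repeat_rng = np.random.default_rng(SEED).random(1000)
```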

mattip · 2024-06-09T09:16:07Z

benchmarks/benchmarks/bench_function_base.py

        array_class = array_type[0]
-        self.arr = getattr(SortGenerator, array_class)(self.ARRAY_SIZE, dtype, *array_type[1:])
+        self.arr = getattr(SortGenerator, array_class)(self.ARRAY_SIZE, dtype, *array_type[1:], rnd)


The linter is annoyed at this line
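The call in the diff passes the seeded RNG as the last positional argument to a generator method looked up by name. A minimal sketch of that pattern follows, assuming a hypothetical `SortGeneratorSketch` helper; its method names, signatures, and the `build` function are illustrative and are not the real SortGenerator API.

```python
import numpy as np


class SortGeneratorSketch:
    """Hypothetical helper; names and signatures are illustrative only."""

    @staticmethod
    def random(size, dtype, rnd):
        # Shuffle with the caller's RNG instead of the global np.random state.
        arr = np.arange(size, dtype=dtype)
        rnd.shuffle(arr)
        return arr

    @staticmethod
    def sorted_block(size, dtype, block_size, rnd):
        # Extra positional parameters come before the RNG, as in the diff.
        arr = np.arange(size, dtype=dtype)
        for start in range(0, size, block_size):
            # Slices are views, so shuffling one permutes arr in place.
            rnd.shuffle(arr[start:start + block_size])
        return arr


def build(array_type, size, dtype, seed=1):
    # array_type is a tuple: method name followed by its extra arguments.
    rnd = np.random.RandomState(seed)
    array_class = array_type[0]
    return getattr(SortGeneratorSketch, array_class)(
        size, dtype, *array_type[1:], rnd
    )


first = build(("random",), 100, np.int64)
second = build(("random",), 100, np.int64)
blocks = build(("sorted_block", 10), 100, np.int64)
```

Passing the RNG explicitly keeps the seed in one place and lets each helper stay free of global state.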

rgommers

All LGTM, let's get this in. Thanks @ngoldbaum & reviewers.

Linter complaint is irrelevant, so ignoring. @mattip's suggestion to document the RandomState use case better in a follow-up sounds like a good idea to me.

github-actions bot added the 03 - Maintenance label Jun 7, 2024

MNT: seed more RNGs

5e0dded

ngoldbaum force-pushed the benchmark-investigation branch from 8d56a60 to 5e0dded Compare June 7, 2024 03:49

mattip reviewed Jun 7, 2024

ngoldbaum mentioned this pull request Jun 7, 2024

MNT: Reorganize non-constant global statics into structs #26607

Merged

mattip reviewed Jun 9, 2024

rgommers approved these changes Jun 11, 2024

rgommers merged commit db63ee8 into numpy:main Jun 11, 2024
63 of 68 checks passed

rgommers added this to the 2.1.0 release milestone Jun 11, 2024
