Speed up Path.iter_segments() #13039

lazka · 2018-12-22T17:19:09Z

PR Summary

Using the Gtk3Cairo backend with wire3d_animation_sgskip.py:

before: 9.16 fps
iter: 9.95 fps
iter+types: 15.26 fps

The main speedup comes from iterating while keeping the common non-curve case simple, and from converting the code type constants to the same type as the codes array, which makes comparisons between them faster.
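The idea described above can be sketched roughly as follows. This is a simplified illustration, not the actual matplotlib implementation: the function name `iter_segments_sketch` and the module-level constants are stand-ins for the corresponding `Path` attributes, and the real method also handles cleaning, transforms and clipping, which are omitted here.

```python
import numpy as np

# Stand-ins for the Path code constants, stored as np.uint8 so that they
# have the same type as the elements of the codes array.
code_type = np.uint8
STOP = code_type(0)
MOVETO = code_type(1)
LINETO = code_type(2)
CURVE3 = code_type(3)
CURVE4 = code_type(4)
CLOSEPOLY = code_type(79)

# Number of vertices consumed by each code.
NUM_VERTICES_FOR_CODE = {
    STOP: 1, MOVETO: 1, LINETO: 1, CURVE3: 2, CURVE4: 3, CLOSEPOLY: 1,
}


def iter_segments_sketch(vertices, codes):
    """Yield (flattened_vertices, code) for each segment of a path."""
    vertex_iter = iter(vertices)
    code_iter = iter(codes)
    for curr_vertices, code in zip(vertex_iter, code_iter):
        if code == STOP:
            break
        extra = NUM_VERTICES_FOR_CODE[code] - 1
        if extra:
            # Curve case: pull the remaining control points from the same
            # iterators so the outer loop resumes after them.
            parts = [curr_vertices]
            for _ in range(extra):
                parts.append(next(vertex_iter))
                next(code_iter)
            curr_vertices = np.concatenate(parts)
        # Non-curve case (MOVETO, LINETO, CLOSEPOLY): no extra work.
        yield curr_vertices, code
```

For paths that consist mostly of MOVETO and LINETO codes, each loop iteration then costs one comparison and one dictionary lookup.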

PR Checklist

Has Pytest style unit tests
Code is Flake 8 compliant
New features are documented, with examples if plot related
Documentation is sphinx and numpydoc compliant
Added an entry to doc/users/next_whats_new/ if major new feature (follow instructions in README.rst there)
Documented in doc/api/api_changes.rst if API changed in a backward-incompatible way

With the performance improvements in matplotlib#13040 and matplotlib#13039 the old slow path is now faster than the previously fast one. And it also works with pycairo.

timhoffm · 2018-12-23T17:04:11Z

Welcome to matplotlib development and thanks for your contribution.
You've found a good performance improvement!

Comparing different types costs extra time as the following tests show:

Native (int reference, np.int64 array):

In [16]: %%timeit ref = 5
    ...: for i in np.arange(1_000_000):
    ...:     if i == ref:
    ...:         continue
    ...:     
887 ms ± 49.6 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

int reference, np.uint8 array:

In [17]: %%timeit ref = 5
    ...: for i in np.arange(1_000_000, dtype=np.uint8):
    ...:     if i == ref:
    ...:         continue
    ...:     
4.59 s ± 87.1 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

np.uint8 reference, np.uint8 array:

In [18]: %%timeit ref = np.uint8(5)
    ...: for i in np.arange(1_000_000, dtype=np.uint8):
    ...:     if i == ref:
    ...:         continue
    ...:     
443 ms ± 23.8 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

np.uint64 reference, np.uint64 array:

In [19]: %%timeit ref = np.uint64(5)
    ...: for i in np.arange(1_000_000, dtype=np.uint64):
    ...:     if i == ref:
    ...:         continue
    ...:     
475 ms ± 36.8 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)
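For anyone who wants to rerun this comparison outside IPython, a standalone script along these lines should work. It is only a sketch: the helper names `count_matches` and `bench` are made up for this example, the array is filled with zeros instead of using `np.arange`, and the absolute timings depend on the machine and the NumPy version, so they may not match the numbers above.

```python
import timeit

import numpy as np


def count_matches(values, ref):
    """Count elements equal to ref using a plain Python loop."""
    hits = 0
    for v in values:
        if v == ref:
            hits += 1
    return hits


def bench(dtype, ref, n=100_000, repeat=3):
    """Return the best wall-clock time in seconds for one pass over n elements."""
    values = np.zeros(n, dtype=dtype)
    timings = timeit.repeat(lambda: count_matches(values, ref),
                            number=1, repeat=repeat)
    return min(timings)


if __name__ == "__main__":
    cases = [
        ("int reference, np.int64 array", np.int64, 5),
        ("int reference, np.uint8 array", np.uint8, 5),
        ("np.uint8 reference, np.uint8 array", np.uint8, np.uint8(5)),
    ]
    for label, dtype, ref in cases:
        print(f"{label}: {bench(dtype, ref) * 1000:.1f} ms")
```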

The codes always seem to be uint8. There is a class variable Path.code_type, which is unfortunately public but most likely intended only for reading, not for writing, and the Path docstring also says codes should be uint8. I recommend changing Path.STOP and Path.NUM_VERTICES_FOR_CODE to this type to avoid casting the individual elements at all, which should make the code even a bit faster.

lazka · 2018-12-23T17:58:24Z

Thanks for the benchmarks, using uint8 seems much faster indeed.

Small question: do you mean adjusting Path.STOP itself, or only casting it in iter_segments()? The former would make many things that use the constants faster everywhere, I guess, but I'm not sure about compatibility.

edit: I've pushed the direct change for starters.

timhoffm · 2018-12-23T18:42:48Z

Yes, I meant the direct change.

I'm 99% sure about compatibility. uint8 can drop in wherever you expect an int, and the predefined constants are small enough and don't change, so we don't have to worry about overflows. Even if somebody extends this with their own implementation and uses int, everything should still work.
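A few quick checks illustrate the compatibility argument. This is a sketch with hypothetical stand-in constants, not a full compatibility test; it also shows one difference worth knowing about, namely that a NumPy scalar is not an instance of the built-in int.

```python
import numpy as np

# Hypothetical stand-ins for two of the Path constants after the change.
LINETO = np.uint8(2)
CLOSEPOLY = np.uint8(79)

# A dict keyed by plain ints still accepts the uint8 constant, because
# equality and hashing agree between np.uint8(2) and 2.
names_by_int = {1: "MOVETO", 2: "LINETO", 79: "CLOSEPOLY"}
lookup = names_by_int[LINETO]

# The constant can be used as a sequence index.
element = ["a", "b", "c"][LINETO]

# Comparing against a user-supplied array with the default integer dtype
# still works elementwise.
user_codes = np.array([1, 2, 2, 79])
mask = user_codes == LINETO

# Converting back to a built-in int is lossless for these small values.
as_int = int(CLOSEPOLY)

# Caveat: code that checks isinstance(code, int) would see a difference.
is_python_int = isinstance(LINETO, int)
```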

We should probably add an API change note to be on the safe side.

That said, I'd still want the opinion of other devs on this.

… the codes array

The matching types make comparisons between the constants and values of the codes array faster.

Using the Gtk3Cairo backend with wire3d_animation_sgskip.py:

Before: 9.95 fps
After: 15.26 fps

The main areas where this helps in the cairo case are the faster comparisons in Path.iter_segments() and in _append_paths() of the cairo backend.

lazka · 2018-12-23T19:10:38Z

We should probably add an API change note to be on the safe side.

I've added something to next_api_changes

timhoffm

Looks good.

Out of curiosity, have you benchmarked the new version?

lazka · 2018-12-24T08:19:38Z

Out of curiosity, have you benchmarked the new version?

Only using wire3d_animation_sgskip + pycairo, see the PR summary and commits. (best out of 5 each time)

With the performance improvements in matplotlib#13039 the old slow path is now faster than the previously fast one. And it also works with pycairo.

Using the Gtk3Cairo backend with wire3d_animation_sgskip.py:

cairocffi + append_fast: 13.27 fps
cairo + append_slow: 15.07 fps
cairocffi + append_slow: 13.54 fps

lazka mentioned this pull request Dec 22, 2018

cairo: speed up the cairo append_path() slow path #13040

Closed

6 tasks

lazka force-pushed the speed-up-iter-segments branch from dcb5fbb to a798344 Compare December 23, 2018 11:11

lazka mentioned this pull request Dec 23, 2018

cairo: remove the append_path() fast path #13042

Merged

6 tasks

Speed up Path.iter_segments()

8b65fa1

Using the Gtk3Cairo backend with wire3d_animation_sgskip.py:

Before: 9.16 fps
After: 9.95 fps

The main speedup is from iterating and keeping the common non-curve case simple.

lazka force-pushed the speed-up-iter-segments branch from a798344 to ce7a6a6 Compare December 23, 2018 18:02

lazka force-pushed the speed-up-iter-segments branch from ce7a6a6 to 3b8689a Compare December 23, 2018 19:09

timhoffm approved these changes Dec 23, 2018

timhoffm added this to the v3.1 milestone Dec 23, 2018

timhoffm added the Performance label Dec 23, 2018

tacaswell merged commit 87085bd into matplotlib:master Dec 24, 2018
