
Improve rendering when using multiple / large labels #21958


Status: Draft (wants to merge 5 commits into main)

Conversation

ojeda-e
Contributor

@ojeda-e ojeda-e commented Dec 15, 2021

PR Summary

Fixes #21895
The example reported in #21895 initially runs in 10.079 s (locally, on my old laptop). Moving the cache in lib/matplotlib/text.py to an attribute of Text brings the example down to 4.426 s, a ~56% reduction in runtime (roughly 2.3x faster). I added a unit test that uses labels of increasing length (growing as 10**i) and asserts that times are shorter. This is not quite the approach suggested by @tacaswell, because I am not comparing text vs. no text. I'm not fully convinced of this approach either, but can't think of anything better; I'd appreciate suggestions to make the test more solid.
No changes in matplotlib/lib/matplotlib/mathtext.py.
As a separate note, this also probably needs a benchmark in mpl_benchmarks as suggested by @jklymak on Gitter (thanks Jody!).
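The change described above can be sketched as follows. All names here are illustrative stand-ins, not Matplotlib's actual internals: the idea is moving the cache from a class attribute shared by every Text instance to a per-instance attribute, so each artist only ever searches its own (small) cache.

```python
class TextSketch:
    """Toy stand-in for matplotlib's Text artist (illustrative only)."""

    def __init__(self, s):
        self.s = s
        # Per-instance cache instead of a class-level shared dict:
        # each artist remembers only its own layout results.
        self._cached = {}

    def get_layout(self, fontsize=10.0, dpi=100.0):
        key = (self.s, fontsize, dpi)
        if key not in self._cached:
            # Stand-in for the expensive text-layout computation.
            self._cached[key] = (len(self.s) * fontsize * dpi / 72.0, fontsize)
        return self._cached[key]
```

With a shared class-level dict, every lookup competes with entries from all other labels (and evictions from a bounded dict can thrash); with a per-instance dict, lookups stay O(1) on a handful of keys regardless of how many labels the figure contains.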

PR Checklist

Tests and Styling

  • Has pytest style unit tests (and pytest passes).
  • Is Flake 8 compliant (install flake8-docstrings and run flake8 --docstring-convention=all).

Documentation

  • [N/A] New features are documented, with examples if plot related.
  • [N/A] New features have an entry in doc/users/next_whats_new/ (follow instructions in README.rst there).
  • [N/A] API changes documented in doc/api/next_api_changes/ (follow instructions in README.rst there).
  • [N/A] Documentation is sphinx and numpydoc compliant (the docs should build without error).

@jklymak
Member

jklymak commented Dec 15, 2021

This seems reasonable.

Unfortunately, the figure doesn't survive pickling; i.e., with the new cache, the objects are not serializable. I expect the figure used to have something that detached the cache, and you could likely do the same thing here? But I'm not an expert on (nor a fan of) pickling figures ;-)

@ojeda-e
Contributor Author

ojeda-e commented Dec 15, 2021

Thanks for the brief explanation @jklymak.
I managed to make the weakref picklable by adding __setstate__ to Text. Using the same example from the issue, the improvement is preserved, and with this change test_pickle.py passes locally.
However, some of my local tests are failing, even on the main branch. I pushed the changes and will use the CI to check whether more tests fail. (Sorry about that; I know it isn't good practice, but it will help while I figure out what broke in my environment.)
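One common pattern that matches the fix described here (an assumption on my part; the actual PR may differ in detail) is to drop the cache in __getstate__ and recreate it empty in __setstate__, so unpicklable entries such as weakrefs never reach the pickler:

```python
import pickle
import weakref


class Target:
    """Something a cache entry might weakly reference."""


class CachedArtist:
    """Illustrative sketch, not Matplotlib's Text class."""

    def __init__(self, s):
        self.s = s
        self._cached = {}  # may hold unpicklable objects (e.g. weakrefs)

    def __getstate__(self):
        state = self.__dict__.copy()
        state.pop('_cached', None)  # drop the cache before pickling
        return state

    def __setstate__(self, state):
        self.__dict__.update(state)
        self._cached = {}           # recreate an empty cache on unpickle


t = Target()
a = CachedArtist("label")
a._cached[("label", 10)] = weakref.ref(t)  # weakrefs cannot be pickled
b = pickle.loads(pickle.dumps(a))          # works: the cache was dropped
```

The cache is a pure performance optimization, so discarding it on pickle is safe; it simply repopulates on the next draw.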

@jklymak
Member

jklymak commented Dec 15, 2021

Many tests fail locally for me as well. If you are pretty sure it's not due to your PR, it's perfectly fine to push to the more controlled test environment.

@ojeda-e
Contributor Author

ojeda-e commented Dec 16, 2021

Thanks for the comments @tacaswell; I hope the recent changes address them.

@anntzer
Contributor

anntzer commented Dec 16, 2021

I wonder if it may make sense to keep the old global cache layer as well, so that if you're e.g. drawing 10 axes, each with the same tick labels, the labels on the later axes still benefit from the work done on the first. Of course, that would require a bit more rearchitecting of the caching...

@anntzer anntzer mentioned this pull request Jan 20, 2022
@@ -107,7 +107,6 @@ class Text(Artist):
     """Handle storing and drawing of text in window or data coordinates."""

     zorder = 3
-    _cached = cbook.maxdict(50)
Member

Is this dict really a memory hog if it gets larger? Why not just set it to a large number and move on? Just because we can build a double caching structure doesn't mean we should. Given that we have had very few complaints of this nature, my assumption is that most people are not hitting the 50-element limit, so bumping it to 5000 for the few people who need it doesn't seem like a terrible idea.

Note that this largely catches users who have accidentally used categoricals and created thousands of ticks. If anything, I think we should add logic in the categorical ticker that raises a warning if more than 100 ticks are being asked for.
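The warning suggested above could look something like this (a sketch only; check_tick_count and the limit of 100 are illustrative, not Matplotlib code):

```python
import warnings


def check_tick_count(n_ticks, limit=100):
    """Warn when a categorical axis is about to create an unreasonable
    number of ticks, which usually means string data was passed by accident.
    Illustrative sketch, not Matplotlib's categorical ticker."""
    if n_ticks > limit:
        warnings.warn(
            f"Categorical axis would create {n_ticks} ticks; did you "
            "accidentally pass string data where numbers were intended?",
            UserWarning,
        )
```

Warning (rather than raising) keeps legitimate many-category plots working while pointing accidental-categorical users at the real cause of the slowdown.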

@QuLogic QuLogic modified the milestones: v3.6.0, v3.7.0 Jul 5, 2022
@QuLogic
Member

QuLogic commented Jul 5, 2022

@anntzer what is the status here after #22271? Does this just need a rebase?

@anntzer
Contributor

anntzer commented Jul 11, 2022

No, because then there was #22323, which further got rid of the explicit _cached in favor of an lru_cache. There may still be some small benefit to caching layout info per instance, although restoring that (it was mostly removed in #22271, which places the cache a bit earlier in the call tree) would require some more work, not a simple rebase (and I don't know whether timings would show a large further improvement).

@tacaswell tacaswell removed this from the v3.7.0 milestone Dec 16, 2022
@tacaswell tacaswell added this to the v3.8.0 milestone Dec 16, 2022
@tacaswell
Member

Moved to 3.8 as the rebase is non-trivial and there have been other changes to this caching layer, so we need to evaluate whether we still need this.

@ksunden ksunden modified the milestones: v3.8.0, future releases Aug 8, 2023

Successfully merging this pull request may close these issues.

[Bug]: slow rendering of multiple axes (time scales as 2nd power of label count)
6 participants