Rework/fix Text layout cache. #22271

Merged · 1 commit · Jan 25, 2022

Conversation

@anntzer (Contributor) commented Jan 20, 2022

Instead of caching the text layout based on a bunch of properties, only
cache the computation of the text's metrics, which 1) should be the most
expensive part (everything else in _get_layout is relatively simple
iteration and arithmetic) and 2) depends on fewer implicit parameters.
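
(For illustration only, not the exact code merged here, and with made-up helper names: a minimal sketch of what "cache only the metrics" can look like.)

```python
import functools

# Sketch with hypothetical names: memoize only the expensive metrics query;
# everything else in _get_layout (per-line offsets, alignment) stays uncached.
@functools.lru_cache(maxsize=4096)
def _cached_metrics(renderer, text, fontprop, ismath):
    # The expensive part: ask the backend for width, height and descent.
    return renderer.get_text_width_height_descent(text, fontprop, ismath)

def get_metrics(renderer, text, fontprop, ismath):
    # Pass a *copy* of the mutable FontProperties so that later in-place
    # changes by the caller cannot alias a stale cache entry.  (A real
    # implementation would also avoid keeping a strong reference to the
    # renderer alive inside the cache, e.g. by keying on a weak reference.)
    return _cached_metrics(renderer, text, fontprop.copy(), ismath)
```

The point being that the cache key is built only from the inputs that actually affect the metrics (renderer, string, font, math flag), not from the full set of layout-affecting Text properties.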

In fact, the old cache key was insufficient: it conflated usetex and non-usetex strings, even though they have different metrics; e.g. with the (extremely artificial) example

```python
figtext(.1, .5, "foo\nbar", size=32)  # (0)
figtext(.1, .5, "foo\nbar", usetex=True, size=32, c="r", alpha=.5)  # (1)
figtext(.3, .5, "foo\nbar", usetex=True, size=32, c="r", alpha=.5)  # (2)
```

the linespacing of the first usetex string (1) would be "wrong": it is bigger than that of the second usetex string (2), because (1) instead reuses the layout computed for the non-usetex string (0).

A further motivation is to eventually give the renderer better control over cache invalidation (via a yet-to-be-added renderer method), e.g. so that multiple instances of the same renderer class could share the same cached layout info.

[Before/after screenshots omitted: compare the position of the left "foo" in the old vs. new output.]

(This is also orthogonal to #21958, which fixes another aspect of the text cache.)

PR Summary

PR Checklist

Tests and Styling

  • Has pytest style unit tests (and pytest passes).
  • Is Flake 8 compliant (install flake8-docstrings and run flake8 --docstring-convention=all).

Documentation

  • New features are documented, with examples if plot related.
  • New features have an entry in doc/users/next_whats_new/ (follow instructions in README.rst there).
  • API changes documented in doc/api/next_api_changes/ (follow instructions in README.rst there).
  • Documentation is sphinx and numpydoc compliant (the docs should build without error).

@tacaswell (Member) commented:

👍 in principle

@anntzer (Contributor, Author) commented Jan 20, 2022

@greglucas Actually, switching to lru_cache would be a bit annoying, because the caching on fontproperties must be based on hash(fontproperties) (as fontproperties themselves are mutable) but then you can't get back the fontproperties from the hash...
(Again, it may be possible to add more indirection around that but it may not really be worth it for now...)
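
To make that concrete, a small hedged illustration (hypothetical names) of why a hand-rolled dict cache can key on the hash while an lru_cache-wrapped function cannot easily do the same:

```python
# The value is computed *before* insertion, while the real FontProperties is
# still in hand, so keying on hash(fontproperties) is fine here:
_metrics_cache = {}

def metrics(renderer, text, fontproperties, ismath):
    key = (text, hash(fontproperties), ismath)
    if key not in _metrics_cache:
        _metrics_cache[key] = renderer.get_text_width_height_descent(
            text, fontproperties, ismath)
    return _metrics_cache[key]

# functools.lru_cache, by contrast, builds its key from the arguments it
# receives; passing hash(fontproperties) as the argument would leave the
# wrapped function with no FontProperties to call the renderer with.
```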

@greglucas (Contributor) left a comment

Looks good to me. Might be nice to add your artificial example in for a quick test... Something as simple as asserting that the w, h, d returned are not the same between the first and second calls due to a proper cache miss now?

I agree, removing the maxdict cache should not hold this up. That was an orthogonal comment/question.
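
A rough sketch of what such a test could look like (not necessarily the test that was actually added; it only uses public API, and the usetex half requires a working TeX installation):

```python
import matplotlib.pyplot as plt

def test_usetex_layout_not_shared():
    # In the real test suite this would be guarded by a skip-if-no-TeX marker.
    fig = plt.figure()
    t0 = fig.text(.1, .5, "foo\nbar", size=32)
    t1 = fig.text(.1, .5, "foo\nbar", size=32, usetex=True)
    fig.canvas.draw()
    # The two texts use different layout engines, so their extents (in
    # particular the height, which reflects the line spacing) should differ
    # now that the usetex text no longer reuses the non-usetex layout.
    assert t0.get_window_extent().height != t1.get_window_extent().height
```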

@anntzer (Contributor, Author) commented Jan 21, 2022

Added test.

@tacaswell Actually, wrt. just using the renderer class as the get_text_cache_invalidation_token: I guess that mplcairo may actually return consistent metrics(?) for different output formats (TBC), but I realized that another issue is that matplotlib's own vector renderers are not used directly; they always get wrapped in a MixedModeRenderer, so plainly checking the type is not going to distinguish e.g. a PDF and an SVG renderer.
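
(For context, a tiny illustration of the wrapping problem; this is not code from the PR:)

```python
# During a PDF or SVG save, Artist.draw() receives a MixedModeRenderer that
# wraps the actual RendererPdf/RendererSVG, so a type-based invalidation
# token sees the same class in both cases and cannot tell the formats apart.
def naive_cache_token(renderer):
    return type(renderer)  # MixedModeRenderer for both PDF and SVG output
```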

@dstansby (Member) commented Jan 21, 2022

> @greglucas Actually, switching to lru_cache would be a bit annoying, because the caching on fontproperties must be based on hash(fontproperties) (as fontproperties themselves are mutable) but then you can't get back the fontproperties from the hash... (Again, it may be possible to add more indirection around that but it may not really be worth it for now...)

A while ago I wrote some code to cache a property based on whether an attribute had changed or not:
https://github.com/sunpy/sunpy/blob/bf3a54f6efd104faad4f4bb7347ba542cb2f16a2/sunpy/util/decorators.py#L324
perhaps there is a use for something like that here?
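
(A generic sketch of that idea, not the sunpy implementation linked above: cache a zero-argument method and recompute whenever a watched attribute changes.)

```python
import functools

def cached_on_attr(attr_name):
    """Cache a zero-argument method; recompute when ``attr_name`` changes."""
    def decorator(func):
        cache_attr = f"_{func.__name__}_cache"
        seen_attr = f"_{func.__name__}_seen"

        @functools.wraps(func)
        def wrapper(self):
            current = getattr(self, attr_name)
            # For mutable attributes one would store an immutable snapshot
            # (e.g. a copy or a hash) instead of the object itself.
            if (not hasattr(self, cache_attr)
                    or getattr(self, seen_attr) != current):
                setattr(self, cache_attr, func(self))
                setattr(self, seen_attr, current)
            return getattr(self, cache_attr)
        return wrapper
    return decorator
```

Usage would be something like decorating a metrics helper with cached_on_attr("_fontproperties") (attribute name illustrative), assuming equality on that attribute captures all the state the cached value depends on.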

@anntzer (Contributor, Author) commented Jan 21, 2022

There are a few different ways to skin this cat (thanks for linking your implementation), but we can probably have that discussion in #22278?

5 participants