fix tightbbox to account for markeredgewidth #16607

brunobeltran · 2020-02-29T00:38:57Z

PR Summary

This initial PR is a naive fix for issue #16606; it will be updated as I document what the expected behavior should be for each marker type.

In short, increases the padding added to Line2D.get_window_extent in order to account for markers with non-trivial edge widths.

I also do not know what kind of test would be appropriate to add here? Just code that makes a plot that's composed of one marker with known size and edge width and then check that get_tightbbox returns the right values? Is the goal one test per marker type?

PR Checklist

Has Pytest style unit tests
Code is Flake 8 compliant
New features are documented, with examples if plot related
Documentation is sphinx and numpydoc compliant
Added an entry to doc/users/next_whats_new/ if major new feature (follow instructions in README.rst there)
Documented in doc/api/api_changes.rst if API changed in a backward-incompatible way

timhoffm

Just code that makes a plot that's composed of one marker with known size and edge width and then check that get_tightbbox returns the right values?

Sounds good.

I don‘t know if it‘s necessary/worth to distinguish different markers.

timhoffm · 2020-02-29T09:12:26Z

lib/matplotlib/lines.py

@@ -617,7 +617,8 @@ def get_window_extent(self, renderer):
                                 ignore=True)
        # correct for marker size, if any
        if self._marker:
-            ms = (self._markersize / 72.0 * self.figure.dpi) * 0.5
+            extra_pts = self._markersize + self._markeredgewidth


Suggested change

extra_pts = self._markersize + self._markeredgewidth

extra_pts = self._markersize + self._markeredgewidth / 2

I think. The edge line is drawn centered on the marker outline. So only half of the width adds to the extent.

You're right, but this is already accounted for in the * 0.5 for both marker size and edge width in the next line. In the more general case, this multiplication is no longer needed, see newest commits.

brunobeltran · 2020-03-03T18:41:20Z

Some design decisions that I didn't know how to make:

I implemented a get_centered_bbox function. Do I implement get_extents, or get_bbox, or something else instead? There doesn't seem to be an agreed-upon convention in the codebase for how to extract the bbox information from an object (lines have get_window_extents, paths have get_extents, etc.)
the information to calculate the PathEndAngles for each glyph can theoretically be extracted from the path itself. I opted to manually tabulate them instead since the number of glyphs is small and they are geometrically straightforward. A more generalized approach that identifies the points on the path that "create" the bbox, then extracts their PathEndAngles from the path itself would be more general and theoretically resilient to future/custom glyph types, but would be quite a bit more work.
more in general, I'm not familiar enough with the mpl codebase as a whole to know if there's already a convention for aggragating data like I do using BoxSides(PathEndAngles(...))
On that note, as long as we don't do something like suggested above in (2), we will now be storing more than two per-glyph-type data things (and if e.g. Sizes of different markers are not perceptually uniform #15703 gets implemented we'll have a couple of more pieces of per-glyph-type information, the area and visual scaling constants, saved as well). This might argue for a simple refactor of the current markers dictionary into a list of GlyphInfo objects, containing (for each glyph) the
a) list of valid short names (e.g. ['', ' ', 'None', None])
b) long name (e.g. 'star')
c) PathEndAngles (or whatever they end up being called)
d) area of glyph, to scale with
e) (in the future) effective visual size scaling constant

I'm happy to do that simple refactor here, since I'm adding the information, but wanted to make sure there was a shared feeling that it is necessary in the first place, and that I wouldn't be wasting too much time doing it in a style that would immediately be rejected.

brunobeltran · 2020-03-03T19:11:50Z

Just code that makes a plot that's composed of one marker with known size and edge width and then check that get_tightbbox returns the right values?

I don‘t know if it‘s necessary/worth to distinguish different markers.

Now that it's more obvious that this gets messy on a per-glyph basis, should we include one test per glyph or just test a couple of the nasty ones? (say, 'p', '*', 'o', 'x', to get examples of glyphs with miter joins, round joins, filled and not)?

brunobeltran · 2020-03-04T16:42:33Z

Weird. The tests I just added pass in a Jupyter notebook but not with the Agg backend or on Travis?

Basically, the code

_draw_marker_outlined('*', markeredgewidth=20)

produces a 480x480px image with the box in the wrong place.

The calling the identical function in Jupyter...

test_marker._draw_marker_outlined('*', markeredgewidth=20)
plt.savefig('marker_bbox_star.png')

produces a correct figure, but 345x345px.

I don't think this is a problem in my own code, since the box is incorrect even for the trivial case of markeredgewidth=0, and doing the following in Jupyter produces a 480x480px image that is also correct

test_marker._draw_marker_outlined('*', markeredgewidth=20)
plt.savefig('marker_bbox_star.png', dpi=100)

brunobeltran · 2020-03-04T16:44:04Z

@timhoffm Ready for re-review and need help.

My new tests are passing in Jupyter, but not in the console (see above).
I ended up adding a new public API element MarkerStyle.get_centered_bbox, but don't know where or how to document it appropriately besides its docstring.

timhoffm · 2020-03-04T20:15:46Z

I think this is a dpi issue. Some calculation does not Takt them into account. The jupyter backend (%matplotlib inline) sets 72dpi by default, everything else uses 100dpi. What happens if you do plt.rcParams["figure.dpi"]?

Sorry for the formatting, I‘m on mobile.

I‘m quite busy right now. Please be patient with a full review. That may take a couple of days.

brunobeltran · 2020-03-04T20:30:41Z

Thanks for the quick reply Tim! Looks like it is a DPI thing, I'll follow that lead.

Of course I assume you're very busy, and I definitely wasn't meaning to rush you. I just wanted to ping so you knew it was safe to start looking without wasting even more of your time.

Thanks in advance for your help!

ImportanceOfBeingErnest · 2020-03-04T20:56:23Z

You removed the line ms = (self._markersize / 72.0 * self.figure.dpi) * 0.5 which accounts for dpi. So the box will be wrong for every dpi setting, except 72, where 1/72*dpi==1.

brunobeltran · 2020-03-04T21:51:57Z

Haha, didn't see the "member" when I replied to you earlier @ImportanceOfBeingErnest. Hopefully I didn't come off as presumptuous!

Mostly, thanks for the good eye! I had forgotten that the markers stuff is in points. Fixed the code to make that consistent. Tests should pass now. Had to push up new "baseline_images", since they were generated at different DPI than before for whatever reason, but now look as expected.

ImportanceOfBeingErnest · 2020-03-05T00:09:24Z

To have image comparisson look like the default style one can use

@image_comparison(..., style='mpl20')

...just in case that is still relevant. One could also try to use only a single image (image comparison is rather expensive, so if it can be reduced to a minimum, that would sure be nice)

In general I haven't quite understood how wrong adding half the edge width would be and if that really warrants adding this huge code block. In particular, I would have though adding half the edgewidth would rather overestimate the bbox - which would be totally acceptable in my eyes.
Maybe an example case either here, or in the original issue #16606 would be helpful in that respect.

jklymak · 2020-03-05T00:20:38Z

I agree with @ImportanceOfBeingErnest. This is for the rare case that a marker over-spills its axes, and most folks don't have markeredgewsidths that are much larger than a point or two. This is really cool that you sorted through all the geometry, but is it overkill compared to the actual problem?

anntzer · 2020-03-06T00:35:29Z

If that's the problem, I'm happy to write a slighly more general solution that computes these properties from the path of the marker. It is possible, just a little difficult in the general case as computing the extents of stroked bezier paths is famously complicated. However, I do know how to do it correctly, and since I use a lot of custom markers, it would greatly benefit me since my fix would then apply in that case as well.

But I don't think we need to cover the general case here, just polygons and circles. I guess polygons would be relatively simpler; for circles, somehow luckily, falling back to 0.5*mew would work :-)

If I do this, will it get mainlined?

Well that's the problem as always: any code that goes in needs to be maintained and is extremely annoying to get rid of, and maintainer time is finite. That's why we're not necessarily so keen on adding a few hundreds of line of code just to handle a relatively edge case :-) (even though it's quite impressive work, as mentioned just above :-))

Conveniently, the currently hardcoded figure properties mean we would have amazing test coverage of this more general solution with no extra work I suppose....

Not a huge fan of playing the coverage numbers game :-)

brunobeltran · 2020-03-10T09:49:23Z

@anntzer , I got a little swamped with paper deadlines, but started working on code for general solution today (written, just needs testing).

However, I ran into a bit of a design issue. Right now bezier.py imports path.py (but only in a couple of functions that feel to me like they should be in path.py anyway...). Obviously a general routine that calculates bbox for a stroked path belongs as a method of Path. But I need helper routines from bezier.py so....

Should I

put my code in bezier.py/markers.py (as in current commit, less elegant, but no API breaking)]
fix the underlying issue, move appropriate code from bezier.py into path.py, do it "right" (internal API break, but that code is really only used in one place internally...)

in order to minimize resistance to this already pretty contentious PR?

anntzer · 2020-03-11T09:55:12Z

I would be fine with moving stuff from bezier.py to path.py if that's necessary.
You may consider breaking stuff into multiple consecutive PRs to prevent review from going out of control... ;)

timhoffm · 2020-03-12T00:00:15Z

Before there‘s going more work into this: Are we positive, that we want to take on the exact solution with the added amount of code vs. an approximate solution? It would be a shame to detail it all out and later decide that the maintenance burden would be too high.

I haven‘t looked into this in detail, but the amount of code needed to special-case the exact solution scares me a little.

Ping @jklymak @anntzer @tacaswell @ImportanceOfBeingErnest I think we should have a champion for this. If nobody is stepping up for it, I fear the PR will have a hard time of getting merged.

anntzer · 2020-03-12T00:26:00Z

I think this will depend on the amount of code involved... (I can't guarantee a quick review, but will try to keep a look on this.)
I just realized there may be an easier solution: I think the "true" size of a marker likely always grows as a*markersize+b*markeredgewidth+c? in which case we could just get away with estimating a, b and c by rasterizing the marker at a few ms/mew values (the results can be cached per-marker), instead of doing the complicated geometry calculations.

brunobeltran · 2020-03-12T00:30:02Z

Yeah something like @anntzer's proposal was actually my plan after you guys asked about performance, just didn't want to dump a bunch of work into this if it wasn't going to get accepted, as the current implementation works just fine for my purposes.

The current recommendation is:

Fix Path/Bezier to depend on each other in a way that makes sense, clean up redundant code (separate PR).
Add iter_corners method to compute geometrical properties directly from the Path.
Cache constants on MarkerStyle creation (a/b/c, as above) that describe how marker Bbox scales with markersize/markeredgewidth.
Add MarkerStyle.get_extents(renderer) to get exact bbox without slowdown compared to current method.

brunobeltran · 2020-04-20T13:36:20Z

Superceded by #17119, a cleaner implementation using some of my recent additions to Path.

fix tightbbox to account for markeredgewidth

90dce86

timhoffm reviewed Feb 29, 2020

View reviewed changes

brunobeltran added 6 commits March 1, 2020 13:52

cleanup definition of "point" marker

5ed49fd

document unit_regular_polygon

9088cf4

untested version of new code to get marker bbox

fd6df9b

fix for marker bbox now works except for on miter

ef2fefd

fixed mis-ordered PathEndAngles for ticks

db033e4

flake8 for new markers code

7e41bf5

brunobeltran added 3 commits March 3, 2020 10:53

factor marker bbox code to be within MarkerStyles

f1014b5

bugfix, forgot self in MarkerStyle.get_centered_bbox

42cc5db

misc bugfixes after factoring get_centered_bbox

8fcb223

brunobeltran added 4 commits March 3, 2020 11:36

markers bbox code visually tested, now works

d95e4d8

flake8 for new markers bbox code

4592cda

fixed formula for miter marker bbox, bevel broke

c08826e

bugfix caret bbox calculation, incorrect angles

7f9db16

brunobeltran mentioned this pull request Mar 4, 2020

Sizes of different markers are not perceptually uniform #15703

Open

brunobeltran added 2 commits March 4, 2020 07:38

fixed star tip angle in marker bbox calculation

1805dc7

test marker bbox. failing here, pass in jupyter

a13598d

brunobeltran added 2 commits March 4, 2020 13:46

bugfix so markers bbox api stays in pts units

0f7300a

forgot to push new test references images up

d6f1571

brunobeltran added 5 commits March 6, 2020 14:58

cleanup variable name consistency

2bee048

use conversion not magic nums for line get_extents

66694a0

iter_curves: iterate over path more conveniently

c4a45de

helper functions for bezier curve zeros/tangents

c5bdd8d

CornerInfo in bezier.py, should be path.py

41f268e

brunobeltran mentioned this pull request Mar 10, 2020

Error in Agg backend's PNG renderer when markeredgewidth > markersize #16621

Closed

brunobeltran added 3 commits March 10, 2020 02:12

update marker bbox to work for arbitrary paths

637a7f2

bugfix, new marker bbox code now runs, untested

5ac9114

pyflake fixes for marker bbox code

ef36ec2

brunobeltran added 6 commits March 11, 2020 18:22

cleanup path/bezier to prevent import triangle

82e3c12

fix prev commit, make split_in_out method of Path

8585eca

generalized path bbox code tested on some markers

54e3a3e

path bbox now works for all markers but "pixel"

b390653

reorg'd path/bezier code now builds docs no errors

85a3050

cleanup docstrings of stroked path bbox code

79aa3b7

brunobeltran mentioned this pull request Mar 12, 2020

Various issues with miter limits #9830

Open

fixed sphinx warnings in path.py's docstrings

1ea37be

This was referenced Mar 17, 2020

ENH: Allow for non-normalized and transformed markers #16773

Closed

Bezier/Path API Cleanup: fix circular import issue #16812

Merged

This was referenced Apr 17, 2020

Fix clipping of markers in PDF backend. #17163

Merged

Stroked path width #17198

Draft

Line2d window extents #17199

Draft

brunobeltran closed this Apr 20, 2020

	extra_pts = self._markersize + self._markeredgewidth
	extra_pts = self._markersize + self._markeredgewidth / 2

Uh oh!

fix tightbbox to account for markeredgewidth #16607

fix tightbbox to account for markeredgewidth #16607

Uh oh!

Conversation

brunobeltran commented Feb 29, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Summary

PR Checklist

Uh oh!

timhoffm left a comment

Choose a reason for hiding this comment

Uh oh!

timhoffm Feb 29, 2020

Choose a reason for hiding this comment

Uh oh!

brunobeltran Mar 3, 2020

Choose a reason for hiding this comment

Uh oh!

brunobeltran commented Mar 3, 2020

Uh oh!

brunobeltran commented Mar 3, 2020

Uh oh!

brunobeltran commented Mar 4, 2020

Uh oh!

brunobeltran commented Mar 4, 2020

Uh oh!

timhoffm commented Mar 4, 2020 • edited by QuLogic Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

brunobeltran commented Mar 4, 2020

Uh oh!

ImportanceOfBeingErnest commented Mar 4, 2020

Uh oh!

brunobeltran commented Mar 4, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ImportanceOfBeingErnest commented Mar 5, 2020

Uh oh!

jklymak commented Mar 5, 2020

Uh oh!

anntzer commented Mar 6, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

brunobeltran commented Mar 10, 2020

Uh oh!

anntzer commented Mar 11, 2020

Uh oh!

timhoffm commented Mar 12, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

anntzer commented Mar 12, 2020

Uh oh!

brunobeltran commented Mar 12, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

brunobeltran commented Apr 20, 2020

Uh oh!

Uh oh!

brunobeltran commented Feb 29, 2020 •

edited

Loading

timhoffm commented Mar 4, 2020 •

edited by QuLogic

Loading

brunobeltran commented Mar 4, 2020 •

edited

Loading

anntzer commented Mar 6, 2020 •

edited

Loading

timhoffm commented Mar 12, 2020 •

edited

Loading

brunobeltran commented Mar 12, 2020 •

edited

Loading