[MRG] ENH Improves on the plotting api by storing weak ref to display in axes #15702

thomasjpfan · 2019-11-21T23:32:32Z

Reference Issues/PRs

Resolves #15581 (better)

What does this implement/fix? Explain your changes.

After a discussion with @tacaswell, it was concluded that adding attribute to the axes would be solution to figuring out

This allows use cases in #15581 to work:

fig, ax = plt.subplots(figsize=(10, 6))
tree_disp = plot_partial_dependence(tree, X, ["LSTAT"], ax=ax)
mlp_disp = plot_partial_dependence(mlp, X, ["LSTAT"], ax=ax,
                                   line_kw={"c": "red"})

Any other comments?

Storing the display object in the axes so it can be referenced later is a small hack, but it improves the API quite a bit.

CC @tacaswell @amueller @NicolasHug @glemaitre @qinhanmin2014

NicolasHug · 2019-11-21T23:36:56Z

Does this also allow to pass a ax that was drawn on before? Or will it be cleared like currently in master?

thomasjpfan · 2019-11-21T23:43:19Z

For this specific case, if the object was drawn on by a PartialDependenceDisplay object, it will not error as shown above.

If it was drawn on by a normal plot function, it will error. For example:

fig, ax = pyplot.subplots()
ax.plot([0, 1, 2], [1, 2, 3])
plot_partial_dependence(..., ax=ax)

NicolasHug · 2019-11-21T23:46:26Z

Is it reasonable to store the display object as an attribute of the ax?

The display objects stores all the data which can get pretty big. You would expect that the data can be GCed if the Display object isn't there anymore.

But with this PR, the data exists as long as ax exists.

NicolasHug · 2019-11-21T23:49:29Z

If we can, it would be ideal to also support that case

fig, ax = pyplot.subplots()
ax.plot([0, 1, 2], [1, 2, 3])
plot_partial_dependence(..., ax=ax)

This is typically useful if you want to plot the data points and then the PDPs on top, as done in #15582 (comment)

thomasjpfan · 2019-11-21T23:51:22Z

Yes I have to make sure the axes is a weak reference.

thomasjpfan · 2019-11-21T23:51:38Z

As in the axes has a weak reference to the display object.

NicolasHug

Some comments.

Sorry if I'm missing something but I don't see where the weak ref is being created? (this should also be tested)

NicolasHug · 2019-11-21T23:55:46Z

sklearn/utils/_plot.py

+    for used_attr in used_attrs:
+        if hasattr(ax, used_attr) and getattr(ax, used_attr):


is this equivalent to simply if any(getattr(...., None) for attr in used_attrs)?

NicolasHug · 2019-11-21T23:56:28Z

sklearn/utils/_plot.py

+    """Return true if the axes has been used"""
+    used_attrs = ['lines', 'patches', 'texts', 'tables', 'artists',
+                  'tables', 'images']
+    msg = "The ax was already used in another plot function"


a matplotlib plot function or one of scikit-learn's? (same for docstring)

NicolasHug · 2019-11-21T23:58:33Z

sklearn/inspection/_partial_dependence.py

+            if hasattr(ax, "_sklearn_display_object"):
+                if not isinstance(ax._sklearn_display_object, self.__class__):
+                    raise ValueError("The ax was already used by another "
+                                     "display object")


I think we should say "by another display object which is not an instance of self.class.name"

thomasjpfan · 2019-11-22T00:03:51Z

Sorry if I'm missing something but I don't see where the weak ref is being created? (this should also be tested)

I was just thinking about the weakref on the train. I am going to move this back to WIP for a little bit.

thank you for the early reviews.

thomasjpfan · 2019-11-22T16:56:49Z

If we can, it would be ideal to also support that case
fig, ax = pyplot.subplots()
ax.plot([0, 1, 2], [1, 2, 3])
plot_partial_dependence(..., ax=ax)
This is typically useful if you want to plot the data points and then the PDPs on top, as done in #15582 (comment)

Supporting this can run into an unclear API. For example:

fig, ax = pyplot.subplots()
ax.plot([0, 1, 2], [1, 2, 3])

plot_partial_dependence(est, features=[0, 1, 2], ax=ax)

For this case, I do not think a user would want the first call to plot to be used in all 3 partial dependence plots.

The only case where this can make sense is when there is only one feature:

fig, ax = pyplot.subplots()
ax.plot([0, 1, 2], [1, 2, 3])

plot_partial_dependence(est, features=[0], ax=ax)

I am -0.2 on this. It will lead to another code path to handle len(features)==1 and isinstance(ax, plt.Axes).

NicolasHug · 2019-12-01T18:25:02Z

Please ping when ready @thomasjpfan .

Also, I was looking at pandas internal: when you pass ax to df.plot() but they need to clear the figure, they raise a UserWarning. Maybe we should do that too to avoid surprises. To reproduce:

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

d = {'a': np.arange(10),
     'b': np.arange(10) + 23 * 4}

fig, ax = plt.subplots()

df = pd.DataFrame(d)
df.plot(subplots=True, ax=ax)  # ask for subplots but also for a single ax => warning
plt.show()

thomasjpfan · 2019-12-02T02:47:21Z

@NicolasHug This is ready for another review.

I slightly prefer the current behavior where we do not clear the axes and raise an error if the axes has been used. If we relax this restriction and automatically clear the axes with a warning, there will be code out there that depends on this behavior.

NicolasHug · 2019-12-02T13:48:28Z

please refresh my memory: if I pass an array-like as ax, with stuff already drawn-on, do we raise an error?

In retrospect I'm not even sure we want to raise errors/warnings.

thomasjpfan · 2019-12-02T20:22:25Z

If there as an array like of ax with stuff already drawn on, it will raise an error if the number of axes in the array like is not the expected number of axes. If it has the expected number of axes, then they will be drawn on directly.

#15760 provides a better solution to how plot_partial_dependence checks the axes.

…ndence_tom

thomasjpfan added 5 commits November 21, 2019 16:06

ENH Better axes api

42f81c4

REV Less diffs

c50c2b9

REV Less diffs

c230f35

TST Adds more tests

2ca60b5

STY Less lines

d202585

NicolasHug reviewed Nov 22, 2019

View reviewed changes

thomasjpfan changed the title ~~[MRG] ENH Improves on the plotting api when dealing with multiple plots~~ [WIP] ENH Improves on the plotting api when dealing with multiple plots Nov 22, 2019

CLN Adds another helper for weakrefs

06eb3cb

thomasjpfan changed the title ~~[WIP] ENH Improves on the plotting api when dealing with multiple plots~~ [MRG] ENH Improves on the plotting api when dealing with multiple plots Nov 22, 2019

TST Adds gc.collect for python 3.5

1c1ef1b

DOC Adds comment

943ce7b

thomasjpfan changed the title ~~[MRG] ENH Improves on the plotting api when dealing with multiple plots~~ [MRG] ENH Improves on the plotting api by storing weak ref to display in axes Dec 5, 2019

github-actions bot added module:inspection module:utils labels Mar 2, 2020

thomasjpfan mentioned this pull request Aug 13, 2020

ENH Add CalibrationDisplay plotting class #17443

Merged

thomasjpfan added 3 commits August 13, 2020 20:27

Merge remote-tracking branch 'upstream/master' into plot_partial_depe…

00e0ade

…ndence_tom

FIX Updates new files

2e0dd6a

Merge remote-tracking branch 'upstream/master' into plot_partial_depe…

d71d155

…ndence_tom

Base automatically changed from master to main January 22, 2021 10:51

		for used_attr in used_attrs:
		if hasattr(ax, used_attr) and getattr(ax, used_attr):

Uh oh!

[MRG] ENH Improves on the plotting api by storing weak ref to display in axes #15702

Are you sure you want to change the base?

[MRG] ENH Improves on the plotting api by storing weak ref to display in axes #15702

Uh oh!

Conversation

thomasjpfan commented Nov 21, 2019

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

NicolasHug commented Nov 21, 2019

Uh oh!

thomasjpfan commented Nov 21, 2019

Uh oh!

NicolasHug commented Nov 21, 2019

Uh oh!

NicolasHug commented Nov 21, 2019

Uh oh!

thomasjpfan commented Nov 21, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thomasjpfan commented Nov 21, 2019

Uh oh!

NicolasHug left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

NicolasHug Nov 21, 2019

Choose a reason for hiding this comment

Uh oh!

NicolasHug Nov 21, 2019

Choose a reason for hiding this comment

Uh oh!

NicolasHug Nov 21, 2019

Choose a reason for hiding this comment

Uh oh!

thomasjpfan commented Nov 22, 2019

Uh oh!

thomasjpfan commented Nov 22, 2019

Uh oh!

NicolasHug commented Dec 1, 2019

Uh oh!

thomasjpfan commented Dec 2, 2019

Uh oh!

NicolasHug commented Dec 2, 2019

Uh oh!

thomasjpfan commented Dec 2, 2019

Uh oh!

Uh oh!

thomasjpfan commented Nov 21, 2019 •

edited

Loading

NicolasHug left a comment •

edited

Loading