FEA Callbacks base infrastructure + progress bars #27663
base: callbacks

Conversation
LGTM on my side. @rth, please ignore all the lock file changes; they don't require any attention.
Very sorry for the response time @jeremiedbb. Hopefully I have fixed my email filters to see mentions from scikit-learn.

I read it through once, and the main thing I don't understand is how you compute the `levels` dict. I would have expected `build_computation_tree` to build the computation tree, but it actually takes `levels` as a parameter, which is already some form of that tree.

Say you have a ColumnTransformer with 2 pipelines inside, the first with 2 steps, the other with 3 steps. How do you compute the computation tree in that case?
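For concreteness, here is a sketch of the kind of `levels` input I mean, using the `descr`/`max_iter` keys documented later in the diff (the values are made up for illustration, not taken from this PR):

```python
# Hypothetical levels description for a single estimator: an outer fit
# loop of 10 iterations whose inner level is a leaf (max_iter=None).
# This is roughly the shape build_computation_tree seems to expect.
levels = [
    {"descr": "fit", "max_iter": 10},
    {"descr": "iteration", "max_iter": None},
]
```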
sklearn/base.py
Outdated
@@ -115,6 +116,10 @@ def _clone_parametrized(estimator, *, safe=True):

    params_set = new_object.get_params(deep=False)

    # attach callbacks to the new estimator
    if hasattr(estimator, "_callbacks"):
BTW, this might break something for https://github.com/microsoft/FLAML/blob/0415638dd1e1d3149fb17fb8760520af975d16f6/flaml/automl/model.py#L1587, which also adds this attribute to scikit-learn's base estimator in their library. But there are no reserved private namespaces, so they could probably adapt if it does.
This can also break quite easily if the callback object keeps references to attributes of the old estimator. Why aren't we creating a copy here?
> BTW, this might break something for https://github.com/microsoft/FLAML/blob/0415638dd1e1d3149fb17fb8760520af975d16f6/flaml/automl/model.py#L1587 which also adds this attribute to scikit-learn's base estimator in their library

I can change the name to `_sk_callbacks` or any derivative of that.
> Why aren't we creating a copy here?

The motivation is a use case like this:

```python
monitoring = Monitoring()
lr = LogisticRegression()._set_callbacks(monitoring)
GridSearchCV(lr, param_grid).fit(X, y)
monitoring.plot()
```

The monitoring callback will gather information across all copies of the logistic regression made in the grid search. If we made a copy of the callback in `clone`, then we couldn't retrieve any information once the grid search is finished.
The object itself can disable copying by implementing `__copy__` and `__deepcopy__`; then they'd be in the space of "we know what we're doing" and we don't need to worry about it.

I think the kind of state you're referring to here is something which can live outside the callback object, like a file / a database / an external singleton object: the callback method just writes into that storage, and at the end one can use that data to plot/investigate/etc.
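A minimal sketch of that opt-out, with a hypothetical callback class (the names here are assumptions for illustration, not code from this PR):

```python
import copy

class SharedStateCallback:
    # Hypothetical callback that opts out of copying, so that every
    # clone of the estimator keeps a reference to the same instance.
    def __init__(self):
        self.records = []

    def __copy__(self):
        return self

    def __deepcopy__(self, memo):
        return self

cb = SharedStateCallback()
assert copy.deepcopy(cb) is cb  # the "copy" is the same object
```

Note this only helps for in-process copies; clones made in subprocesses go through pickling and would still end up as separate objects.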
> I think the kind of state you're referring to here is something which can live outside the callback object, like a file / a database / an external singleton object

Good point, and actually in my latest design of the callbacks I don't rely on a shared state, so you can ignore my previous comment :) We can't get around copies anyway, since the clones can happen in subprocesses (in a grid search for instance). I updated the PR to clone the callbacks like any other param.
sklearn/base.py
Outdated
    """
    if hasattr(sub_estimator, "_callbacks") and any(
        callback.auto_propagate for callback in sub_estimator._callbacks
    ):
What happens if you call this method twice? Wouldn't this be raised on the second run? If so, it feels a bit fragile. What's the harm of not checking this?
It should never be called twice: the meta-estimator only propagates its callbacks once to its sub-estimator(s).

This check is important to give an informative error message if a user tries something like:

```python
lr = LogisticRegression()._set_callbacks(ProgressBar)
GridSearchCV(lr, param_grid)
```

It would crash without telling the user the right way to do it.
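Presumably the intended usage is then to set auto-propagated callbacks on the meta-estimator itself, along these lines (a sketch based on the `_set_callbacks` API shown above, not a confirmed snippet from this PR):

```python
# Set the callback on the meta-estimator; it is propagated once to the
# sub-estimators during fit.
search = GridSearchCV(LogisticRegression(), param_grid)._set_callbacks(ProgressBar())
search.fit(X, y)
```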
- descr: str
    A description of the level
- max_iter: int or None
    The number of its children. None means it's a leaf.
Wait, so `max_iter` has nothing to do with the number of iterations of an algorithm. Why is it called that way then?

Also, if it's expected to have only a few keys, typing-wise it might have been better to make it a dataclass or something rather than a dict, but no strong opinion.
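For illustration, the suggested typed alternative could look like this (a sketch; `ComputationLevel` is a hypothetical name, not code from this PR):

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class ComputationLevel:
    descr: str                      # description of the level
    max_iter: Optional[int] = None  # number of children; None means leaf
```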
A very shallow review.
    try:
        return fit_method(estimator, *args, **kwargs)
    finally:
        estimator._eval_callbacks_on_fit_end()
This is becoming larger than just a validation wrapper. We could simplify the debugging and the magic by having a `BaseEstimator.fit` which calls `self._fit(...)` and does all the common stuff before and after. That seems a lot easier to understand and debug.
The motivation when I introduced `_fit_context` was not only validation but having a generic context manager to handle everything we need to do before and after fit. That's why I gave it this generic name.

Although having a consistent framework where we'd have a `BaseEstimator.fit` and every child estimator implements `_fit` is appealing, I think it goes far beyond the scope of this PR and would require rewriting a lot of estimators.
Btw, do you know why `BaseEstimator` does not implement `fit` in the first place?

Also, note that `_fit_context` also handles `partial_fit`, but I don't think we want `BaseEstimator` to implement `partial_fit`.
`BaseEstimator` doesn't implement `fit` because we don't generally have methods which raise `NotImplementedError`; they're simply not there. But now that we have all this work, we can certainly have it in `BaseEstimator`, and have children only implement a `__sklearn_fit__` kind of method instead.
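A rough sketch of that proposed pattern, to make the idea concrete (apart from `_eval_callbacks_on_fit_end`, which appears in this PR, the hook names are assumptions for illustration, not the actual implementation):

```python
class BaseEstimator:
    def _eval_callbacks_on_fit_begin(self):
        ...  # hypothetical pre-fit hook

    def _eval_callbacks_on_fit_end(self):
        ...  # post-fit hook, as used by _fit_context in this PR

    def fit(self, X, y=None):
        # Common pre-fit work (validation, callback setup, ...) lives here.
        self._eval_callbacks_on_fit_begin()
        try:
            # Children implement only the estimator-specific part.
            return self.__sklearn_fit__(X, y)
        finally:
            # Common post-fit work, mirroring what _fit_context does today.
            self._eval_callbacks_on_fit_end()

class MyEstimator(BaseEstimator):
    def __sklearn_fit__(self, X, y=None):
        ...  # estimator-specific fitting logic
        return self
```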
I still think it's outside the scope of this PR. Using the existing context manager is just a 1-line addition, whereas implementing `__sklearn_fit__` means countless PRs :)
+1 to merge this. I personally find it would be more valuable to mark it as experimental (it's private anyway so far), let users use this for a version or two, aggregate their feedback, and iterate on the details in future versions if needed. Overall the API sounds reasonable to me. That seems better than approving a SLEP on this, only to realize afterwards that users would have preferred something else, or that there are some weird edge cases for some estimator that need special handling.
Note that this PR is targeting the `callbacks` branch.

I agree, and we already discussed that with @glemaitre. I plan to do that in a follow-up PR.
I still need to figure out the issue with the CI though.

Any news on this being merged?

We are currently working on the design and need an agreement among core developers.

Any news on this? It would be amazing if this were added.
Extracted from #22000
This PR implements a smaller portion of #22000, with only the base infrastructure for callbacks and the implementation of a single callback (progress bars). It targets the `callbacks` branch, and the goal is to have the full callback implementation done in several smaller PRs merged into this branch before we merge it into main.

This PR proposes significant changes compared to #22000 that should imo greatly improve its chances of getting merged 😄 The main improvement is that it no longer requires writing to disk, but instead relies on `multiprocessing.Manager`s and queues, which simplifies the code a lot.

In #22000 I adapted some estimators to work with the callbacks, which I did not include here to keep the PR as light as possible. You can however experiment with the callbacks on the estimators that I wrote for testing purposes. You can add `sleep` calls in these testing estimators to simulate how it changes when the computations take longer.

The plan is then to have several PRs to implement the other callbacks, adapt a few estimators to work with the callbacks, add some documentation and examples, add more tests...