FIX always expose best_loss_, validation_scores_, and best_validation_score #24683
Conversation
My only comment is about English. What I can't judge is whether the idea of only storing a value in the attributes depending on whether early stopping is used makes sense. Given that the values aren't calculated, I assume this was thought about in previous PRs. How are type changes (Python list to NumPy array) handled in terms of deprecation?
One issue is that we cannot document it otherwise. It makes the API more consistent with
I am not sure we need to handle it. An array would behave like a Python list.
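To illustrate that point, a quick sketch (with made-up score values): for read-only access a NumPy array behaves much like a Python list, so the type change is largely transparent; only in-place mutation differs.

```python
import numpy as np

# Hypothetical validation scores, as an MLP might record them.
scores_list = [0.8, 0.85, 0.9]
scores_arr = np.asarray(scores_list)

# Read-only access patterns are identical for list and ndarray:
assert len(scores_arr) == len(scores_list)
assert scores_arr[-1] == scores_list[-1]
assert max(scores_arr) == max(scores_list)

# Mutation differs: lists grow in place, ndarrays have no append().
scores_list.append(0.95)
assert not hasattr(scores_arr, "append")
```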
Co-authored-by: Tim Head <betatim@gmail.com>
Does this need a
Thank you for the PR!
doc/whats_new/v1.2.rst (outdated):

    `validation_scores_`, and `best_validation_score_`. `best_loss_` is set to
    `None` when `early_stopping=True`, while `validation_scores_` and
    `best_validation_score_` are set to `None` when `early_stopping=False`.
    `validation_scores_` also changed from a Python list to a NumPy array.
I do not think we can make `validation_scores_` an ndarray because it breaks `warm_start`:

```python
from sklearn.neural_network import MLPRegressor
from sklearn.datasets import make_regression

X, y = make_regression(random_state=42)
mlp = MLPRegressor(max_iter=10, random_state=0, warm_start=True, early_stopping=True)
mlp.fit(X, y)
mlp.set_params(max_iter=20)
# Fails with this PR
mlp.fit(X, y)
```
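A minimal sketch of why the conversion breaks `warm_start` (simplified, not scikit-learn's actual code), assuming the estimator appends new scores to the stored container on a warm-started fit:

```python
import numpy as np

# First fit: scores are collected in a Python list during training.
validation_scores = [0.70, 0.75]

# If fit() converted the list to an ndarray before storing it...
validation_scores = np.asarray(validation_scores)

# ...a warm-started second fit that appends new scores would fail:
try:
    validation_scores.append(0.80)
except AttributeError as exc:
    print(f"warm_start breaks: {exc}")
```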
Good catch. I will revert this and add a non-regression test as well.
Minor comment on testing, otherwise LGTM.
```python
mlp.fit(X_iris, y_iris)
mlp.set_params(max_iter=20)
mlp.fit(X_iris, y_iris)
```
Can we check the length of `validation_scores_` to make sure it is updated?
I think that there is an underlying bug: with `max_iter=20`, we end up with `n_iter_ == 30` for the regressor. I need to investigate, but it could be fixed in another PR.
Isn't this because there were ten (original) + twenty (warm start) iterations?
Not sure yet. I have to go into the details; I have trouble understanding the semantics there. Having `n_iter_ > max_iter` is surprising. Calling `fit` should just mean that you restart the full process from scratch, only not from a random init. The above behaviour would be what I expect from `partial_fit`, because in that case I am starting from a trained model and incrementally learning.
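The two interpretations can be contrasted with a small sketch (a hypothetical helper, not scikit-learn code):

```python
def reported_n_iter(prev_n_iter, max_iter, cumulative):
    """Return n_iter_ after a warm-started fit under each interpretation."""
    if cumulative:
        # partial_fit-like semantics: iterations accumulate across fits
        return prev_n_iter + max_iter
    # restart-from-previous-weights semantics: max_iter caps the whole call
    return max_iter

# First fit with max_iter=10, then a warm-started fit with max_iter=20:
assert reported_n_iter(10, 20, cumulative=True) == 30   # the observed n_iter_
assert reported_n_iter(10, 20, cumulative=False) == 20  # what one might expect
```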
My expectation came from `GradientBoostingClassifier`'s `warm_start`. It doesn't have a `partial_fit()`, so calling `fit()` is the only game in town.
So maybe at the very least there is room for improving the consistency?
We should just make it clear what to expect. But we can defer that to the issue that I opened: #24764
LGTM. Thank you, @glemaitre.
I just have one minor remark, which duplicates one already made above.
Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>
The first step towards #24411.

`MLPClassifier` and `MLPRegressor` are inconsistent in not exposing some attributes related to early stopping; as a result, those attributes do not appear in the documentation. This PR sets them to `None` when they are not relevant and documents the values stored in those arrays.

In addition, `validation_scores_` is changed from a Python list to a NumPy array, which is more consistent with other estimators.
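A short sketch of the behaviour described in this PR (the `None` semantics apply only to scikit-learn versions that include this change):

```python
from sklearn.datasets import make_regression
from sklearn.neural_network import MLPRegressor

X, y = make_regression(random_state=42)

# With early stopping, validation scores are tracked per iteration,
# and best_loss_ is set to None (it is not the selection criterion).
mlp = MLPRegressor(max_iter=10, early_stopping=True, random_state=0)
mlp.fit(X, y)
print(mlp.best_validation_score_)
print(len(mlp.validation_scores_))

# Without early stopping, the validation attributes are None instead,
# and best_loss_ holds the minimum training loss.
mlp = MLPRegressor(max_iter=10, early_stopping=False, random_state=0)
mlp.fit(X, y)
print(mlp.best_loss_)
```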