FIX Extract estimator objects before aggregating dict of scores #17745

alfaro96 · 2020-06-26T13:57:35Z

Reference Issues/PRs

Coming from #17332.

What does this implement/fix? Explain your changes.

This PR extracts the estimator objects for each cross-validation split before calling _aggregate_score_dicts in the cross_validate function, so that nested estimators are not converted to arrays.
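As a rough illustration of the failure mode (not the actual scikit-learn code), the sketch below uses a hypothetical `ToyPipeline` class that, like a pipeline, defines `__len__` and `__getitem__`. NumPy treats such objects as sequences, so `np.asarray` unpacks them instead of storing the objects themselves; keeping them in a plain list avoids this.

```python
import numpy as np


class ToyPipeline:
    """Hypothetical stand-in for a sequence-like estimator (an assumption,
    not a scikit-learn class)."""

    def __init__(self, steps):
        self.steps = list(steps)

    def __len__(self):
        return len(self.steps)

    def __getitem__(self, index):
        # Raises IndexError past the last step, which ends iteration.
        return self.steps[index]


# One "fitted estimator" per cross-validation split.
fitted = [ToyPipeline(["scaler", "clf"]) for _ in range(3)]

# NumPy sees three sequences of length two and builds a 2-D array of
# their contents, so the ToyPipeline objects themselves are lost.
as_array = np.asarray(fitted)

# A plain list keeps the original objects untouched.
as_list = list(fitted)
```

Here `as_array` has one row per split and one column per step, whereas `as_list` still holds the three `ToyPipeline` instances.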

Any other comments?

WDYT @thomasjpfan about this solution?

Any other ideas for solving this issue?

glemaitre · 2020-06-26T14:07:32Z

You are super-efficient. You opened a PR before I even found out that the CI was broken :)

glemaitre · 2020-06-26T14:16:33Z

We should probably have a test that triggers this usage. I am thinking that maybe it is the job of the _aggregate_* function to make sure that not everything is converted to an array (but I have to look into the details more).

alfaro96 · 2020-06-26T14:18:47Z

> You are super-efficient. You opened a PR before I even found out that the CI was broken :)

I was looking at the failures in the CRON jobs and realized that the documentation build was failing. Since I love keeping things working properly, I proposed the PR :)

sklearn/model_selection/_validation.py

sklearn/model_selection/tests/test_validation.py

glemaitre

LGTM. Maybe @thomasjpfan wants to have a look

glemaitre · 2020-06-26T16:02:47Z

So maybe your solution is better then :)

thomasjpfan · 2020-06-26T17:26:32Z

sklearn/model_selection/_validation.py

+        if key.endswith(("time", "score"))
+        else [score[key] for score in scores]


I would prefer to dispatch based on the type:

    return {
        key: np.asarray([score[key] for score in scores])
        if isinstance(scores[0][key], numbers.Number)
        else [score[key] for score in scores]
        for key in scores[0]
    }

and then update the docstring:

Aggregate a list of dicts to a dict of ndarray or list.

WDYT @glemaitre?

I made the change and merged so that PRs are not failing. We can have a followup PR if we want to go back to checking the key.

Hmm... the only recent PR that was blocked was #17743
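The type-based dispatch suggested in this thread can be sketched as a standalone function. This is a minimal sketch, not the merged implementation: the name `aggregate_score_dicts`, the `ToyEstimator` class, and the sample values are assumptions made for illustration.

```python
import numbers

import numpy as np


def aggregate_score_dicts(scores):
    """Aggregate a list of dicts to a dict of ndarray or list."""
    return {
        # Numeric entries (times, scores) become arrays; anything else,
        # such as fitted estimators, stays in a plain list.
        key: np.asarray([score[key] for score in scores])
        if isinstance(scores[0][key], numbers.Number)
        else [score[key] for score in scores]
        for key in scores[0]
    }


class ToyEstimator:
    """Hypothetical sequence-like estimator used only for this example."""

    def __init__(self, steps):
        self.steps = list(steps)

    def __len__(self):
        return len(self.steps)

    def __getitem__(self, index):
        return self.steps[index]


# Illustrative per-split results (made-up numbers).
scores = [
    {"fit_time": 0.1, "test_score": 0.9, "estimator": ToyEstimator(["a", "b"])},
    {"fit_time": 0.2, "test_score": 0.8, "estimator": ToyEstimator(["a", "b"])},
]

aggregated = aggregate_score_dicts(scores)
```

Dispatching on the value type rather than on the key name means a new non-numeric entry is kept as a list without having to follow a naming convention such as ending in "time" or "score".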

thomasjpfan · 2020-06-27T01:16:10Z

Thank you @alfaro96 for being so quick with the fix!

alfaro96 · 2020-06-27T06:34:39Z

Thanks @thomasjpfan and @glemaitre for taking care and finalizing this PR!

alfaro96 added 2 commits June 26, 2020 15:53

FIX Extract estimator objects before aggregating dict of scores [doc build]

2481c11

Minor changes [doc build]

9c0d38d

github-actions bot added the module:model_selection label Jun 26, 2020

glemaitre reviewed Jun 26, 2020

sklearn/model_selection/_validation.py

CLN Apply suggested changes [doc build]

9a0c3ac

alfaro96 requested a review from glemaitre June 26, 2020 14:57

glemaitre reviewed Jun 26, 2020

sklearn/model_selection/tests/test_validation.py

glemaitre approved these changes Jun 26, 2020

MNT Link PR to the test [doc build]

f7e5dfb

maikia mentioned this pull request Jun 26, 2020

MRG Deprecates 'normalize' in LinearRegression (_base.py) #17743

Merged

thomasjpfan reviewed Jun 26, 2020

thomasjpfan added 2 commits June 26, 2020 19:47

CLN Minor adjustments

615e619

BLD [doc build]

aee62b0

thomasjpfan approved these changes Jun 27, 2020

thomasjpfan merged commit 0e33229 into scikit-learn:master Jun 27, 2020

alfaro96 deleted the fix_named_steps branch June 28, 2020 08:47

glemaitre pushed a commit to glemaitre/scikit-learn that referenced this pull request Jul 17, 2020

FIX Extract estimator objects before aggregating dict of scores (scikit-learn#17745) Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

59d6f74

jayzed82 pushed a commit to jayzed82/scikit-learn that referenced this pull request Oct 22, 2020

FIX Extract estimator objects before aggregating dict of scores (scikit-learn#17745) Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

5b1b984
