
[MRG] Improve multi-metric scorer speed #10979





Closed
wants to merge 2 commits

Conversation

gamazeps
Contributor

Reference Issues/PRs

Works on improving #10802

What does this implement/fix? Explain your changes.

Previously, multi-metric scoring called the `predict` method of an
estimator once for each scorer, which could lead to drastic increases in
cost.

This change avoids calling the scorers directly and instead calls each
scorer with the already-computed predictions.

This is only done for `_PredictScorer` and `_ProbaScorer` objects generated with
`make_scorer`; this means that `_ThresholdScorer` and scorers not
generated with `make_scorer` do not benefit from this change.
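
As a rough illustration of the idea (this is not the actual code in the PR; `multimetric_score_sketch` and the `metrics` dict are made-up names):

```python
def multimetric_score_sketch(estimator, X_test, y_test, metrics):
    """Score several (y_true, y_pred) metrics with a single predict call."""
    y_pred = estimator.predict(X_test)  # computed once, not once per metric
    return {name: metric(y_test, y_pred) for name, metric in metrics.items()}

# Usage sketch:
# from sklearn.metrics import accuracy_score, f1_score
# scores = multimetric_score_sketch(
#     clf, X_test, y_test,
#     {"accuracy": accuracy_score,
#      "f1_macro": lambda yt, yp: f1_score(yt, yp, average="macro")})
```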

Any other comments?

This implements a solution proposed by @jimmywan, and the approach seemed OK to @jnothman. Another solution would have been to force the estimator to cache its `predict`, `predict_proba` and `decision_function` methods.

@gamazeps gamazeps changed the title Scorer [WIP] Improve multi-metric scorer speed Apr 14, 2018
@gamazeps gamazeps force-pushed the scorer branch 3 times, most recently from 65bbb14 to 9459bf9 on April 15, 2018 at 19:39
@gamazeps gamazeps changed the title [WIP] Improve multi-metric scorer speed [MRG] Improve multi-metric scorer speed Apr 15, 2018
@gamazeps
Contributor Author

ping @jnothman ?

Member

@jnothman jnothman left a comment


I'd tweak this a bit. Firstly, let's rename score_predict to score_predictions or score_precomputed. Secondly, let's add a staticmethod to these scorers called precompute_predictions.

In _multimetric_score we can then do something like:

```python
precomputed = {}
for name, scorer in scorers.items():
    if hasattr(scorer, 'score_precomputed'):
        func = scorer.precompute_predictions
        if func not in precomputed:
            precomputed[func] = func(X_test)
        score = scorer.score_precomputed(precomputed[func], y_test)
    else:
        score = scorer(X_test, y_test)

    ...
```

Do you think this would be much harder to understand? It can still call the prediction functions more times than memoising would.

```python
def _is_predict(x):
    return isinstance(x, _PredictScorer)

# We want to keep the memmap and score types in a single
```
Member


I don't know what you mean by the memmap here.

Contributor Author


In the loop at https://github.com/scikit-learn/scikit-learn/pull/10979/files#diff-60033c11a662f460e1567effd5faa6f0R625

The scores are checked for being scalars and are unwrapped if they are memmapped.

tmp_scores contains the scores before they are processed this way (it avoids repeating those checks for each type of scorer).
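
Roughly, that post-processing looks like this (sketch only, not the exact diff; it assumes `tmp_scores` maps scorer names to the raw values returned by each scorer):

```python
import numbers

scores = {}
for name, score in tmp_scores.items():
    if hasattr(score, 'item'):
        try:
            score = score.item()  # unwrap a memmapped / 0-d numpy scalar
        except ValueError:
            pass                  # non-scalar array: leave it untouched
    if not isinstance(score, numbers.Number):
        raise ValueError("scoring must return a number, got %r (%s)"
                         % (score, type(score)))
    scores[name] = score
```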

Member


Ah, I now see what you mean. I think this makes sense to you as the one changing the code, but it would be hard for the reader to follow what you're referring to in this comment. It's better without the comment.

@jnothman
Member

And thanks for the ping

@glemaitre
Member

@gamebusterz
Usually we use the signature func(y_true, y_pred), and I see that score_predict does not follow it. Could you exchange the arguments so it matches?
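
For reference, that is the argument order metric functions passed to make_scorer already follow (minimal illustrative example; my_metric is a made-up toy metric):

```python
import numpy as np
from sklearn.metrics import make_scorer

def my_metric(y_true, y_pred):
    """Toy metric: fraction of exact matches, in (y_true, y_pred) order."""
    return np.mean(np.asarray(y_true) == np.asarray(y_pred))

scorer = make_scorer(my_metric)  # the resulting scorer is called as scorer(estimator, X, y)
```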

@glemaitre
Member

I'd tweak this a bit. Firstly let's rename score_predict to score_predictions or score_precomputed. Secondly, let's add a staticmethod to these scorers called precompute_predictions.

I agree with @jnothman's suggestion. Changing the name will make it more explicit.

Previously, multi-metric scoring called the `predict` method of an
estimator once for each scorer, which could lead to drastic increases in
cost.

This change avoids calling the scorers directly and instead calls each
scorer with the already-computed predictions.

This is only done for `_PredictScorer` and `_ProbaScorer` objects generated with
`make_scorer`; this means that `_ThresholdScorer` and scorers not
generated with `make_scorer` do not benefit from this change.

Works on improving scikit-learn#10802
@gamazeps
Contributor Author

@glemaitre Just did the changes (I forgot to push them last week...).

Regarding the static method, I am not convinced it is the best way to go.
It feels to me like a pretty roundabout fix for the lack of caching in the `predict` method of estimators;
it seems healthier for the codebase to invest time in adding real caching instead of working around it.
Indeed, I fear that such a patch would stay in the codebase for a long time, even after caching is merged, and could become a mine waiting to explode.

To be constructive, I propose to attack the caching right away, and volunteer to do it (it may be a big piece of work, though, so I expect it to take a while).
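
For illustration, the kind of caching I have in mind would look roughly like this (hypothetical sketch, not an existing scikit-learn API and not part of this PR):

```python
class CachedPredictWrapper:
    """Remember the last result per prediction method so repeated calls on
    the same X (e.g. during multi-metric scoring) are free."""

    def __init__(self, estimator):
        self.estimator = estimator
        self._cache = {}  # method name -> (X it was computed on, result)

    def _cached_call(self, method, X):
        cached = self._cache.get(method)
        if cached is not None and cached[0] is X:  # same array object again
            return cached[1]
        result = getattr(self.estimator, method)(X)
        self._cache[method] = (X, result)
        return result

    def predict(self, X):
        return self._cached_call('predict', X)

    def predict_proba(self, X):
        return self._cached_call('predict_proba', X)

    def decision_function(self, X):
        return self._cached_call('decision_function', X)
```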

@jnothman
Member

jnothman commented Apr 30, 2018 via email

@gamazeps
Contributor Author

@jnothman Hmm, I may have misunderstood you in the issue then ^^

If it's not too much of a hassle, why exactly wouldn't we want caching in the estimators (I'm not too familiar with the implementation of sklearn, so I may be missing something obvious)?

@jnothman
Member

Because the dataset being predicted might be large, and we don't want to force the storage of that data.

@gamazeps
Contributor Author

@jnothman When you say it like that, there is indeed an obvious reason to avoid storage by default :)

Should I try the method proposed [above](https://github.com//pull/10979#pullrequestreview-114281530), or is this good to go now that the name changes are applied?

@glemaitre
Member

@gamazeps I find the proposed method by @jnothman quite understandable. I would go for that one.

@glemaitre
Member

@gamazeps Any news?

@gamazeps
Contributor Author

Wooopsy...

I got caught up with administrative issues and did not have much time to dedicate to open source in the last few weeks; I expect to have more time in two weeks. Does that work for you?

Cheers
Felix

@jnothman
Member

@gamazeps are you able to work on this, or should we find someone else to complete it?

(Another alternative we may be considering would allow a single scorer function to compute multiple results, returning a dict.)

@NicolasHug
Member

(Another alternative we may be considering would allow a single scorer function to compute multiple results, returning a dict.)

Along these lines, WDYT about introducing make_multi_scorer and a MultiScorer class?
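
Hypothetical sketch of what I mean (neither make_multi_scorer nor MultiScorer exists in scikit-learn; this only illustrates the API shape):

```python
class MultiScorer:
    def __init__(self, metrics):
        self.metrics = metrics  # dict: name -> metric(y_true, y_pred)

    def __call__(self, estimator, X, y):
        y_pred = estimator.predict(X)  # one predict call shared by all metrics
        return {name: metric(y, y_pred) for name, metric in self.metrics.items()}

def make_multi_scorer(**metrics):
    return MultiScorer(metrics)

# Usage sketch:
# from sklearn.metrics import accuracy_score
# scoring = make_multi_scorer(accuracy=accuracy_score)
```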

@jnothman
Member

jnothman commented Apr 6, 2019 via email

@marctorsoc
Contributor

Do you need a hand to complete this? Happy to help @amueller @jnothman

@jnothman
Member

jnothman commented Aug 1, 2019

#14484 is the current candidate, @marctorrellas. Review from a user perspective is very welcome.

@marctorsoc
Contributor

maybe this can be closed then?

@NicolasHug
Member

We usually don't close PRs unless an equivalent implementation has been merged elsewhere. When/if #14484 gets merged, we will close this one.

@amueller
Member

fixed in #14593

@amueller amueller closed this Sep 11, 2019