
[MRG] Adds _MultimetricScorer for Optimized Scoring #14484


Closed
wants to merge 22 commits

Conversation

thomasjpfan
Member

@thomasjpfan thomasjpfan commented Jul 26, 2019

Reference Issues/PRs

Fixes #10802
Alternative to #10979

What does this implement/fix? Explain your changes.

  1. This PR creates a _MultimetricScorer that subclasses dict and is used to reduce the number of calls to predict, predict_proba, and decision_function (a rough sketch of the idea follows this list).

  2. The public interface of objects and functions that use scoring is unchanged.

  3. The cache is only used when it is beneficial, as determined by _MultimetricScorer._use_cache.
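A rough sketch of the caching idea (illustrative only, not the exact code in this PR; the _score(method_caller, ...) hook on individual scorers is an assumption about how they cooperate with the cache):

class _MultimetricScorer(dict):
    """Maps scorer name -> scorer and evaluates all of them with a shared cache."""

    def __call__(self, estimator, X, y_true):
        cache = {}  # method name -> prediction, computed at most once

        def cached_call(method_name):
            if method_name not in cache:
                cache[method_name] = getattr(estimator, method_name)(X)
            return cache[method_name]

        # Every scorer requests predictions through cached_call, so e.g.
        # predict_proba is computed once even if several metrics need it.
        return {name: scorer._score(cached_call, estimator, X, y_true)
                for name, scorer in self.items()}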

Any other comments?

I plan to support custom callables from the user that return dictionaries. That work is not included here in order to keep the scope of this PR to _MultimetricScorer.

@thomasjpfan thomasjpfan changed the title [MRG] Adds Multimetric Scorer [MRG] Extends Scorer for Multimetric Scoring Jul 26, 2019
@jnothman
Member

Thanks for this @thomasjpfan!

Is there any reason not to just generically support scoring: Callable[[Estimator, X, y, ...], Dict[Str, Numeric]] for multiple metrics? Your multimetric scorer has a couple of additional public attributes: the score_infos and returns_dict. Would it be sufficient to dynamically determine the return type and keys when the scorer is called?

If so, I think this becomes conceptually simpler for users (the existing list and dict specifications for scoring become shorthands for such a callable), and doesn't involve introducing a custom type.
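For concreteness, the kind of generic callable being suggested would look something like this (hypothetical user code, not an API that exists in this PR):

from sklearn.metrics import accuracy_score, balanced_accuracy_score

def scoring(estimator, X, y):
    y_pred = estimator.predict(X)  # computed once for both metrics
    return {
        "accuracy": accuracy_score(y, y_pred),
        "balanced_accuracy": balanced_accuracy_score(y, y_pred),
    }

# e.g. cross_validate(clf, X, y, scoring=scoring); the existing list and dict
# specifications for scoring would then just be shorthands for such a callable.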

@thomasjpfan
Member Author

@jnothman

Is there any reason not to just generically support scoring: Callable[[Estimator, X, y, ...], Dict[Str, Numeric]] for multiple metrics?

A user would need to build their callable to be smart about calling predict, predict_proba, and decision_function. It also would not let a user build efficient multimetric scorers out of individual scikit-learn scorers.

Your multimetric scorer has a couple of additional public attributes: the score_infos and returns_dict.

This is used internally by _fit_and_score, cross_validate, and BaseSearchCV to inspect the scorer object. On master, this information is transferred by _check_multimetric_scoring through a dictionary and a bool: the dictionary gives the name of each scorer, and the bool indicates whether the scoring is multimetric. This PR extends _Scorer to wrap the information provided by _check_multimetric_scoring.
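Roughly, that flow on master looks like this (paths and exact signature hedged; for illustration only):

from sklearn.linear_model import LogisticRegression
from sklearn.metrics.scorer import _check_multimetric_scoring

est = LogisticRegression()
scorers, is_multimetric = _check_multimetric_scoring(est, scoring=["accuracy", "f1"])
# scorers        -> {"accuracy": <scorer>, "f1": <scorer>}
# is_multimetric -> True; _fit_and_score and BaseSearchCV branch on this flag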

I have been considering making make_multimetric_scorer private and keeping the public interface exactly the same. Our private make_multimetric_scorer can do smart things with scorers made by make_scorer.

@amueller
Member

Just had a discussion with @thomasjpfan with some ideas of how to simplify this.

What I find strange about the current design is that our scorers are used just to provide meta-data and I think I would like to represent that meta-data more explicitly.

@thomasjpfan thomasjpfan changed the title [MRG] Extends Scorer for Multimetric Scoring [WIP] Extends Scorer for Multimetric Scoring Jul 31, 2019
@jnothman
Member

jnothman commented Jul 31, 2019 via email

@thomasjpfan
Member Author

But it allows them to easily avoid repeated computation in something like
precision_recall_fscore_support... Essentially to efficiently reuse a
single confusion matrix computation.

Good point. Allowing users to pass a callable that returns a dictionary would be pretty useful.
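For example (hypothetical user code), a single dict-returning callable could compute precision, recall, and F-score from one pass over the predictions instead of using three separate scorers:

from sklearn.metrics import precision_recall_fscore_support

def prf_scoring(estimator, X, y):
    y_pred = estimator.predict(X)
    precision, recall, fscore, _ = precision_recall_fscore_support(
        y, y_pred, average="binary")
    return {"precision": precision, "recall": recall, "f1": fscore}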

Would it be sufficient to dynamically determine the return type and keys when the scorer is called?

We could get away with this if the following error_score handling did not exist:

if is_multimetric:
    test_scores = dict(zip(scorer.keys(),
                           [error_score, ] * n_scorers))
    if return_train_score:
        train_scores = dict(zip(scorer.keys(),
                                [error_score, ] * n_scorers))

With all this feedback, this PR will be in flux, so I am marking it WIP.

@NicolasHug
Member

Please ping me when you need reviews!

@thomasjpfan thomasjpfan changed the title [WIP] Extends Scorer for Multimetric Scoring [MRG] Adds _MultimetricScorer for Optimized Scoring Aug 7, 2019
@thomasjpfan
Member Author

@NicolasHug

This is ready for a review.

  1. This PR creates a _MultimetricScorer that subclasses dict and is used to reduce the number of calls to predict, predict_proba, and decision_function.

  2. The public interface of objects and functions that use scoring is unchanged.

  3. The cache is only used when it is beneficial, as determined by _MultimetricScorer._use_cache.

I plan to support custom callables from the user that return dictionaries. That work is not included here in order to keep the scope of this PR to _MultimetricScorer.

@NicolasHug
Member

@thomasjpfan do you need a thorough review or just feedback on the API? For the former I could use a user guide, some examples and some comments / docstrings ;)

@thomasjpfan
Member Author

thomasjpfan commented Aug 7, 2019

For the former I could use a user guide, some examples and some comments / docstrings ;)

There are no public API changes. I added docstrings for the private _MultimetricScorer. This PR moves the multimetric scoring logic into metrics/scorer.py, which is slightly more natural than keeping that logic in model_selection/_validation.py.

@thomasjpfan
Member Author

Closed and resubmitted as #14593

@thomasjpfan thomasjpfan closed this Aug 7, 2019