[MRG] Allow the scorer callable to return any object #7424


Closed
wants to merge 1 commit into from

Conversation

@acu192 acu192 commented Sep 14, 2016

This `_score` function is (indirectly) a helper only for the `cross_val_score` function. I posit that we should allow the `scorer` argument (a callable) to return any object, for these reasons:

1. It is safe to do so. The objects returned by the `scorer` callable are only collected into an ndarray. Easy squeezy.
2. When a user-defined `scorer` is passed in, this allows for very flexible behavior. E.g., I have a scorer that returns a tuple of several metrics I want to track. This change lets me actually use my scorer! I've tested it and it works great!

Please consider. Thank you all for such a great library.
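For the record, a multi-metric scorer along these lines might look like the following sketch. The stand-in estimator and all names here are illustrative, not scikit-learn API; the point is only that the scorer keeps the usual `(estimator, X, y)` signature while returning a tuple, and that the per-fold results collect cleanly into an ndarray.

```python
import numpy as np

# Minimal stand-in estimator so the sketch runs without scikit-learn;
# any object with fit()/predict() behaves the same way here.
class MeanPredictor:
    def fit(self, X, y):
        self.mean_ = float(np.mean(y))
        return self

    def predict(self, X):
        return np.full(len(X), self.mean_)

# Hypothetical multi-metric scorer (illustrative, not scikit-learn API):
# same (estimator, X, y) signature, but returns a tuple of metrics
# instead of a single float.
def multi_metric_scorer(estimator, X, y):
    y_pred = estimator.predict(X)
    mse = float(np.mean((y - y_pred) ** 2))
    mae = float(np.mean(np.abs(y - y_pred)))
    return (mse, mae)

X = np.arange(6, dtype=float).reshape(-1, 1)
y = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0])
est = MeanPredictor().fit(X, y)

# cross_val_score-style collection: one scorer result per fold,
# gathered into an ndarray (3 identical "folds" here for brevity).
scores = np.array([multi_metric_scorer(est, X, y) for _ in range(3)])
print(scores.shape)  # (3, 2): folds x metrics
```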
@amueller (Member)

`GridSearchCV` and `RandomizedSearchCV` compare the scores to select the best estimator.

@acu192 (Author) commented Sep 14, 2016

@amueller Good point. I would argue that this change is still okay, because:

  1. This change only affects user-defined `scorer` arguments.

  2. The user might want their user-defined `scorer` to return non-`numbers.Number` objects even when using `GridSearchCV` or `RandomizedSearchCV`. Looking at `BaseSearchCV._fit`, as long as the returned objects support the `*=`, `+=`, `/=`, and `<` operators, it will all still work. Yay duck typing. (Granted, this scenario seems unlikely, but see my next point.)

  3. If the user's scorer returns weird objects that don't work with `GridSearchCV` and `RandomizedSearchCV`, an error will still be thrown. E.g., in my tuple example, I get an error like:

    TypeError: unsupported operand type(s) for +=: 'int' and 'tuple'
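The failure mode in point 3 is easy to reproduce outside scikit-learn: accumulating a tuple into a plain integer accumulator, as numeric score-averaging code does, raises exactly this `TypeError`. (Per point 2, a user-defined score object that implemented the relevant arithmetic and comparison operators would slip through instead.) A minimal sketch:

```python
# Reproducing the error from point 3: numeric accumulation over fold
# scores breaks as soon as a scorer returns a tuple.
score_sum = 0
fold_score = (0.8, 0.7)  # tuple returned by a hypothetical scorer
try:
    score_sum += fold_score
except TypeError as exc:
    print(exc)  # unsupported operand type(s) for +=: 'int' and 'tuple'
```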

@amueller (Member)

@acu192 I think this check was put there specifically to throw a better error than "TypeError: unsupported operand types", probably because someone spent hours debugging it and then complained in the issue tracker ;)
We are actually working on scorers that return more than just a single float, and on using multiple scorers; see #7388. The fix will be a bit more complicated than this, though. In particular, we want grid search over more complex metrics.

@jnothman (Member)

Back in the days before scorer objects (pre scikit-learn 0.13, IIRC), this kind of thing worked, and we've been trying to fix it ever since. We should have support for, say, tuples of arrays of floats in 0.19 if everything works out. This is, unfortunately, the wrong fix.

@jnothman closed this Sep 14, 2016
3 participants