More intuitive scoring argument for loss and error #5023

arjoly · 2015-07-23T16:17:44Z

Using the grid search meta-estimator with the "mean_square_error", the "mean_absolute_error", the "median_absolute_error" or the "log_loss" as scoring parameters leads to the negation of those metrics. This is confusing especially for new users.

I suggest that we prefix those strings by "neg-" or "negative_". This would make clear from the start that the score is obtained from the negation of the loss / error.

The text was updated successfully, but these errors were encountered:

kastnerkyle · 2015-07-23T17:27:31Z

1000x agreed

On Thu, Jul 23, 2015 at 12:17 PM, Arnaud Joly notifications@github.com
wrote:

Using grid search with the "mean_square_error", the "mean_absolute_error",
the "median_absolute_error" or the "log_loss"` as scoring parameter lead
to the negation of those metrics.
This is confusing especially for new users.

I suggest that we prefix those strings by "neg-" or "negative_". This
would make clear from the start that the score is obtained from the
negation of the loss / error.

—
Reply to this email directly or view it on GitHub
#5023.

mblondel · 2015-07-27T02:54:37Z

See also #2439.

There is also the solution of flipping the sign back but more people seem to favor the neg- prefix solution.

GaelVaroquaux · 2015-07-27T05:26:55Z

+1

amueller · 2015-07-30T20:55:35Z

I'd prefer to flip the sign back (which needs an additional line in GridSearchCV and friends).
As we can't seem to be able to get a consensus for that, I guess we have to go with some prefix like that.

jnothman · 2015-08-03T01:39:06Z

Prefix could just be - of course...

arjoly · 2015-09-09T08:26:08Z

Recently, I met one more people that was bitten by this. This person was very frustrated after loosing hours to understand what was happening.

mblondel · 2015-09-09T08:39:21Z

We could quickly implement both strategies (prefix and flipping sign back) in a branch and try them on an actual use case. This will probably help make our mind.

amueller · 2015-09-09T15:09:58Z

well we had flipping the sign back in master. The main criticism was that it requires one line of code in GridSearchCV iirc.
I'm not sure implementing the prefix will help us decide, because it is more about user perception. The implementation is pretty straight-forward, right?
The question is just if users will be confused that we don't have mse, but instead negative_mse / -mse

mblondel · 2015-09-09T16:59:41Z

We should A/B test our API :)

well we had flipping the sign back in master

I didn't know, when?

amueller · 2015-09-09T18:05:41Z

When we introduced the scoring. I thought you removed it in the refactoring.

mblondel · 2015-09-09T18:15:02Z

Strange, I'm my self +1 for flipping the sign back.

amueller · 2015-09-09T18:19:46Z

My bad, it was @larsmans 393b996#diff-b1caa89fb43c7835879f68a3ec494560

larsmans · 2015-09-09T18:29:11Z

Yeah, that was me... +1 on the neg_ prefix, I think I suggested that somewhere, too.

amueller · 2015-09-10T00:36:54Z

https://issues.apache.org/jira/browse/SPARK-10097 ^^

jnothman · 2015-09-10T00:39:30Z

https://issues.apache.org/jira/browse/SPARK-10097 ^^

They managed to close the issue in under 24 hours. Lol.

amueller · 2015-09-10T00:40:43Z

@larsmans why don't you like the other approach?

amueller · 2015-09-10T00:41:56Z

@jnothman They seem to be slightly more efficient then us lol. But I think they also have a couple of people full time on it.

larsmans · 2015-09-10T09:45:02Z

Because it requires adding attributes on functions, which at the time I deemed to hacky and complicated for custom metrics. However, I'm willing to drop that concern. We need to solve the issue one way or another.

amueller · 2015-10-21T08:12:01Z

Maybe we can discuss this at the sprint. This is really bad: https://github.com/scikit-learn/scikit-learn/pull/5498/files#diff-ff4438be9ee77932848f5abfcec060efR690

ogrisel · 2015-10-21T08:26:49Z

Initially I was in favor of having an explicit flag to state whether higher is better. I think I still prefer this solution over the neg- prefix solution.

amueller · 2015-10-23T13:24:13Z

We just had a discussion and agreed on the neg- fix

lvrzhn · 2015-11-16T19:40:56Z

Working on this now.

OmidSaremi · 2015-11-16T19:41:43Z

Working on it with @lvrzhn

amueller · 2015-12-11T00:24:10Z

@lvrzhn do you have a pull request?

davidthaler · 2015-12-11T09:44:33Z

Is anybody on this? If not, I'd like to take a shot at it. Also, is this supposed to be fixed at 0.17x or 0.18? This part of the code changed a lot between those milestones.

lvrzhn · 2015-12-11T19:17:57Z

David, I'm on it. Will do pull request today.

On Fri, Dec 11, 2015, 01:45 David Thaler notifications@github.com wrote:

Is anybody on this? If not, I'd like to take a shot at it. Also, is this
supposed to be fixed at 0.17x or 0.18? This part of the code changed a lot
between those milestones.

—
Reply to this email directly or view it on GitHub
#5023 (comment)
.

raghavrv · 2016-04-30T14:33:17Z

@davidthaler @lvrzhn Could I take up this issue?

jnothman · 2016-04-30T14:49:20Z

it looks like the PR at #6028 had been accidentally closed. I've reopened it on the basis that there was no indication it should be closed. @lvrzhn are you still intending to fix it up?

raghavrv · 2016-04-30T14:53:31Z

Ah! Sorry I missed that PR!

jykong · 2016-06-10T21:31:11Z

+1 from a confused new user

amueller · 2016-09-12T14:13:35Z

fixed in #7261.

arjoly added the API label Jul 23, 2015

amueller mentioned this issue Oct 21, 2015

[WIP] Storing the best attributes of (non-GridSearch) CV models #5498

Closed

amueller mentioned this issue Dec 14, 2015

Deprecated negative valued scorers #6028

Closed

davidthaler mentioned this issue Jan 7, 2016

mean_absolute_error gives out negative result #6129

Closed

raghavrv mentioned this issue Sep 12, 2016

[MRG + 2] Rename scorers like mse to neg_mse #7261

Merged

4 tasks

amueller closed this as completed Sep 12, 2016

Uh oh!

More intuitive scoring argument for loss and error #5023

More intuitive scoring argument for loss and error #5023

Comments

arjoly commented Jul 23, 2015

kastnerkyle commented Jul 23, 2015

Uh oh!

mblondel commented Jul 27, 2015

Uh oh!

GaelVaroquaux commented Jul 27, 2015 via email

Uh oh!

amueller commented Jul 30, 2015

Uh oh!

jnothman commented Aug 3, 2015

Uh oh!

arjoly commented Sep 9, 2015

Uh oh!

mblondel commented Sep 9, 2015

Uh oh!

amueller commented Sep 9, 2015

Uh oh!

mblondel commented Sep 9, 2015

Uh oh!

amueller commented Sep 9, 2015

Uh oh!

mblondel commented Sep 9, 2015

Uh oh!

amueller commented Sep 9, 2015

Uh oh!

larsmans commented Sep 9, 2015

Uh oh!

amueller commented Sep 10, 2015

Uh oh!

jnothman commented Sep 10, 2015

Uh oh!

amueller commented Sep 10, 2015

Uh oh!

amueller commented Sep 10, 2015

Uh oh!

larsmans commented Sep 10, 2015

Uh oh!

amueller commented Oct 21, 2015

Uh oh!

ogrisel commented Oct 21, 2015

Uh oh!

amueller commented Oct 23, 2015

Uh oh!

lvrzhn commented Nov 16, 2015

Uh oh!

OmidSaremi commented Nov 16, 2015

Uh oh!

amueller commented Dec 11, 2015

Uh oh!

davidthaler commented Dec 11, 2015

Uh oh!

lvrzhn commented Dec 11, 2015

Uh oh!

raghavrv commented Apr 30, 2016

Uh oh!

jnothman commented Apr 30, 2016

Uh oh!

raghavrv commented Apr 30, 2016

Uh oh!

jykong commented Jun 10, 2016

Uh oh!

amueller commented Sep 12, 2016

Uh oh!