[MRG+2] ENH: median_absolute_error consistent with other regression metrics #3764

FlorianWilhelm · 2014-10-13T18:44:16Z

As discussed in #3761, this PR should make metrics.median_absolute_error consistent to the other regression metrics like mean_absolute_error in case of multi-output.

larsmans · 2014-10-13T19:37:08Z

Ping @arjoly. The test failure is unrelated and will be fixed by #3765 (I hope).

arjoly · 2014-10-14T07:59:02Z

@FlorianWilhelm Thanks for handling this case.

It looks good. However, are you sure that is how it should be computed? I am not familiar with this metrics. Would it be better to have L1 space median? In doubt, I would be in favor of raising an error with multi-output data for now. Sorry if my previous comment weren't clear.

FlorianWilhelm · 2014-10-14T10:52:39Z

@arjoly I think in this case, there is no wrong and right, only consistency. It could also be argued that the way mean_absolute_error handles multi-output might be a bad idea depending on your use-case. What if for instance the first and the second column of y have a completely different scale? Averaging row-wise might then not be best idea.
This PR would do the same thing in case of median_absolute_error only in a robust way by using the median. I think this is a consistent solution and way better than raising an error.

arjoly · 2014-10-14T13:01:55Z

I am +1 when what happens with multi-output data will be stated clearly in model_evaluation.rst.

Thanks for polishing the small details!

FlorianWilhelm · 2014-10-14T13:43:01Z

@arjoly I added a few words about our approach in model_evaluation.rst. What do you think?

arjoly · 2014-10-14T13:47:43Z

I would be more precise in the median absolute error section. Something like

With multi-output data, the median absolute error is averaged over all outputs.

This current situation will change and improve whenever #3474 is merged.

coveralls · 2014-10-14T13:58:24Z

Coverage decreased (-0.0%) when pulling 7f2b1a4 on FlorianWilhelm:median_absolute_error into 031a3fc on scikit-learn:master.

FlorianWilhelm · 2014-10-14T14:03:14Z

@arjoly Added a line directly to the median absolute error section.

arjoly · 2014-10-14T14:03:57Z

doc/modules/model_evaluation.rst

+and :func:`r2_score`. Multioutput is handled with a two level approach. First,
+the chosen metric is applied to each row of the output matrix in order to
+generate a single value per sample. This creates a one-dimensional output
+vector. In a second step, the metric is applied to the output vector.


Reading this again. This might not be true for r2_score.

What do you suggest? Should I remove my sentences or should r2_score be changed?

+1 for removing the sentence.

arjoly · 2014-10-15T09:23:23Z

+1 from my side !
Thanks @FlorianWilhelm !

MechCoder · 2014-10-27T10:58:12Z

I am not familiar about the use case of this metric. Intuitively, does it not make sense to find the median across every output and then find the median across all such medians?

However in all other metrics, it has been done the same way, average across all outputs, and then average across all samples, instead of averaging the metrics across all outputs, so at least the user has an idea of what to expect.

So I am +1 for consistency. Thanks,

MechCoder · 2014-10-27T23:11:12Z

On second thoughts, I'm not sure taking median across the outputs, (i.e axis=1 ) is meaningful (my limited knowledge, ofc). Unless there is a particular use case (or some reference) that you have in mind, I think just raising an error might be better if y_true.ndim > 1

FlorianWilhelm · 2014-11-03T10:29:37Z

@MechCoder I think the behavior is as wrong or right as in the case of applying the mean in the other metrics but it would be consistent and it is stated in the documentation like that. Why not just merge this as a first step and in a second larger step we should reconsider the whole multi-output handling anyway. Maybe the user should be able to supply a cost functional to judge the quality of a multi-output.
I think the metrics should be defined only on scalar output, and in case of mulit-output there should be default cost functionals provided that transform a vector into a scalar. Right now these two things are mixed together which is not clean in my opinion.

jnothman · 2014-11-03T13:00:13Z

I agree that it is better not to support multi-output (here; don't worry
about the existing implementation) until someone who has expertise in that
area proposes an algorithm from the literature.

On 3 November 2014 21:29, Florian Wilhelm notifications@github.com wrote:

@MechCoder https://github.com/MechCoder I think the behavior is as
wrong or right as in the case of applying the mean in the other metrics but
it would be consistent and it is stated in the documentation like that. Why
not just merge this as a first step and in a second larger step we should
reconsider the whole multi-output handling anyway. Maybe the user should be
able to supply a cost functional to judge the quality of a multi-output.
I think the metrics should be defined only on scalar output, and in case
of mulit-output there should be default cost functionals provided that
transform a vector into a scalar. Right now these two things are mixed
together which is not clean in my opinion.

—
Reply to this email directly or view it on GitHub
#3764 (comment)
.

…an_absolute_error Conflicts: doc/modules/model_evaluation.rst

FlorianWilhelm · 2014-11-19T20:27:48Z

Sorry for letting you wait, was quite busy the last weeks. I removed the multioutput feature of median_absolute_error as requested.

MechCoder · 2014-11-20T13:06:39Z

Looks good. @arjoly please merge the next time, when you see it.

arjoly · 2014-11-20T15:12:02Z

merged as 167e96e

thanks @FlorianWilhelm !

ENH: median_absolute_error consistent with other regression metrics

301a129

FlorianWilhelm mentioned this pull request Oct 13, 2014

Median absolute error #3761

Merged

Merge branch 'master' into median_absolute_error

67c63a2

DOC: Explained multioutput in model_evalution

7f2b1a4

DOC: Better explanation of multi-output

1ff4bb0

arjoly reviewed Oct 14, 2014
View reviewed changes

DOC: Removed explanation about multioutput regression metric

32fbaee

arjoly changed the title ~~ENH: median_absolute_error consistent with other regression metrics~~ [MRG+1] ENH: median_absolute_error consistent with other regression metrics Oct 15, 2014

FlorianWilhelm mentioned this pull request Oct 27, 2014

[MRG+1] TheilSen robust linear regression #2949

Merged

MechCoder force-pushed the master branch from 6deaea0 to 3f49cee Compare November 3, 2014 12:36

FlorianWilhelm added 3 commits November 19, 2014 19:46

Merge branch 'master' into median_absolute_error

cf0938e

ENH: Removed multioutput of median_absolute_error

53a1991

Merge remote-tracking branch 'origin/median_absolute_error' into medi…

0116fd0

…an_absolute_error Conflicts: doc/modules/model_evaluation.rst

DOC: Removed wrong sentence

afaf728

MechCoder changed the title ~~[MRG+1] ENH: median_absolute_error consistent with other regression metrics~~ [MRG+2] ENH: median_absolute_error consistent with other regression metrics Nov 20, 2014

arjoly closed this Nov 20, 2014

arjoly reopened this Nov 20, 2014

MechCoder closed this Nov 20, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MRG+2] ENH: median_absolute_error consistent with other regression metrics #3764

[MRG+2] ENH: median_absolute_error consistent with other regression metrics #3764

FlorianWilhelm commented Oct 13, 2014

larsmans commented Oct 13, 2014

arjoly commented Oct 14, 2014

FlorianWilhelm commented Oct 14, 2014

arjoly commented Oct 14, 2014

FlorianWilhelm commented Oct 14, 2014

arjoly commented Oct 14, 2014

coveralls commented Oct 14, 2014

FlorianWilhelm commented Oct 14, 2014

arjoly Oct 14, 2014

FlorianWilhelm Oct 14, 2014

arjoly Oct 14, 2014

FlorianWilhelm Oct 15, 2014

arjoly commented Oct 15, 2014

MechCoder commented Oct 27, 2014

MechCoder commented Oct 27, 2014

FlorianWilhelm commented Nov 3, 2014

jnothman commented Nov 3, 2014

FlorianWilhelm commented Nov 19, 2014

MechCoder commented Nov 20, 2014

arjoly commented Nov 20, 2014

[MRG+2] ENH: median_absolute_error consistent with other regression metrics #3764

[MRG+2] ENH: median_absolute_error consistent with other regression metrics #3764

Conversation

FlorianWilhelm commented Oct 13, 2014

larsmans commented Oct 13, 2014

arjoly commented Oct 14, 2014

FlorianWilhelm commented Oct 14, 2014

arjoly commented Oct 14, 2014

FlorianWilhelm commented Oct 14, 2014

arjoly commented Oct 14, 2014

coveralls commented Oct 14, 2014

FlorianWilhelm commented Oct 14, 2014

arjoly Oct 14, 2014

Choose a reason for hiding this comment

FlorianWilhelm Oct 14, 2014

Choose a reason for hiding this comment

arjoly Oct 14, 2014

Choose a reason for hiding this comment

FlorianWilhelm Oct 15, 2014

Choose a reason for hiding this comment

arjoly commented Oct 15, 2014

MechCoder commented Oct 27, 2014

MechCoder commented Oct 27, 2014

FlorianWilhelm commented Nov 3, 2014

jnothman commented Nov 3, 2014

FlorianWilhelm commented Nov 19, 2014

MechCoder commented Nov 20, 2014

arjoly commented Nov 20, 2014