
Feature request: add absolute=True/False to regression metric mean_absolute_error #17853


Closed
raybellwaves opened this issue Jul 6, 2020 · 2 comments


@raybellwaves (Contributor)

Apologies if this has been brought up before. I did a quick check of https://github.com/scikit-learn/scikit-learn/issues?q=is%3Aissue+sort%3Aupdated-desc+mean+error+is%3Aclosed+label%3Amodule%3Ametrics

Describe the workflow you want to enable

I am interested in the sign of the error, and if possible I would rather do this in scikit-learn than in numpy:

    mean_absolute_error(y_true, y_pred, absolute=False)  # returns np.average(y_pred - y_true)
    mean_absolute_error(y_true, y_pred)                  # returns the same as before
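For reference, the plain-numpy workaround this would replace (with example data made up here) is just the signed average of the residuals:

    import numpy as np

    y_true = np.array([3.0, -0.5, 2.0, 7.0])
    y_pred = np.array([2.5, 0.0, 2.0, 8.0])

    # Signed mean error: what absolute=False would return.
    print(np.average(y_pred - y_true))  # 0.25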

Describe your proposed solution

Adding an absolute keyword (defaulting to True, the current behaviour) would update https://github.com/scikit-learn/scikit-learn/blob/fd237278e/sklearn/metrics/_regression.py#L122-L190 to be:

def mean_absolute_error(y_true, y_pred,
                        absolute=True, sample_weight=None,
                        multioutput='uniform_average'):
...
    Parameters
    ----------
    absolute : bool, default=True
        If True (the default), return the mean absolute error (MAE);
        if False, return the signed mean error (ME).
...
    if absolute:
        output_errors = np.average(np.abs(y_pred - y_true),
                                   weights=sample_weight, axis=0)
    else:
        output_errors = np.average(y_pred - y_true,
                                   weights=sample_weight, axis=0)
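A minimal, self-contained sketch of the proposed behaviour, using the same example data as above (the mean_error_sketch name is hypothetical, not actual scikit-learn code):

    import numpy as np

    def mean_error_sketch(y_true, y_pred, absolute=True, sample_weight=None):
        # Hypothetical helper mirroring the proposed absolute flag;
        # not part of scikit-learn.
        diff = np.asarray(y_pred, dtype=float) - np.asarray(y_true, dtype=float)
        if absolute:
            diff = np.abs(diff)
        return np.average(diff, weights=sample_weight, axis=0)

    y_true = [3.0, -0.5, 2.0, 7.0]
    y_pred = [2.5, 0.0, 2.0, 8.0]
    print(mean_error_sketch(y_true, y_pred))                  # 0.5 (MAE)
    print(mean_error_sketch(y_true, y_pred, absolute=False))  # 0.25 (signed mean error)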

Describe alternatives you've considered, if relevant

I'm not sure whether a new metric is needed instead, i.e. mean_error.

I believe this approach of adding a new argument was applied to mean_squared_error (squared=True); see #12895

Additional context

If this is of interest, I'll be happy to work on it during the SciPy sprint.

@thomasjpfan (Member)

I would be -1 on this.

  1. Adding an absolute=False option to mean_absolute_error would run counter to the function's name.
  2. I think this would be a poor metric because errors can cancel out:
import numpy as np

y_true = np.array([1, -1, 1, -1])
y_pred = np.array([10000, -10000, 10000, -10000])

np.average(y_true - y_pred)
# 0.0
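For contrast, the existing absolute metric keeps the error magnitudes on the same data, so nothing cancels:

    import numpy as np
    from sklearn.metrics import mean_absolute_error

    y_true = np.array([1, -1, 1, -1])
    y_pred = np.array([10000, -10000, 10000, -10000])

    mean_absolute_error(y_true, y_pred)
    # 9999.0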

@raybellwaves (Contributor, Author)

True. I guess it's not really an error metric but rather further analysis of the distribution of the errors (bias is probably a better term for np.average(y_true - y_pred)?).
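As a quick sketch of that framing (example data made up): the signed average reads as a bias, i.e. a systematic over- or under-prediction, which complements MAE rather than replacing it:

    import numpy as np

    # Predictions that systematically over-shoot by 0.5.
    y_true = np.array([3.0, -0.5, 2.0, 7.0])
    y_pred = np.array([3.5, 0.0, 2.5, 7.5])

    # Positive bias: the model over-predicts on average.
    print(np.average(y_pred - y_true))  # 0.5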
