Add balanced_accuracy_score metrics #3506


Closed
arjoly opened this issue Jul 30, 2014 · 14 comments
Labels
Easy Well-defined and straightforward way to resolve New Feature

Comments

@arjoly
Member

arjoly commented Jul 30, 2014

There has been some discussion on the mailing list about adding a balanced accuracy metric (see this article for the definition). This is a good opportunity for a first contribution.

This involves implementing the function, verifying correctness with tests, and documenting your work. To make it easy to use, a balanced accuracy scorer would also be a good idea. As a bonus, it could support sample_weight.
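A minimal sketch of what such a function might look like (the name, signature, and multiclass handling are illustrative only, not the final API):

```python
import numpy as np

def balanced_accuracy(y_true, y_pred, sample_weight=None):
    """Mean of the per-class recalls, optionally sample-weighted (sketch)."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    if sample_weight is None:
        sample_weight = np.ones(len(y_true))
    sample_weight = np.asarray(sample_weight, dtype=float)
    recalls = []
    for cls in np.unique(y_true):
        mask = y_true == cls
        w = sample_weight[mask]
        # weighted fraction of this class's samples that were predicted correctly
        recalls.append(np.sum(w * (y_pred[mask] == cls)) / np.sum(w))
    return float(np.mean(recalls))
```

For example, `balanced_accuracy([0, 0, 1, 1], [0, 0, 1, 0])` gives `(1.0 + 0.5) / 2 = 0.75`.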

@lazywei
Contributor

lazywei commented Jul 30, 2014

I'm trying to work on this, but I have a small question: I don't really understand the definition of TP, TN, FP, and FN when the classification task is not binary. What counts as "positive" in the multiclass or multilabel case?

Also, sensitivity is by definition TP / (TP + FN). What happens when TP + FN = 0, i.e. when y_true = [0, 0, 0, 0]?

I'm not very familiar with the terms TP, FN, etc., so please correct me if I've made a mistake.

Thanks.
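To make the TP + FN = 0 edge case concrete (assuming label 1 is the positive class):

```python
import numpy as np

y_true = np.array([0, 0, 0, 0])  # no samples of the positive class (1) at all
y_pred = np.array([0, 1, 0, 0])

tp = int(np.sum((y_true == 1) & (y_pred == 1)))  # 0
fn = int(np.sum((y_true == 1) & (y_pred == 0)))  # 0
# sensitivity = tp / (tp + fn) would divide by zero here, so an
# implementation has to pick a convention -- e.g. scikit-learn's
# recall_score returns 0.0 and emits an UndefinedMetricWarning.
```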

@jnothman
Member

Perhaps it should just support the binary case at first.


@larsmans
Member

TP, TN, FP and FN are only well-defined when one class is considered negative (which is quite common). Usually we pick the class that is smallest according to Python's standard ordering unless overridden by the user.
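For instance, with labels {0, 1} and 0 taken as the negative class, the four counts are just masked sums (a toy illustration, not scikit-learn's internal code):

```python
import numpy as np

y_true = np.array([0, 1, 1, 0, 1])
y_pred = np.array([0, 1, 0, 1, 1])

pos, neg = 1, 0  # the smaller label (0) is treated as negative by default
tp = int(np.sum((y_true == pos) & (y_pred == pos)))  # true positives:  2
tn = int(np.sum((y_true == neg) & (y_pred == neg)))  # true negatives:  1
fp = int(np.sum((y_true == neg) & (y_pred == pos)))  # false positives: 1
fn = int(np.sum((y_true == pos) & (y_pred == neg)))  # false negatives: 1
```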

@adam-m-mcelhinney

@lazywei , did you get a chance to start on this? If not, I may take a stab at it.

@lazywei
Contributor

lazywei commented Oct 22, 2014

@adam-m-mcelhinney
I have started a PR at #3511, and I thought I had finished it.
Anyway, you could check out my branch and build on it if you want. In that case, let me know when you're done and I'll close my PR.
Thanks

@adam-m-mcelhinney

Got it. Thanks.

@ogrisel
Member

ogrisel commented Dec 2, 2014

@ppuggioni you have worked on this during the sprint: can you please open a PR with a [WIP] marker in the title with the current state of your work?

@ppuggioni
Contributor

@ogrisel yes, but yesterday I realised that @lazywei may have already done the job. I will push mine and open a PR with [WIP], but it might be worthwhile to compare mine with his to avoid duplication and a double review process.

@ogrisel
Member

ogrisel commented Dec 2, 2014

Indeed, please submit yours and cross-reference the two PRs to compare the results.

@xuewei4d
Contributor

Hi all,

May I take over this issue? It seems that the two referenced PRs have stalled.

I have done the coding, documentation, and testing. I found that the balanced accuracy score is equal to the average of the positive-label recall and the negative-label recall. I did not take the balance weight into account, since on Wikipedia it is fixed at 0.5, so my code is quite simple. May I open a [WIP] PR? @ogrisel @jnothman
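That equivalence is easy to check numerically; a small sketch using plain NumPy (the data is made up for illustration):

```python
import numpy as np

y_true = np.array([0, 1, 0, 0, 1, 0])
y_pred = np.array([0, 1, 0, 0, 0, 1])

# recall of each class = fraction of that class's samples predicted correctly
recall_pos = np.mean(y_pred[y_true == 1] == 1)  # 1 of 2 positives -> 0.5
recall_neg = np.mean(y_pred[y_true == 0] == 0)  # 3 of 4 negatives -> 0.75

balanced_acc = 0.5 * (recall_pos + recall_neg)  # 0.625
```

(In current scikit-learn this is exposed as sklearn.metrics.balanced_accuracy_score, added by #8066.)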

I am new to scikit-learn development, but I have used the package for a long time. I am a PhD student studying machine learning, and I hope to join GSoC 2015.

Thanks,

@adam-m-mcelhinney

I haven't touched it. I believe someone else has finished it, however.


@xuewei4d
Contributor

Hi @adam-m-mcelhinney,

Searching for the keywords "balanced accuracy" shows only two relevant PRs, #3929 and #3511, and neither has made progress since last year. If someone else had finished it, they would have cross-referenced this issue (#3506).

@adam-m-mcelhinney

Looks like it's all yours then. Let me know if you want to collaborate on anything.


@lesteve
Member

lesteve commented Oct 18, 2017

Closed by #8066.

@lesteve lesteve closed this as completed Oct 18, 2017