NDCG score doesn't work with binary relevance and a list of 1 element #21335

Closed
cBournhonesque opened this issue Oct 14, 2021 · 14 comments · Fixed by #25672
Labels: Enhancement, good first issue (Easy with clear instructions to resolve), module:metrics

Comments

@cBournhonesque

See this code example:

>>> from sklearn import metrics
>>> t = [[1]]
>>> p = [[0]]
>>> metrics.ndcg_score(t, p)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/cbournhonesque/.pyenv/versions/bento/lib/python3.8/site-packages/sklearn/utils/validation.py", line 63, in inner_f
    return f(*args, **kwargs)
  File "/Users/cbournhonesque/.pyenv/versions/bento/lib/python3.8/site-packages/sklearn/metrics/_ranking.py", line 1567, in ndcg_score
    _check_dcg_target_type(y_true)
  File "/Users/cbournhonesque/.pyenv/versions/bento/lib/python3.8/site-packages/sklearn/metrics/_ranking.py", line 1307, in _check_dcg_target_type
    raise ValueError(
ValueError: Only ('multilabel-indicator', 'continuous-multioutput', 'multiclass-multioutput') formats are supported. Got binary instead

It works correctly when the number of elements is greater than 1: https://stackoverflow.com/questions/64303839/how-to-calculate-ndcg-with-binary-relevances-using-sklearn
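
For comparison, here is a minimal sketch of the working multi-element case (outputs shown approximately, assuming scikit-learn's default linear gains and log2 discount):

>>> from sklearn import metrics
>>> # Two documents: the relevant one is ranked last by the scores,
>>> # so NDCG = DCG / IDCG = (1 / log2(3)) / 1, roughly 0.63.
>>> metrics.ndcg_score([[1, 0]], [[0, 1]])
0.63...
>>> # A perfect ranking gives 1.0.
>>> metrics.ndcg_score([[1, 0]], [[1, 0]])
1.0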

@adrinjalali
Member

It doesn't seem like a well-defined problem in the case of a single input to me. I'm not sure what you'd expect to get.

@cBournhonesque
Author

cBournhonesque commented Oct 15, 2021

I'm skipping the computation if there are 0 relevant documents (any(truths) is False), since the metric is undefined.
For a single input, where truth = [1], I would expect to get 1 if the prediction is 1, or 0 if the prediction is 0 (according to the NDCG definition).

@adrinjalali
Member

pinging @jeremiedbb and @jeromedockes who worked on the implementation.

@jeromedockes
Contributor

I would expect to get 1 if the prediction is 1, or 0 if the prediction is 0 (according to the NDCG definition)

Which NDCG definition? Could you point to a reference? (I ask because IIRC there is some variability in the definitions people use.)

Normalized DCG is the ratio between the DCG obtained for the predicted ranking and the DCG of the ideal (true) ranking. In my understanding, when there is only one possible ranking (i.e. only one candidate, as in this example), both rankings are the same, so the ratio should be 1. (This is the value we obtain if we disable this check.)

However, ranking a list of length 1 is not meaningful, so if y_true has only one column it seems more likely that there was a mistake in the formatting/representation of the true gains, or that a user applied this ranking metric to a binary classification task. Therefore raising an error seems reasonable to me, but I guess the message could be improved (although it is hard to guess what the mistake was). Showing a warning and returning 1.0 could also be an option.
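
To make the ratio argument concrete, here is a small hand computation (a sketch; the dcg helper below is made up for illustration and uses linear gains with a log2 discount, matching scikit-learn's defaults):

>>> import numpy as np
>>> def dcg(relevances_in_ranked_order):
...     # DCG with linear gains: sum of rel_i / log2(i + 1) over positions i = 1..n.
...     rel = np.asarray(relevances_in_ranked_order, dtype=float)
...     positions = np.arange(1, len(rel) + 1)
...     return float(np.sum(rel / np.log2(positions + 1)))
...
>>> # With a single candidate there is only one possible ranking, so the
>>> # predicted DCG and the ideal DCG are always equal.
>>> dcg([1]) / dcg([1])   # truth = [1]: the ratio is 1 whatever the score
1.0
>>> dcg([0]), dcg([0])    # truth = [0]: the ratio is 0 / 0, i.e. undefined
(0.0, 0.0)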

@jeromedockes
Contributor

Note this is a duplicate of #20119, AFAICT.

@cBournhonesque
Author

cBournhonesque commented Oct 18, 2021

Hi Jerome, you are right, I made a mistake. I'm using the definition on Wikipedia.
It looks like the result would be 0.0 if the document isn't relevant (relevance = 0), or 1.0 if it is (relevance > 0). So the returned value could be equal to y_true[0] > 0?
In any case, I think that just updating the error message but keeping the current behaviour could be fine too.

@jeromedockes
Contributor

Indeed, when all documents are truly irrelevant the NDCG is 0 / 0 (undefined), and currently 0 is returned (as seen here).

But I still think that measuring NDCG for a list of 1 document is not meaningful (regardless of the value of the relevance), so raising an error about the shape of y_true makes sense.
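
A quick illustration of that fallback with more than one document (behaviour as described above; the value shown is what the current implementation returns):

>>> from sklearn import metrics
>>> # All-zero true relevances: the ideal DCG is 0, so NDCG is 0 / 0;
>>> # the implementation returns 0.0 for such samples instead of raising.
>>> metrics.ndcg_score([[0, 0]], [[1, 2]])
0.0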

@glemaitre
Member

So we should improve the error message in this case.
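
A minimal sketch of what such a check could look like (the helper name and message wording here are hypothetical, not necessarily the fix merged in #25672):

import numpy as np

def _check_ndcg_shape(y_true):
    # Hypothetical guard: give a clearer error when there is only one document
    # to rank, instead of the generic "Got binary instead" target-type message.
    y_true = np.asarray(y_true)
    if y_true.ndim == 2 and y_true.shape[1] < 2:
        raise ValueError(
            "Computing NDCG is only meaningful when there is more than one "
            "document to rank; y_true has a single column."
        )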

@georged4s

I am happy to work on this if it hasn't been assigned yet.

@glemaitre
Member

@georged4s I can see that #24482 has been opened but it seems stalled. I think you can claim the issue and propose a fix. You can also look at the review done in the older PR.

@georged4s

Thanks @glemaitre for replying and for the heads up. Cool, I will look into this one.

@kayuksel

I came here as I ran into the same problem: it doesn't support binary targets.

Also, it would be great if it could be calculated simultaneously for a batch of users.
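
For reference, ndcg_score can take one row per user and averages the per-row scores (sample_weight allows weighting); a small sketch, with the output shown approximately:

>>> from sklearn import metrics
>>> # Each row is one user's relevance list / score list; the result is the
>>> # mean NDCG over the rows.
>>> y_true = [[1, 0, 1], [0, 1, 0]]
>>> y_score = [[0.8, 0.4, 0.1], [0.3, 0.9, 0.2]]
>>> metrics.ndcg_score(y_true, y_score)
0.95...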

@JanFidor
Contributor

Hi, there doesn't seem to be a linked PR (excluding the stalled one); could I pick it up?

@lene
Contributor

lene commented Feb 23, 2023

Picking it up as part of the PyLadies "Contribute to scikit-learn" workshop.
