Skip to content

[WIP] Added DET curve to classification metrics (Rebased) #8724

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 10 commits into from

Conversation

jucor
Copy link
Contributor

@jucor jucor commented Apr 9, 2017

Reference Issue

Resumes #4980 started by @jkarnows , which was left almost entirely finished.

What does this implement/fix? Explain your changes.

The great PR #4980 was left somewhat unfinished, and due to changes to the main trunk could not be merged any more.

The present PR rebases on master, and offers to finish this DET curve that I need for my own work.

Any other comments?

First time I contribute to scikit-learn, I do not mean to steal @jkarnows work, but it'd be a waste not to build on his great contribution.

@jucor jucor changed the title Rebased det curve [WIP] Added DET curve to classification metrics (Rebased) Apr 9, 2017
Copy link
Contributor

@dmohns dmohns left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the following two things should be added to this PR:

  1. The naming of inputs and outputs should be a little consistent with the rest of the ranking module e.g. fpr where rates are returned or fps when counts are required. See roc_curve or _binary_clf_curve
  2. Some basic testing would be nice like for ROC and PR curves.

.. [1] `Wikipedia entry for Detection error tradeoff
<https://en.wikipedia.org/wiki/Detection_error_tradeoff>`_
.. [2] `The DET Curve in Assessment of Detection Task Performance
<http://www.itl.nist.gov/iad/mig/publications/storage_paper/det.pdf>`_
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This link doesn't work. It appears to be a problem on https://www.nist.gov/ site though.

.. [2] `The DET Curve in Assessment of Detection Task Performance
<http://www.itl.nist.gov/iad/mig/publications/storage_paper/det.pdf>`_
.. [3] `2008 NIST Speaker Recognition Evaluation Results
<http://www.itl.nist.gov/iad/mig/tests/sre/2008/official_results/>`_
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

see above

.. [3] `2008 NIST Speaker Recognition Evaluation Results
<http://www.itl.nist.gov/iad/mig/tests/sre/2008/official_results/>`_
.. [4] `DET-Curve Plotting software for use with MATLAB
<http://www.itl.nist.gov/iad/mig/tools/DETware_v2.1.targz.htm>`_
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

see above


Returns
-------
fps : array, shape = [n_thresholds]
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If I am not mistaken the DET curve should return rates not counts. This looks like a copy/paste legacy which we should update accordingly.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
module:metrics Superseded PR has been replace by a newer PR
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants