Binary risk control - implementation batch 1 #735

Valentin-Laurent · 2025-07-29T15:07:33Z

Context

Implementation of binary classification risk control in its simplest form:

Mono-risk
Unidimensional lambda (using thresholding on predict_proba)

Content

This PR includes (lists incrementally updated):

Existing risk-control code:

Remove useless check on lambda=None in ltt_procedure
Remove useless p_values from ltt_procedure outputs
Add possibility to pass an array of n_obs to ltt_procedure and subsequent p-values calculations (needed for binary classification). Unit test this behaviour.
Fix parametrizing of existing test

New risk-control code:

Implement BinaryClassificationRisk class, instances, and related unit test
Implement BinaryClassificationController class

Copilot

Pull Request Overview

This PR implements binary classification risk control in its simplest form, focusing on mono-risk and unidimensional lambda using thresholding on predict_proba. The implementation includes cleanup of existing risk control code and introduction of new binary classification risk control components.

Key changes:

Remove unnecessary components from existing LTT procedure (lambda=None check, p_values output)
Add support for array-based n_obs in p-value calculations for binary classification scenarios
Implement BinaryClassificationRisk class with predefined instances for precision, recall, and accuracy
Introduce BinaryClassificationController for threshold-based binary classification risk control

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
mapie/control_risk/ltt.py	Remove p_values return, lambda=None check, and add array support for n_obs
mapie/control_risk/p_values.py	Add array support for n_obs parameter in Hoeffding-Bentkus p-value computation
mapie/risk_control.py	Add BinaryClassificationRisk class and predefined risk instances
mapie/risk_control_draft.py	Implement BinaryClassificationController with threshold-based risk control
mapie/tests/test_control_risk.py	Update tests for LTT changes and add n_obs array testing
mapie/tests/test_risk_control.py	Add comprehensive tests for BinaryClassificationRisk instances
mapie/init.py	Remove risk_control_draft from exports

Comments suppressed due to low confidence (2)

mapie/tests/test_risk_control.py:848

The test should verify the specific condition that leads to None result (effective_sample_size == 0) rather than using a general elif clause. Consider adding an explicit assertion that effective_sample_func returns 0 when result is None.

    elif result is None:

mapie/risk_control_draft.py:18

The entire BinaryClassificationController class is marked with pragma: no cover, indicating missing test coverage. This class implements core functionality and should have comprehensive unit tests.

class BinaryClassificationController:  # pragma: no cover

mapie/risk_control.py

mapie/risk_control_draft.py

- Use BinaryClassificationRisk to compute risk - Use warning instead of error when risk is not controled. Throw error when predicting - Remove useless check on lambda=None in ltt_procedure - Remove useless p_values from ltt_procedure outputs - Add possibility to pass an array of n_obs to ltt_procedure and subsequent p-values calculations (needed for binary classification)

- Fix bentkus_p_value calculation - Fix and move higher_is_better logic in the same place - Implement unit test for BinaryClassificationRiskControl - Fix parametrizing of existing test

… positive predictions)

Valentin-Laurent force-pushed the binary-risk-control-v1 branch from f883ca9 to 13462fc Compare July 30, 2025 09:03

Valentin-Laurent requested a review from Copilot July 30, 2025 13:40

Copilot AI reviewed Jul 30, 2025

View reviewed changes

mapie/risk_control.py Outdated Show resolved Hide resolved

mapie/risk_control.py Outdated Show resolved Hide resolved

mapie/risk_control_draft.py Show resolved Hide resolved

mapie/risk_control_draft.py Outdated Show resolved Hide resolved

Valentin-Laurent added 9 commits July 31, 2025 14:47

ENH: implement BinaryClassificationRisk and related instances

ad2b889

ENH: simplify BinaryClassificationRisk API

1e71603

ENH & MTN & FIX

f4be83c

- Fix bentkus_p_value calculation - Fix and move higher_is_better logic in the same place - Implement unit test for BinaryClassificationRiskControl - Fix parametrizing of existing test

TEST - hoeffdding_bentkus_p_value with n_obs as an array

67b0ba3

FIX - linting

0799a7f

ENH - Performance, warning and docstring improvements

5ab4606

FIX - Fix local typing issue, investigate CI typing issues

c7c30f5

FIX - Continue investigating CI typing issues

ed18964

Valentin-Laurent force-pushed the binary-risk-control-v1 branch from 8f2a127 to ed18964 Compare July 31, 2025 12:50

Valentin-Laurent force-pushed the binary-risk-control branch from 37c79dc to 4404ad1 Compare July 31, 2025 12:50

Valentin-Laurent added 4 commits July 31, 2025 15:25

MTN - remove relative import

d6dd8d1

ENH & TEST - Handle the case of undefined risk (ex: precision with no…

ec3e8d0

… positive predictions)

MTN - Revert formatting to avoid changes unrelated to current PR

3d6f46b

MTN - Clarify code

d08fbea

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Binary risk control - implementation batch 1 #735

Binary risk control - implementation batch 1 #735

Uh oh!

Valentin-Laurent commented Jul 29, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Binary risk control - implementation batch 1 #735

Are you sure you want to change the base?

Binary risk control - implementation batch 1 #735

Uh oh!

Conversation

Valentin-Laurent commented Jul 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Context

Content

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Valentin-Laurent commented Jul 29, 2025 •

edited

Loading