TST add balance property check for linear models #22892

lorentzenchr · 2022-03-18T16:07:09Z

Reference Issues/PRs

none

What does this implement/fix? Explain your changes.

This adds a test for all suitable models in sklearn.linear_model that the balance property holds when fit with fit_intercept=True: sum(predicted) = sum(observed) on the training data.

Any other comments?

Placed in linear_model.test.test_common.py.
Failing tests are marked as xfail.

lorentzenchr · 2022-10-07T21:52:50Z

@rth @agramfort @TomDLT might be interested.

agramfort

regarding I don't know. @TomDLT should know better.

sklearn/linear_model/tests/test_common.py

lorentzenchr · 2022-10-09T10:18:08Z

I'm not sure where the best place for such a test is. Other models like trees and isotonic regression (should) also fulfil this property and pass such a test.

TomDLT

LGTM

Failing tests:

For SGDRegressor, it is expected that the algorithm does not fully converge to a high-precision solution. It is due to the stochastic nature of the SGD algorithm, I would just skip the test.
For SAG/SAGA, it is a bug, which is due to the fact that the random-sample selection does not take into account the sample weights. To fix the bug, we should implement an importance sampling scheme (see LogisticRegression with SAGA using sample_weight does not converge #21305).

jjerphan

LGTM. Thank you for adding this theoretical assertion, @lorentzenchr!

sklearn/linear_model/tests/test_common.py

TST add balance property check for linear models

b61bb36

github-actions bot added the module:linear_model label Mar 18, 2022

lorentzenchr added No Changelog Needed and removed module:linear_model labels Mar 18, 2022

lorentzenchr added 3 commits March 18, 2022 17:10

CLN remove layman's print statements

b75c89c

TST mark SGDRegressor as xfail

05140aa

CLN set normalize=False in OrthogonalMatchingPursuit

7410b8f

lorentzenchr added the module:linear_model label Mar 18, 2022

lorentzenchr mentioned this pull request Apr 20, 2022

Clarify and test dropping categories in linear models #23172

Open

2 tasks

lorentzenchr mentioned this pull request Aug 25, 2022

[MRG] Implement Centered Isotonic Regression #21454

Closed

lorentzenchr added 3 commits October 7, 2022 23:35

Merge branch 'main' into balance_property_linear_models

5518a0a

DOC improve comments

6912093

CLN

bbafb7f

DOC better wording of comment

34a1e72

agramfort reviewed Oct 9, 2022

View reviewed changes

sklearn/linear_model/tests/test_common.py Outdated Show resolved Hide resolved

CLN fix typos

5e14417

TomDLT approved these changes Oct 11, 2022

View reviewed changes

lorentzenchr added 4 commits October 25, 2022 14:15

CLN better xfail reason for SAGA

4cc8df9

Merge branch 'main' into balance_property_linear_models

69e97b9

TST loosen tolerance for SGDRegressor

1ad58fc

CLN remove normalize and increase max_iter

887b6da

cmarmo added the Waiting for Second Reviewer First reviewer is done, need a second one! label Oct 28, 2022

jjerphan approved these changes Nov 18, 2022

View reviewed changes

sklearn/linear_model/tests/test_common.py Outdated Show resolved Hide resolved

sklearn/linear_model/tests/test_common.py Show resolved Hide resolved

sklearn/linear_model/tests/test_common.py Show resolved Hide resolved

CLN address review comments

090688c

lorentzenchr removed the Waiting for Second Reviewer First reviewer is done, need a second one! label Nov 18, 2022

jjerphan merged commit aec3735 into scikit-learn:main Nov 18, 2022

lorentzenchr deleted the balance_property_linear_models branch November 18, 2022 16:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TST add balance property check for linear models #22892

TST add balance property check for linear models #22892

lorentzenchr commented Mar 18, 2022

lorentzenchr commented Oct 7, 2022

agramfort left a comment

lorentzenchr commented Oct 9, 2022

TomDLT left a comment

jjerphan left a comment

TST add balance property check for linear models #22892

TST add balance property check for linear models #22892

Conversation

lorentzenchr commented Mar 18, 2022

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

lorentzenchr commented Oct 7, 2022

agramfort left a comment

Choose a reason for hiding this comment

lorentzenchr commented Oct 9, 2022

TomDLT left a comment

Choose a reason for hiding this comment

jjerphan left a comment

Choose a reason for hiding this comment