TST use global_random_seed in sklearn/linear_model/tests/test_base.py #23465


Conversation

svenstehle
Contributor

@svenstehle svenstehle commented May 25, 2022

Reference Issues/PRs

Partially addresses #22827

Task List

  • address FIXMEs for sklearn version 1.2 in code (currently at 1.2.dev0) --> out of scope for this PR
  • test_linear_regression_sample_weights
  • test_raises_value_error_if_sample_weights_greater_than_1d (--> no seed change; this tests ValueError)
  • test_linear_regression_sparse
  • test_linear_regression_sparse_equal_dense (--> out of scope, will be deprecated in v1.2)
  • test_linear_regression_multiple_outcome (--> no seed change, random data has no impact on test)
  • test_linear_regression_sparse_multiple_outcome
  • test_linear_regression_positive_vs_nonpositive
  • test_linear_regression_positive_vs_nonpositive_when_positive
  • test_linear_regression_pd_sparse_dataframe_warning (--> no seed change; this asserts warnings)
  • test_preprocess_data
  • test_preprocess_data_multioutput
  • test_preprocess_data_weighted
  • test_sparse_preprocess_data_offsets
  • test_dtype_preprocess_data
  • test_rescale_data_dense
  • todo: check remaining tests in test_base.py

What does this implement/fix? Explain your changes.

  • Added global_random_seed to all seeded tests in sklearn/linear_model/tests/test_base.py (a sketch of the conversion pattern is shown below)
  • The only test that failed on a few seeds was test_linear_regression_sample_weights
    • Fixed by removing the arbitrary check assert reg.score(X, y) > 0.5 - scores can be below 0.5 for some cases of fit_intercept=False
    • The test now passes for all seeds; more info below
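
For context, the conversion pattern looks roughly like this. This is a minimal sketch rather than an excerpt from the PR; the test name and data shapes are made up for illustration, and global_random_seed refers to the pytest fixture provided by scikit-learn's test suite:

    import numpy as np

    # Before (hypothetical): the seed was hard-coded, e.g. rng = np.random.RandomState(0)
    # After: the seed is supplied by the global_random_seed fixture, so the test
    # is exercised with whichever seed(s) the test run selects.
    def test_some_property(global_random_seed):
        rng = np.random.RandomState(global_random_seed)
        X = rng.normal(size=(6, 5))
        y = rng.normal(size=6)
        assert X.shape == (6, 5) and y.shape == (6,)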

Long version:

Observations:
  • For specific seeds (3, 15, 17, 20, 23, 25, 54, 64, 79, 91, 99), the test test_linear_regression_sample_weights fails with fit_intercept=False but passes with fit_intercept=True:

    >       assert reg.score(X, y) > 0.5
    E       assert 0.3016564425292758 > 0.5
    E        +  where 0.3016564425292758 = <bound method RegressorMixin.score of LinearRegression(fit_intercept=False)>(array([[-0.60612102, -1.05993975, -0.55091967, -0.27568627,  1.22225373],\n       [-0.90585899,  0.06935217,  2.1786556...215,  2.11737829, -0.44639435, -0.66693906],\n       [ 0.48079725,  1.98498011,  0.39190894,  2.3982028 ,  2.41736027]]), array([-1.61411983, -0.21642658,  1.66363212, -0.12201627,  1.39842066,\n       -0.63836468]))
    E        +    where <bound method RegressorMixin.score of LinearRegression(fit_intercept=False)> = LinearRegression(fit_intercept=False).score
    
    sklearn/linear_model/tests/test_base.py:80: AssertionError
    

    In the failing cases with one of these seeds and fit_intercept=False, reg.score is lower than 0.5. Setting fit_intercept=True fixes this for all seeds.

  • Any combination other than n_samples, n_features = 6, 5 results in more failed tests; e.g. reducing to n_samples, n_features = 5, 4 does not help even with fit_intercept=True.

  • n_samples needs to stay small, because it is hard to fit a linear regression to more than a few randomly drawn samples/targets; e.g. with n_samples = 30, every parametrization of the test fails.

Possible solutions
  • remove assert reg.score(X, y) > 0.5

    • we picked that one
  • run only tests with working combinations of global_random_seed and fit_intercept; e.g.

    if np.isin(global_random_seed, (3, 15, 17, 20, 23, 25, 54, 64, 79, 91, 99)) and not fit_intercept:
        pytest.skip("Unsupported Configuration")
  • mark bad combinations as xfail - we have to introduce a gate-keeper fixture for this:

    import numpy as np
    import pytest
    from scipy import sparse


    @pytest.fixture
    def xfail_selected_intercept_seed_combos(request):
        fit_intercept = request.getfixturevalue("fit_intercept")
        seed = request.getfixturevalue("global_random_seed")

        # the test is known to fail on these seeds for `fit_intercept=False`
        allowed_failures = {(False, s) for s in (3, 15, 17, 20, 23, 25, 54, 64, 79, 91, 99)}
        if (fit_intercept, seed) in allowed_failures:
            request.node.add_marker(
                pytest.mark.xfail(reason="known-bad fit_intercept/seed combination", strict=True)
            )


    @pytest.mark.parametrize("array_constr", [np.array, sparse.csr_matrix])
    @pytest.mark.parametrize("fit_intercept", [True, False])
    @pytest.mark.usefixtures("xfail_selected_intercept_seed_combos")
    def test_linear_regression_sample_weights(
        array_constr, fit_intercept, global_random_seed
    ):
        ...  # test body unchanged

Either of the last two options does the job and introduces more seeds to the test. I prefer the xfail variant: it tells a better story, since it exercises every combination and safeguards against combinations that silently start passing in the future.

Comments and opinions are very much appreciated! :)

Any other comments?

I created a task list to track the implementation progress of global_random_seed. Also, let's discuss whether to tackle the two version-1.2 related FIXME items in this PR or in another (linked?) one (how is that normally kept track of?).

@svenstehle svenstehle changed the title [WIP] TST use global_random_seed in sklearn/linear_model/tests/test_base.py TST use global_random_seed in sklearn/linear_model/tests/test_base.py May 28, 2022
@lorentzenchr
Member

@svenstehle Thanks for working on this. I think xfail on specific seeds is an anti-pattern that we should certainly avoid. Only skimming over the code changes, I see the following options:

  • Increase tolerances where applicable.
  • Improve the test where possible or remove it (e.g. why should a score be > 0.5, pretty arbitrary).
  • Mark the whole test as xfail.

@svenstehle
Contributor Author

svenstehle commented Jun 1, 2022

@svenstehle Thanks for working on this. I think xfail on specific seeds is an anti-pattern that we should certainly avoid. Only skimming over the code changes, I see the following options:

  • Increase tolerances where applicable.
  • Improve the test where possible or remove it (e.g. why should a score be > 0.5, pretty arbitrary).
  • Mark the whole test as xfail.

Hi @lorentzenchr, thanks for your reply and input. Dropping the check on reg.score is a valid idea:

  • As far as I understand it, the test checks the correct behaviour of the sample_weight parameter.
  • reg.score is unnecessary for that. It can even be smaller than 0 when the model error exceeds the baseline ((y_true - y_true.mean()) ** 2).sum(), which happens for some of those seeds because we have fit_intercept=False (see the sketch below).
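
To illustrate the point about negative scores, here is a small standalone sketch (not part of the test file; the data are made up purely for illustration):

    import numpy as np
    from sklearn.linear_model import LinearRegression

    rng = np.random.RandomState(0)
    X = rng.normal(size=(30, 5))
    y = 2.0 + rng.normal(size=30)  # y has a nonzero mean and is unrelated to X

    reg = LinearRegression(fit_intercept=False).fit(X, y)
    # R^2 = 1 - sum((y - y_pred) ** 2) / sum((y - y.mean()) ** 2)
    # Without an intercept the model cannot absorb the mean of y, so the residual
    # sum of squares can exceed the baseline and the score can go negative even
    # on the training data.
    print(reg.score(X, y))  # typically well below 0 for a draw like this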

Member

@lorentzenchr lorentzenchr left a comment


LGTM. Thanks for fixing this.

@lorentzenchr lorentzenchr added the "Quick Review" label (For PRs that are quick to review) Jun 3, 2022
@svenstehle
Contributor Author

svenstehle commented Jun 4, 2022

LGTM. Thanks for fixing this.

Thanks @lorentzenchr. There are more tests to check in this file, though; I will go through them over the weekend to see how many need changes or rework. Depending on that, we can discuss how to proceed.

Update: at least one other test is impacted.
I will open a task list; other approaches are welcome.

@svenstehle
Contributor Author

svenstehle commented Jun 4, 2022

As a conversation point, I added "address FIXMEs for sklearn version 1.2 in code (currently at 1.2.dev0)" to the task list. But we should probably not address these two FIXME items in this PR; they are about the deprecation of normalize. Will they be handled in a PR concerning sklearn/linear_model/_base.py?

@lorentzenchr
Member

@svenstehle Please keep the instructions of #22827 in mind, e.g.

We probably do not need to convert all scikit-learn tests to use this fixture. We should instead focus our efforts on tests that actually check for important mathematical properties of our estimators or model evaluation tools. For instance, there is no need to check for the seed-insensitivity of tests that check for the exception messages raised when passing invalid inputs.

For instance, test_raises_value_error_if_sample_weights_greater_than_1d does not need the global random seed fixture.

@svenstehle
Contributor Author

@svenstehle Please keep the instructions of #22827 in mind, e.g.

We probably do not need to convert all scikit-learn tests to use this fixture. We should instead focus our efforts on tests that actually check for important mathematical properties of our estimators or model evaluation tools. For instance, there is no need to check for the seed-insensitivity of tests that check for the exception messages raised when passing invalid inputs.

For instance, test_raises_value_error_if_sample_weights_greater_than_1d does not need the global random seed fixture.

Good catch, thanks for raising this. Addressed

…hts_greater_than_1d since we are only checking that no value errors are raised
@glemaitre glemaitre self-requested a review June 13, 2022 18:25
@@ -233,8 +235,9 @@ def test_linear_regression_sparse_equal_dense(normalize, fit_intercept):
assert_allclose(clf_dense.coef_, clf_sparse.coef_)


-def test_linear_regression_multiple_outcome(random_state=0):
+def test_linear_regression_multiple_outcome(global_random_seed):
Member


We can set a local random state here.
We only check that we fit independently on different targets, and the randomness will not impact anything apart from creating the dataset.

Contributor Author


Good point, addressed.

# Test multiple-outcome linear regressions
random_state = check_random_state(global_random_seed)
X, y = make_regression(random_state=random_state)
Member


You can pass random_state=0 directly.
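
(For reference, the suggested simplification would look roughly like the following; this is a sketch, not the actual diff:)

    from sklearn.datasets import make_regression

    # Creating the dataset is the only place randomness is used in this test,
    # so a fixed seed is sufficient; no need to parametrize via global_random_seed.
    X, y = make_regression(random_state=0)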

Contributor Author


addressed

 # Test multiple-outcome linear regressions with sparse data
-random_state = check_random_state(random_state)
+random_state = check_random_state(global_random_seed)
 X, y = make_sparse_uncorrelated(random_state=random_state)
Member


I would say that this is the same case as above, but for sparse matrices, so no real need to parametrize the random state here.

 # Test multiple-outcome nonnegative linear regressions
-random_state = check_random_state(random_state)
+random_state = check_random_state(global_random_seed)
 X, y = make_sparse_uncorrelated(random_state=random_state)
Member


same as above

Member

@glemaitre glemaitre left a comment


I think there are a couple of places where we can avoid parametrizing. Otherwise LGTM.

svenstehle and others added 2 commits June 19, 2022 16:02
…execution order

Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>
…te' and replace all 'random_state' variables with 'rng'
Member

@lorentzenchr lorentzenchr left a comment


LGTM

Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>
@glemaitre glemaitre merged commit 158b620 into scikit-learn:main Jun 22, 2022
@glemaitre
Member

Thanks @svenstehle LGTM. Merging

ogrisel pushed a commit to ogrisel/scikit-learn that referenced this pull request Jul 11, 2022
…scikit-learn#23465)

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>
Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>