# Improve tests by using global_random_seed fixture to make them less seed-sensitive #22827
I labeled this issue as hard, as I expect some tests to be hard to upgrade. However, I expect the majority of the tests to be easy to convert; we just cannot know in advance which ones will be easy.
I updated the original message with the feature introduced in #23026 where we can run the CI with all the seeds by pushing a commit with the proper message. @ogrisel Feel free to update the wording if you find it unclear.
Note, running `git commit --allow-empty` allows you to have a commit message without any changes.
Working on `sklearn/linear_model/tests/test_bayes.py`.
To anyone interested in contributing: remember to add an empty commit to your PR, in accordance with this issue (see the commit message structure in the original message below).
Commenting here as a friendly reminder, @glemaitre.
Thanks @DeaMariaLeon for the reminder. However, briefly looking at …
Sorry for wasting your time. I was looking at the wrong file. 🫣
Per Guillaume, commenting here that I opened a PR for …
Working on … Edit: these tests need to be removed from the list, as the PRs have already been merged: `sklearn/decomposition/tests/test_truncated_svd.py` #30922.
### Context: the new `global_random_seed` fixture

#22749 introduces a new `global_random_seed` fixture to make it possible to run the same test with any seed between 0 and 99 included. By default, when `SKLEARN_TESTS_GLOBAL_RANDOM_SEED` is not set, this fixture deterministically returns 42 to keep test runs deterministic by default and avoid any unnecessary disruption. However, different CI builds set this seed to other arbitrary (still deterministic) values, and nightly scheduled builds on Azure now use `SKLEARN_TESTS_GLOBAL_RANDOM_SEED="any"` to progressively explore all the seeds in the 0-99 range.

### Motivation
The aim of this new fixture is to make sure that we avoid writing tests that artificially depend on a specific value of the random seed, thereby unknowingly hiding a real mathematical problem in our code (see e.g. #21701 (comment)). At the same time, we still want to keep the tests deterministic and independent of the execution order by default, to avoid introducing unnecessary maintenance overhead.

In addition to making the tests seed-insensitive, randomizing them with different seeds has the side benefit of making their assertions robust to small numerical variations that could otherwise stem from other sources, such as platform-specific or dependency-specific numerical rounding variations that we do not cover in our existing CI infrastructure.
More details about the fixture are available in the online dev doc for the `SKLEARN_TESTS_GLOBAL_RANDOM_SEED` env variable: https://scikit-learn.org/dev/computing/parallelism.html#environment-variables

### Guidelines to convert existing tests
We probably do not need to convert all scikit-learn tests to use this fixture. We should instead focus our efforts on tests that actually check important mathematical properties of our estimators or model evaluation tools. For instance, there is no need to check the seed-insensitivity of tests that only check the exception messages raised when passing invalid inputs.
To avoid having to review huge PRs that impact many files at once and can lead to conflicts, let's open PRs that edit at most one test file at a time. For instance, use a title such as the example below:
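An illustrative title, mirroring the linked PR #22862 (substitute the test file you are actually editing):

```
TST ensure that sklearn/metrics/tests/test_pairwise_distances_reduction.py is seed insensitive
```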
Please reference #22827 in the description of the PR and put the full filename of the test file you edit in the title of the PR.

To convert an existing test with a fixed seed, the general pattern is to rewrite a function such as:
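A minimal sketch of the "before" state (the function name, data generation, and assertion are placeholders for the test's actual logic):

```python
import numpy as np


def test_some_function():
    # Hard-coded seed: the test only ever exercises one RNG state.
    rng = np.random.RandomState(0)
    X = rng.normal(size=(100, 5))  # placeholder data generation
    # ... fit a model / compute a quantity on X and assert on the result ...
```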
to:
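The corresponding "after" state, where the fixture value seeds the RNG (same placeholder names as above):

```python
import numpy as np


def test_some_function(global_random_seed):
    # The seed now comes from the fixture (42 by default, 0-99 on the CI).
    rng = np.random.RandomState(global_random_seed)
    X = rng.normal(size=(100, 5))  # same placeholder data generation
    # ... same test body and assertions ...
```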
and then check that the test function is actually seed-insensitive by running it locally with all seeds between 0 and 99 (this can be slow: only run it for one specific test at a time!):
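Assuming a hypothetical module path and the placeholder test name from the sketches above, the invocation would look like:

```
SKLEARN_TESTS_GLOBAL_RANDOM_SEED="all" pytest -v sklearn/some_module/tests/test_some_module.py -k test_some_function
```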
If this is not the case, the test will probably need to be reworked to find a more stable way to check the interesting mathematical properties:
- if the failing assertions are related to the generalization performance of a model, maybe the training set size should be slightly bigger (while keeping the test runtime as fast as possible), the data should have fewer noisy features, or the training should be done with stronger regularization. Or, more simply, we can relax the tolerance threshold while ensuring it does not become trivial (e.g. by comparing to a trivial baseline);
- if the failing assertions depend on some regularities of a synthetically generated dataset, consider decreasing the noise level of the dataset;
- some tests might also fail when encountering data that triggers edge cases, such as (near-)tied distances between datapoints that make the outcome of the computation unstable. Changing the data generation code to significantly decrease the likelihood of those edge cases (e.g. by adding more noise to the input features) can help in those cases.
Note: in most cases, tweaking the tolerances of the assertions is not the appropriate way to make the tests pass. The first thing to do is to try to understand what the test is checking, whether the test is correct, and whether its expectations are realistic. Then, if the test seems correct and should pass for all random seeds but doesn't, investigate whether the estimator or function is bugged. As a last resort, tolerances can be loosened if the test is considered valid but aims to check a statistical property that is highly sensitive to the random seed.
In some cases, it might be very hard to write a seed-insensitive test that tolerates all seeds between 0 and 99 while still running in less than 1 s. In those (hopefully rare) cases, I think it's fine to reduce the range of admissible seeds with the following pattern:
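A minimal sketch of such a pattern (the test name, modulus, and comment are illustrative, not a fixed project convention):

```python
import numpy as np


def test_hard_statistical_property(global_random_seed):
    # Fold the 0-99 fixture range onto 0-9: this test is too slow or too
    # seed-sensitive to pass for all 100 seeds. See #22827 for context.
    rng = np.random.RandomState(global_random_seed % 10)
    # ... rest of the test ...
```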
It is also possible to run the CI of a pull request with all the seeds for the tests that use the `global_random_seed` fixture, by pushing a commit message with the structure shown below.
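Assuming the convention introduced in #23026, the commit message is expected to look roughly like this, with the `[all random seeds]` marker on the title line followed by the names of the tests to run:

```
<title> [all random seeds]
<test_name_1>
<test_name_2>
```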
Note, running `git commit --allow-empty` allows you to push such a commit message without including any file changes.

See the following issue for more details on why testing on the CI is necessary: …
### List of test modules to upgrade
- `sklearn/metrics/tests/test_pairwise_distances_reduction.py` (#22862)

Note that some of those files might not have any test to update.