Make the test suite itself thread-safe to be able to detect thread-safety problems with or without free-threading #30041

ogrisel · 2024-10-10T10:19:02Z

This is an investigative branch, not meant to be merged as such, to investigate what effort would be required to tackle #30007 as a whole (using pytest-run-parallel instead of pytest-freethreaded).

It includes commits from:

That can be merged independently.

On top of this, I started to use @pytest.mark.parallel_threads(1) (EDIT: now @pytest.mark.thread_unsafe) on tests that are fundamentally not thread-safe or that use other fixtures such as tmpdir that would require to be initialized in the thread running the test to function properly (as opposed to running in the main pytest thread).

Many more similar changes to silence all the non-informative failures, but I will stop there for now.

This already highlights that the most common patterns are:

testing for warnings with pytest.warns;
testing for sys.stdout with capsys;
usage of the tmpdir fixture;
usage of the monkeypatch fixture.

EDIT: since the first version of this PR, pytest-run-parallel has been updated to make the tmpdir and tmp_path fixture work by default and automatically detect tests that use problematic fixtures and code patterns involving warnings automatically. The remaining problematic fixtures can probably be added by configuration.

I have also found a failure in test_minibatch_kmeans_partial_fit_init with the lambda init case using:

pytest --parallel-threads=4  sklearn/cluster/tests/test_k_means.py -k test_minibatch_kmeans_partial_fit_init

and I cannot explain it yet (using regular Python with GIL enabled).

…ta_routing=True)

…t functions

github-actions · 2024-10-10T10:20:23Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: c47401e. Link to the linter CI: here}

ogrisel · 2024-10-10T10:26:57Z

Note that, I don't plan to continue manually flagging test functions individually. It's too tedious and invasive.

sklearn/cluster/tests/test_affinity_propagation.py

sklearn/cluster/tests/test_birch.py

sklearn/cluster/tests/test_hierarchical.py

sklearn/cluster/tests/test_k_means.py

sklearn/datasets/tests/test_openml.py

…ithub UI

…st-run-parallel

sklearn/ensemble/tests/test_gradient_boosting.py

ogrisel · 2025-05-09T21:05:22Z

I updated the PR to simplify it and rely on automated thread-unsafety heuristics of the latest version of pytest-run-parallel as much as possible. The heuristics seem to work as expected and the amount of manual annotation is much reduced as a result.

I started to run pytest --parallel-threads=4 --iterations=3 on a nogil install. However, I interrupted it after 12% because it's very slow and using all the RAM available on my machine.

Note that tests collection itself is slow, but still bearable compared to the actual test execution time and the memory usage problem. I guess we will need to profile a run on submodule of scikit-learn to check whether the slowdown is expected: when running with --parallel-threads=4 --iterations=3 there are 4 x 3 more tests to run compared to a regular pytest run.

For the record, I created the nogil env using conda-forge:

mamba create -n nogil -c conda-forge python-freethreading ipython numpy scipy cython meson-python compilers

ogrisel · 2025-05-10T09:04:04Z

To avoid the memory usage problem, I reduced the parallelism to 2 threads and run 1 iteration of the tests per thread:

pytest -v --parallel-threads=2 --iterations=1  2>&1 | tee pytest_all.log

Here are the result: pytest_all.log

There are 146 errors. Many of those are caused by the use of stateful generators in @pytest.mark.parametrize.

EDIT: the specific problem with generators is now tracked at: Quansight-Labs/pytest-run-parallel#57

EDIT 2: actually, the generator state is not the culprit, but state within objects returned by the generators. Those tests are fundamentally thread-unsafe and need to be updated if we want to use pytest-run-parallel.

ogrisel · 2025-05-13T08:55:26Z

sklearn/decomposition/tests/test_online_lda.py

@@ -430,6 +430,7 @@ def check_verbosity(
    ],
 )
 @pytest.mark.parametrize("csr_container", CSR_CONTAINERS)
+@pytest.mark.thread_unsafe  # manually captured stdout


Note: If we want to avoid having to manually add those thread_unsafe annotations, we should use the capsys fixture instead of manually monkey patching sys.stdout and pytest-run-parallel will automatically treat this test as thread-unsafe.

…dataset.py

…or_checks.py

ogrisel added 8 commits October 9, 2024 17:16

MAINT replace enable_slep006 fixture by @config_context(enable_metada…

925b8c7

…ta_routing=True)

MAINT remove side-effects in test_partial_dependence

a5f3149

MAINT use @pytest.mark.parallel_threads(1) for all capsys fixtured tests

2478bbd

Merge branch 'drop-enable_slep006-fixture' into thread-parallel-tests

ad14988

MAINT use @pytest.mark.parallel_threads(1) for more thread-unsafe tes…

22dd1ed

…t functions

Avoid test side effect in parametrized estimator instance

bf593f0

MAINT use @pytest.mark.parallel_threads(1) for more thread-unsafe tes…

03d1741

…t functions

MAINT use @pytest.mark.parallel_threads(1) for more thread-unsafe tes…

812d04a

…t functions

ogrisel added the No Changelog Needed label Oct 10, 2024

ogrisel mentioned this pull request Oct 10, 2024

Upgrade free-threading CI to run with pytest-freethreaded instead of pytest-xdist #30007

Open

ogrisel commented May 9, 2025

View reviewed changes

ogrisel added 2 commits May 9, 2025 16:48

Started to remove some @pytest.mark.parallel_threads(1) markers via g…

21a17cf

…ithub UI

Remove markers for cases that should be handled automatically by pyte…

6c3581f

…st-run-parallel

ogrisel commented May 9, 2025

View reviewed changes

sklearn/ensemble/tests/test_gradient_boosting.py Outdated Show resolved Hide resolved

sklearn/ensemble/tests/test_gradient_boosting.py Outdated Show resolved Hide resolved

ogrisel added 5 commits May 9, 2025 17:29

Improve comment

bd740d9

Merge branch 'main' into thread-parallel-tests

34f90f1

Mark monkeypatching fixtures as thread-unsafe

2ff3190

Fix pyproject.toml

b763e7c

Fix side effect in test_all_init

c26fd32

ogrisel mentioned this pull request May 9, 2025

MNT Mark cython extensions as free-threaded compatible #31342

Open

ogrisel commented May 13, 2025

View reviewed changes

ogrisel mentioned this pull request May 13, 2025

Automatic handling of @pytest.mark.parametrize("param_name", param_value_generator) annotated tests Quansight-Labs/pytest-run-parallel#57

Closed

ogrisel added 4 commits May 22, 2025 15:22

Make test_seq_dataset_shuffle thread safe

d4b148f

Fix remaining thread-safety problems in sklearn/utils/tests/test_seq_…

b1f1fde

…dataset.py

Make sklearn/utils/tests/test_response.py thread-safe

ba83589

Make sklearn/utils/tests/test_pprint.py thread-safe

8f210eb

ogrisel added 6 commits May 22, 2025 16:21

Make sklearn/utils/tests/test_extmath.py thread-safe

e63c48e

Manually mark thread-unsafe tests in sklearn/utils/tests/test_estimat…

89cf3ee

…or_checks.py

Fix the sklearn.tree.export_tree function to make it thread-safe

9abc43e

Make sklearn/tests/test_multioutput.py thread-safe

fa23e5d

Make sklearn/tests/test_metaestimators_metadata_routing.py thread-safe

05c7fd2

Make sklearn/tests/test_metaestimators.py thread-safe

c47401e

ogrisel changed the title ~~DEBUG Thread parallel tests~~ Make the test suite itself thread-safe to be able to detect thread-safety problems with or without free-threading May 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Make the test suite itself thread-safe to be able to detect thread-safety problems with or without free-threading #30041

Make the test suite itself thread-safe to be able to detect thread-safety problems with or without free-threading #30041

Uh oh!

ogrisel commented Oct 10, 2024 •

edited

Loading

Uh oh!

github-actions bot commented Oct 10, 2024 •

edited

Loading

Uh oh!

ogrisel commented Oct 10, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ogrisel commented May 9, 2025 •

edited

Loading

Uh oh!

ogrisel commented May 10, 2025 •

edited

Loading

Uh oh!

ogrisel May 13, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Make the test suite itself thread-safe to be able to detect thread-safety problems with or without free-threading #30041

Are you sure you want to change the base?

Make the test suite itself thread-safe to be able to detect thread-safety problems with or without free-threading #30041

Uh oh!

Conversation

ogrisel commented Oct 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Oct 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✔️ Linting Passed

Uh oh!

ogrisel commented Oct 10, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ogrisel commented May 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ogrisel commented May 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ogrisel May 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ogrisel commented Oct 10, 2024 •

edited

Loading

github-actions bot commented Oct 10, 2024 •

edited

Loading

ogrisel commented May 9, 2025 •

edited

Loading

ogrisel commented May 10, 2025 •

edited

Loading

ogrisel May 13, 2025 •

edited

Loading