TST use global_dtype in sklearn/metrics/tests/test_pairwise.py #22666

jjerphan · 2022-03-03T13:42:53Z

Reference Issues/PRs

Partially addresses #22881
Precedes #22590

What does this implement/fix? Explain your changes.

This parametrizes tests from test_pairwise.py to run on 32bit datasets.

Any other comments?

We could introduce a mechanism to skip running the tests on 32bit datasets if they take too much time to complete.
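As a minimal sketch of such an opt-out mechanism: the helper below always keeps float64 and adds float32 only when an environment variable is set. The variable name `SKLEARN_RUN_FLOAT32_TESTS` is the one used in the command quoted later in this thread. The helper `dtypes_to_test` is hypothetical and is not the actual fixture code.

```python
import os

import numpy as np


def dtypes_to_test(environ=os.environ):
    """Return the floating-point dtypes a test should be run with.

    Hypothetical helper: float64 is always included, float32 is opt-in
    through the SKLEARN_RUN_FLOAT32_TESTS environment variable.
    """
    dtypes = [np.float64]
    if environ.get("SKLEARN_RUN_FLOAT32_TESTS", "0") != "0":
        dtypes.append(np.float32)
    return dtypes


# Build the same toy dataset once per selected dtype.
for dtype in dtypes_to_test():
    X = np.ones((3, 2), dtype=dtype)
    assert X.dtype == dtype
```

With this kind of switch, the default run costs the same as before, and the 32bit variants only run where they are explicitly requested.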

ogrisel

Let's systematically add assertions on the expected dtype for the results of the pairwise distance computation when the dtype of the input is fully specified in the test.

I only made a few suggestions via GitHub because I cannot make suggestions on folded lines, but almost all the newly parametrized tests deserve the same treatment to make the parametrization more useful.
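For illustration, here is a sketch of what such a dtype assertion could look like. It uses a NumPy-only stand-in, `naive_euclidean_distances`, which is hypothetical and is not the scikit-learn implementation. The random data, shapes and tolerance are also assumptions.

```python
import numpy as np
from numpy.testing import assert_allclose


def naive_euclidean_distances(X, Y):
    # Hypothetical stand-in for a pairwise distance function.
    # Broadcasting yields an (n_samples_X, n_samples_Y, n_features) array.
    diff = X[:, np.newaxis, :] - Y[np.newaxis, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def check_pairwise_dtype(dtype):
    rng = np.random.RandomState(0)
    X = rng.random_sample((5, 4)).astype(dtype, copy=False)
    Y = rng.random_sample((3, 4)).astype(dtype, copy=False)

    D = naive_euclidean_distances(X, Y)

    # The input dtype is fully specified, so assert on the output dtype too.
    assert D.dtype == dtype

    # Compare the values against a float64 reference with a loose tolerance.
    D_ref = naive_euclidean_distances(X.astype(np.float64), Y.astype(np.float64))
    assert_allclose(D, D_ref, rtol=1e-5)
    return D


for dtype in (np.float64, np.float32):
    check_pairwise_dtype(dtype)
```

Checking the dtype next to the values means a silent upcast to float64 fails the test instead of going unnoticed.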

sklearn/metrics/tests/test_pairwise.py

glemaitre · 2022-06-09T09:56:41Z

sklearn/metrics/tests/test_pairwise.py

@@ -291,7 +296,7 @@ def callable_rbf_kernel(x, y, **kwds):
        (pairwise_kernels, callable_rbf_kernel, {"gamma": 0.1}),
    ],
 )
-@pytest.mark.parametrize("dtype", [np.float64, int])
+@pytest.mark.parametrize("dtype", [np.float64, np.float32, int])


Isn't it weird not to use np.int32 or np.int64, since int is platform dependent here?
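A small sketch of the concern, assuming only NumPy: the width of `np.dtype(int)` is chosen by the platform and the NumPy version (for example, it has been 32 bits on Windows with NumPy versions before 2.0), whereas `np.int32` and `np.int64` have a fixed width everywhere. The helper `make_int_inputs` is hypothetical.

```python
import numpy as np

# The builtin int maps to a platform-dependent NumPy integer dtype.
platform_int = np.dtype(int)
print(platform_int.name, platform_int.itemsize)


def make_int_inputs(dtype):
    # Hypothetical helper building a small integer dataset.
    return np.arange(6, dtype=dtype).reshape(3, 2)


# Explicit dtypes give the same input width on every platform.
for dtype in (np.int32, np.int64):
    X = make_int_inputs(dtype)
    assert X.dtype == dtype
```

Parametrizing on the explicit dtypes makes the test exercise the same code path on every CI platform.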

It could also be worth checking the output dtype here.

Checking the dtype fails here because it turns out that the returned dtype depends on n_jobs. This is a bug imo. I'll open an issue about that.

Was the issue opened? If so we could link it here.

I am not sure that an issue was created.

@jeremiedbb: do you have a snippet which reproduces this bug? This way we could create an issue and resolve it. Thanks!

I opened #24502

sklearn/metrics/tests/test_pairwise.py

ogrisel

LGTM. This is already a net improvement even if not all the tests use the global_dtype fixture yet.

I locally ran:

SKLEARN_RUN_FLOAT32_TESTS=1 pytest -v sklearn/metrics/tests/test_pairwise.py

and I get 244 passed tests instead of 194. No failures and no new warnings.

sklearn/metrics/tests/test_pairwise.py

ogrisel · 2022-09-19T09:32:37Z

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

jeremiedbb

LGTM. Thanks @jjerphan

TST Adapt test_pairwise.py to test implementations on 32bit datasets

f0418e2

github-actions bot added the module:metrics label Mar 3, 2022

jjerphan added the No Changelog Needed label Mar 3, 2022

jjerphan marked this pull request as ready for review March 3, 2022 15:01

ogrisel reviewed Mar 3, 2022

jjerphan added 3 commits March 23, 2022 15:26

Merge branch 'main' into tst/test_pairwise-32bit

67eb4f1

Review comments

1cc2a16

TST Use assert_allclose

d3ebf1d

jjerphan changed the title ~~TST Adapt test_pairwise.py to test implementations on 32bit datasets~~ TST use global_dtype in sklearn/metrics/tests/test_pairwise.py Mar 23, 2022

jjerphan mentioned this pull request Mar 24, 2022

Improve tests to make them run on variously typed data using the global_dtype fixture #22881

Open

jjerphan added 2 commits March 24, 2022 16:28

Merge branch 'main' into tst/test_pairwise-32bit

ce3ce99

TST Adapt absolute tolerance

d989c1b

jjerphan added the Waiting for Reviewer label Mar 24, 2022

Merge branch 'main' into tst/test_pairwise-32bit

064118a

glemaitre removed the Waiting for Reviewer label Jun 9, 2022

glemaitre self-requested a review June 9, 2022 09:45

glemaitre reviewed Jun 9, 2022

jeremiedbb and others added 5 commits June 9, 2022 16:22

address comments

a6c6652

lint

841b737

remove useless dtype dependent rtol.

a1c891c

Merge branch 'main' into tst/test_pairwise-32bit

8159390

TST Add atol for quasi zero arrays

1eb1eed

ogrisel approved these changes Sep 19, 2022

Replace todense to toarray

653df0a

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

jjerphan mentioned this pull request Sep 19, 2022

FEA Introduce PairwiseDistances #23958

Closed

3 tasks

Merge branch 'main' into tst/test_pairwise-32bit

a7ff548

jeremiedbb approved these changes Sep 26, 2022

Merge branch 'main' into tst/test_pairwise-32bit

a5491a2

jeremiedbb merged commit 9a76368 into scikit-learn:main Sep 26, 2022

jjerphan deleted the tst/test_pairwise-32bit branch October 21, 2022 13:25

glemaitre Jun 9, 2022

glemaitre Jun 9, 2022

jeremiedbb Jun 9, 2022

ogrisel Sep 19, 2022

jjerphan Sep 23, 2022

jeremiedbb Sep 23, 2022
