TST Improve `assert_argkmin_results_quasi_equality` error message #27281

jjerphan · 2023-09-03T15:29:02Z

Reference Issues/PRs

Updated scope of this PR

This changes the way we check neighbor results, notably for test_pairwise_distances_argkmin with float32 so that:

the error messages are more precise in case of failure,
the are no longer platform specific false positive failures as detected in ⚠️ CI failed on Ubuntu_Atlas.ubuntu_atlas ⚠️ #27126 and linked issues (from nightly builds with different random seeds on different platforms),

Furthermore, the tests parameterization has been updated to make it faster to run all the tests with all the seeds while still exploring various problem sizes.

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz> Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

github-actions · 2023-09-03T15:30:42Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 4a0592a. Link to the linter CI: here}

test_pairwise_distances_argkmin

ogrisel · 2023-09-11T15:33:25Z

@jjerphan I pushed a commit to run test_pairwise_distances_argkmin with "all random seeds" so that we can see if the improved error message help us understand what's going on with #27126.

jjerphan · 2023-09-11T15:54:57Z

Feel free to adapt this PR at will.

ogrisel · 2023-09-11T16:00:39Z

I ran one of the failing tests of #27126 locally using:

SKLEARN_TESTS_GLOBAL_RANDOM_SEED="45" pytest sklearn/metrics/tests/test_pairwise_distances_reduction.py -v -k "test_pairwise_distances_argkmin and cityblock" -x  --pdb

the error message was:

E               AssertionError: Neighbors indices are not matching when rounding distances at
E                 3 significant digits derived from rtol=1.0e-04.
E                 
E                 Query vector index         : 61
E                 Reference ordered distances: [159450.703125 160197.78125  160429.53125  161349.21875  161953.109375
E                  162041.515625 162137.984375 162665.90625  162700.25     163778.59375 ]
E                 Computed ordered distances : [159450.65128034 160197.77867103 160429.4601412  161349.25246161
E                  161953.08416814 162041.40858293 162138.01407665 162665.88381121
E                  162700.24564451 163778.55885714]
E                 Norm. abs. ord. dist. diff.: 0.03828817754983902
E                 Rounded distance           : 164000.0
E                 Neighbors group rank       : 5
E                 Reference neighbors indices: [16]
E                 Computed neighbors indices : [88]
E               assert {16} == {88}
E                 Extra items in the left set:
E                 16
E                 Extra items in the right set:
E                 88
E                 Full diff:
E                 - {88}
E                 + {16}

but I also wanted to see the distances and the matching indices arrays, so I used the debugger:

(Pdb) p ref_indices_row
array([39, 48, 21, 56, 46, 93, 59, 54, 45, 16])
(Pdb) p indices_row
array([39, 48, 21, 56, 46, 93, 59, 54, 45, 88])
(Pdb) p ref_dist_row
array([159450.703125, 160197.78125 , 160429.53125 , 161349.21875 ,
       161953.109375, 162041.515625, 162137.984375, 162665.90625 ,
       162700.25    , 163778.59375 ])
(Pdb) p dist_row
array([159450.65128034, 160197.77867103, 160429.4601412 , 161349.25246161,
       161953.08416814, 162041.40858293, 162138.01407665, 162665.88381121,
       162700.24564451, 163778.55885714])

The distances for the last neighbor are indeed very close and the difference happens at the 8th digits which might be expected for float32 rounding errors.

Using pdb I went up in the stack to compare the distances in the reference distance matrix computed by cdist:

(Pdb) p dist_matrix[61, 16]
163778.57524980605
(Pdb) p dist_matrix[61, 88]
163778.55885714293

so those are indeed close. Let's have a look at the next neighbors in the reference cdist matrix:

(Pdb) p dist_matrix.argsort(axis=1)[61, :15]
array([39, 48, 21, 56, 46, 93, 59, 54, 45, 88, 16, 15, 17, 73,  7])

so 88 and 16 are indeed the closest candidate for rank k=10 according to cdist.

So we should probably find a way to tolerate those cases where the Cython code returns neighbors swaps within the range of rounding errors of the chosen dtype.

ogrisel · 2023-09-11T16:03:26Z

The CI has failed because of the way I formatted the commit message. Let me push another empty commit with the right message structure.

ogrisel · 2023-09-11T17:19:14Z

@jjerphan the last commit message makes it possible to reproduce all the failures with the extra info of this PR.

I suppose we should simplify the code of assert_argkmin_results_quasi_equality. Instead of the complex bucketing strategy, we could provide the full reference distance row (without sorting and truncation) and check that the top k computed neighbors are:

in the right order based on the Cython computed distances,
have Cython distance values that match the reference cdist distance within tolerance (irrespective of the reference ordering),
such that there exist no other element in the reference distance row of the query that is significantly smaller most distant top-k neighbor found by Cython.

This should be probably more robust while being both strict enough and easier to debug in case of failure. WDYT?

jjerphan · 2023-09-11T20:18:38Z

We could do that yes.

I think the current yet complex assertion checks that the ordering between both algorithms is identical (up to a bucketing for acceptable numerical imprecision of float32). Is there a way we could check that with what you propose? If I understand your first item correctly, it does not propose to assert that but only the correct ordering of the index w.r.t their distances, is this right?

ogrisel · 2023-09-12T07:33:31Z

Is there a way we could check that with what you propose? If I understand your first item correctly, it does not propose to assert that but only the correct ordering of the index w.r.t their distances, is this right?

Yes but since the computed distances for each neighbor should match the reference computed distance within round-off errors and furthermore that there should not exist any significantly closer neighbor based on the reference distances, the approximate ordering should also be satisfied as a result.

ogrisel · 2023-09-12T07:33:55Z

Let me try to give it a shot.

ogrisel · 2023-09-13T15:54:42Z

@jjerphan I pushed two commits to implement what I had in mind with even stronger tests for the assertion function. This also fixes some xfail edge cases (at least on my machine).

I decided to focus this PR to the argkmin tests for now but we could do a follow-up PR to do a similar treatment for the radius neighbors tests and further simplify this test file.

Once the last commit CI is green, I will push a new empty commit with message [azure parallel] [all random seeds] test_pairwise_distances_argkmin to re-run will all seeds on the CI.

ogrisel · 2023-09-13T18:16:23Z

The code coverage report reveals that the tests for the checker could be improved. I will do that tomorrow. In the mean time let me push the all random seed commit.

…by 10

…ults_quasi_equality

…seed

…ive message with even stronger tests and a fix

… missing neighbor detection

ogrisel · 2023-09-15T10:39:58Z

@jjerphan finally this PR is ready for reviewing.

It should reduce the average CI time quite significantly (see the CI reports of the second to last commit).

jjerphan · 2023-09-15T12:16:39Z

Thanks!

I do not really have time to review it now, but I trust you.

(I find that having me approve this PR is a bit weird since I opened it but I, in fact, have not contributed to it much but you.)

…ults_quasi_equality

ogrisel · 2023-09-16T11:43:38Z

With the recent merges to accelerate conda install and reduce time of the slowest test, combined with this PR, the macOS CI is almost fast again!

jjerphan

I have not reviewed it entirely, but I am trusting you for coming up with better assertions than mine.

LGTM for the sake of a faster CI and better error messages.

(I can't approve this PR, but I could if you reopen a PR with this branch.)

ogrisel · 2023-09-17T08:27:07Z

I have not reviewed it entirely, but I am trusting you for coming up with better assertions than mine.

Let's wait for you to get a but of time to have a quick look, at least on the updated tests and their messages. If the test cases and the messages make sense to you, then this should be good.

I can't approve this PR.

You can just say so in a comment :)

ogrisel · 2023-09-17T08:28:14Z

sklearn/metrics/tests/test_pairwise_distances_reduction.py

-def test_assert_argkmin_results_quasi_equality():
-    rtol = 1e-7
-    eps = 1e-7
+def test_assert_compatible_argkmin_results():


In my opinion, this is the main entry point for the review.

ogrisel · 2023-09-17T08:28:47Z

sklearn/metrics/tests/test_pairwise_distances_reduction.py

-    rtol = 1e-7
-    eps = 1e-7
+@pytest.mark.parametrize("check_sorted", [True, False])
+def test_assert_compatible_radius_results(check_sorted):


And this is a second entry point point to the review.

jjerphan

✔️ LGTM. Thank you, @ogrisel.

Here are some comments which need not be treated and a question.

Edit: I would not block this PR if this fixes errors on the CI which have been there for too long.

sklearn/neighbors/tests/test_neighbors.py

sklearn/metrics/tests/test_pairwise_distances_reduction.py

Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>

…ults_quasi_equality

glemaitre

Only nitpicks. I am fine with the new tests.

sklearn/metrics/tests/test_pairwise_distances_reduction.py

glemaitre · 2023-09-26T15:48:25Z

sklearn/metrics/tests/test_pairwise_distances_reduction.py

+    indices_row_a,
+    indices_row_b,
+    threshold,
+):


It would also be nice to have a docstring if the code below is already explicit.

Done in 7fb4261.

sklearn/metrics/tests/test_pairwise_distances_reduction.py

glemaitre · 2023-09-26T16:30:47Z

sklearn/metrics/tests/test_pairwise_distances_reduction.py


-        for neighbor_rank in range(n_neighbors):
-            rounded_dist = relative_rounding(
-                ref_dist_row[neighbor_rank],
-                n_significant_digits=n_significant_digits,
-            )
-            reference_neighbors_groups[rounded_dist].add(ref_indices_row[neighbor_rank])
-            effective_neighbors_groups[rounded_dist].add(indices_row[neighbor_rank])
-
-        # Asserting equality of groups (sets) for each distance
-        msg = (
-            f"Neighbors indices for query {query_idx} are not matching "
-            f"when rounding distances at {n_significant_digits} significant digits "
-            f"derived from rtol={rtol:.1e}"
+        # Check that any neighbor with distances below the rounding error threshold have
+        # matching indices.
+        threshold = (1 - rtol) * np.maximum(
+            np.max(dist_row_a), np.max(dist_row_b)


I needed to scratch my head here. rtol is a bit different from the other usage. I don't know if this is siginificant enough to be mentioned (having the docstring in the assert_no_missing_neighbors will help already.)

I hope that 7fb4261 helps.

glemaitre · 2023-09-26T16:37:20Z

sklearn/metrics/tests/test_pairwise_distances_reduction.py

+    # checking the results is expensive for large result sets), yielding 0 most
+    # of the time would make the test useless.
+    if precomputed_dists is None and metric is None:
+        raise ValueError("Either metric or dists must be provided")


Apparently this is not covered.

_non_trivial_radius is a test helper function. It's not meant to be used in the library so I think it's fine (in particular it is not accounted in the test coverage report). This exception is just there to help maintainers use it correctly if they refactor the tests again in the future.

In that case, can it be:

assert ( precomputed_dists is not None or metric is not None ), "Either metric or dists must be provided"

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

…ults_quasi_equality

ogrisel · 2023-09-27T16:25:50Z

Thanks for the review @glemaitre.

ogrisel

Green tick for #27281 (review).

glemaitre

It looks good on my side.

…ikit-learn#27281) Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org> Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

test: Improve assert_argkmin_results_quasi_equality error message

a9f4617

Signed-off-by: Julien Jerphanion <git@jjerphan.xyz> Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

github-actions bot added the module:metrics label Sep 3, 2023

jjerphan added the No Changelog Needed label Sep 3, 2023

jjerphan marked this pull request as ready for review September 3, 2023 15:37

This was referenced Sep 3, 2023

⚠️ CI failed on Ubuntu_Atlas.ubuntu_atlas ⚠️ #27126

Closed

PERF PairwiseDistancesReductions subsequent work #25888

Open

Trigger [all random seeds] for test_pairwise_distances_argkmin

a2b9a80

test_pairwise_distances_argkmin

[azure parallel] [all random seeds] test_pairwise_distances_argkmin

37df634

ogrisel added 2 commits September 13, 2023 16:38

Simpler way to assert approximate equality of neighbor results

109db17

Focus this PR on argkmin tests for now

baa792d

Fix edge cases by making check for missing indices symmetric

755f641

ogrisel added 3 commits September 13, 2023 20:16

[azure parallel] [all random seeds] test_pairwise_distances_argkmin

6a5fa0f

Speed-up test_pairwise_distances_* by dividing the number of queries …

9a975b3

…by 10

[azure parallel] [all random seeds] test_pairwise_distances

04f7bdd

ogrisel force-pushed the tst/improve-error-message-assert_argkmin_results_quasi_equality branch from d4161f7 to 04f7bdd Compare September 14, 2023 06:22

ogrisel added 4 commits September 14, 2023 10:54

Merge branch 'main' into tst/improve-error-message-assert_argkmin_res…

bf08959

…ults_quasi_equality

Further speed-up pairwise distance tests by leveraging global random …

9ec1263

…seed

Simplify assert_argkmin_results_quasi_equality to always use informat…

21ed726

…ive message with even stronger tests and a fix

Fix inline comments in test + one more test case to check symmetry of…

c66770e

… missing neighbor detection

ogrisel added 2 commits September 15, 2023 14:44

Typo: if metric in metric in ...

e85d81b

Merge branch 'main' into tst/improve-error-message-assert_argkmin_res…

119c923

…ults_quasi_equality

ogrisel mentioned this pull request Sep 16, 2023

TST Speed up some of the slowest tests #27383

Merged

jjerphan commented Sep 16, 2023

View reviewed changes

ogrisel reviewed Sep 17, 2023

View reviewed changes

ogrisel mentioned this pull request Sep 18, 2023

⚠️ CI failed on Linux_nogil.pylatest_pip_nogil ⚠️ #27394

Closed

ogrisel mentioned this pull request Sep 25, 2023

⚠️ CI failed on Linux_nogil.pylatest_pip_nogil ⚠️ #27460

Closed

glemaitre self-requested a review September 25, 2023 15:37

jjerphan commented Sep 25, 2023

View reviewed changes

ogrisel reviewed Sep 26, 2023

View reviewed changes

sklearn/metrics/tests/test_pairwise_distances_reduction.py Outdated Show resolved Hide resolved

ogrisel and others added 4 commits September 26, 2023 11:03

Typos and code style improvements

c01255e

Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>

Factorize redundant test logic to find a non-trivial test radius

05234d4

Merge branch 'main' into tst/improve-error-message-assert_argkmin_res…

40b305e

…ults_quasi_equality

Merge branch 'main' into tst/improve-error-message-assert_argkmin_res…

d0dfbf2

…ults_quasi_equality

glemaitre reviewed Sep 26, 2023

View reviewed changes

ogrisel and others added 3 commits September 27, 2023 18:08

Apply suggestions from code review

1224fb6

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

DOC add missing docstring + improve inline comment

7fb4261

Merge branch 'main' into tst/improve-error-message-assert_argkmin_res…

4a0592a

…ults_quasi_equality

ogrisel approved these changes Sep 27, 2023

View reviewed changes

glemaitre approved these changes Sep 28, 2023

View reviewed changes

glemaitre merged commit b06a099 into scikit-learn:main Sep 28, 2023

ogrisel mentioned this pull request Sep 28, 2023

MAINT cosmetic improvement in _non_trivial_radius test helper #27486

Merged

Uh oh!

TST Improve assert_argkmin_results_quasi_equality error message #27281

TST Improve assert_argkmin_results_quasi_equality error message #27281

Uh oh!

Conversation

jjerphan commented Sep 3, 2023 • edited by ogrisel Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

Updated scope of this PR

Uh oh!

github-actions bot commented Sep 3, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✔️ Linting Passed

Uh oh!

ogrisel commented Sep 11, 2023

Uh oh!

jjerphan commented Sep 11, 2023

Uh oh!

ogrisel commented Sep 11, 2023

Uh oh!

ogrisel commented Sep 11, 2023

Uh oh!

ogrisel commented Sep 11, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jjerphan commented Sep 11, 2023

Uh oh!

ogrisel commented Sep 12, 2023

Uh oh!

ogrisel commented Sep 12, 2023

Uh oh!

ogrisel commented Sep 13, 2023

Uh oh!

ogrisel commented Sep 13, 2023

Uh oh!

ogrisel commented Sep 15, 2023

Uh oh!

jjerphan commented Sep 15, 2023

Uh oh!

ogrisel commented Sep 16, 2023

Uh oh!

jjerphan left a comment

Choose a reason for hiding this comment

Uh oh!

ogrisel commented Sep 17, 2023

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jjerphan left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

glemaitre left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

TST Improve `assert_argkmin_results_quasi_equality` error message #27281

TST Improve `assert_argkmin_results_quasi_equality` error message #27281

jjerphan commented Sep 3, 2023 •

edited by ogrisel

Loading

github-actions bot commented Sep 3, 2023 •

edited

Loading

ogrisel commented Sep 11, 2023 •

edited

Loading

jjerphan left a comment •

edited

Loading