
TST dtype dependent rtol, atol in assert_allclose_dense_sparse #13978


Merged: 3 commits merged into scikit-learn:master on Jun 26, 2019

Conversation

@oleksandr-pavlyk (Contributor) commented May 29, 2019

The closeness check between new_result and the previously computed result
in check_fit_idempotent should depend on the dtype of the result, and the
tolerance cannot be less than that precision's epsilon.
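As an illustration (a minimal sketch with a hypothetical helper, not the exact patch), the dtype-dependent tolerance selection looks like this:

    import numpy as np

    def dtype_tolerance(result):
        # Hypothetical helper: floating results use their own machine
        # epsilon; anything else (e.g. integer labels) falls back to
        # float64 eps.
        if np.issubdtype(result.dtype, np.floating):
            return np.finfo(result.dtype).eps
        return np.finfo(np.float64).eps

    print(dtype_tolerance(np.ones(3, dtype=np.float32)))  # ~1.19e-07
    print(dtype_tolerance(np.ones(3, dtype=np.float64)))  # ~2.22e-16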

Reference Issues/PRs

Closes #13977

What does this implement/fix? Explain your changes.

This change sets the atol keyword in the call to assert_allclose_dense_sparse in
the check_fit_idempotent estimator checker.

Any other comments?

@amueller @jeremiedbb

Check for closeness of new_result and the previously computed result
should depend on the dtype of the result, and cannot be less than
that precision's epsilon.
@oleksandr-pavlyk (Contributor, Author) commented

Is this build hung?

@oleksandr-pavlyk (Contributor, Author) commented

@ogrisel @jeremiedbb CI is green. Could you review, please?

@jeremiedbb (Member) commented

Shouldn't we adjust the rtol instead? Unless comparing to 0, atol is usually not meaningful and kind of unrelated to the dtype.
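For context, NumPy's allclose-style checks pass when |actual - desired| <= atol + rtol * |desired|, so for values of ordinary magnitude the rtol term dominates and atol only matters when the desired value is at or near zero. A quick illustration:

    import numpy as np

    # rtol alone covers values of ordinary magnitude...
    print(np.allclose(1.0 + 1e-8, 1.0, rtol=1e-7, atol=0.0))   # True
    # ...but rtol * |0| == 0, so comparisons against zero need atol.
    print(np.allclose(1e-12, 0.0, rtol=1e-7, atol=0.0))        # False
    print(np.allclose(1e-12, 0.0, rtol=1e-7, atol=1e-9))       # True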

@oleksandr-pavlyk (Contributor, Author) commented

Yes; in that case, how about setting atol = rtol = np.finfo(dtype).eps?

@jeremiedbb (Member) commented

Yes, that's fine.

@oleksandr-pavlyk force-pushed the check-fit-itempotent-atol branch 2 times, most recently from e654d07 to 1073501 (June 4, 2019 17:10)
@oleksandr-pavlyk force-pushed the check-fit-itempotent-atol branch from 1073501 to 35ca88b (June 4, 2019 17:11)
@oleksandr-pavlyk (Contributor, Author) commented

@jeremiedbb Done.

@jeremiedbb (Member) left a comment

Just a minor comment. Other than that, LGTM.

    if np.issubdtype(new_result.dtype, np.floating):
        tol = np.finfo(new_result.dtype).eps
    else:
        tol = np.finfo(np.float64).eps
@jeremiedbb (Member) commented on the diff

Maybe add a comment # integer dtype because it's not clear at first sight.

I wonder if it would not be even clearer to reformulate it this way:

if new_result.dtype == np.float32:
    tol = np.finfo(np.float32).eps
else:
    tol = np.finfo(np.float64).eps

wdyt?

@oleksandr-pavlyk (Contributor, Author) replied

I think using np.floating is much cleaner NumPy: it accounts for np.half, np.longdouble, etc., even though these types may not be used in scikit-learn.
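A quick check of that claim: np.floating is the abstract parent of all NumPy floating scalar types, so the np.issubdtype branch covers half and longdouble as well, while integer dtypes fall through to the float64 branch.

    import numpy as np

    # Note: longdouble's width (and printed name) is platform-dependent.
    for dt in (np.half, np.float32, np.float64, np.longdouble, np.int64):
        print(np.dtype(dt).name, np.issubdtype(dt, np.floating))
    # float16 True, float32 True, float64 True,
    # longdouble (e.g. float128) True, int64 False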

@jeremiedbb (Member) replied

Fair enough :)

@oleksandr-pavlyk (Contributor, Author) commented

Pinging reviewers, please.

rth previously approved these changes Jun 16, 2019

@rth (Member) left a comment

LGTM. Thanks @oleksandr-pavlyk!

@@ -2448,4 +2448,12 @@ def check_fit_idempotent(name, estimator_orig):
    for method in check_methods:
        if hasattr(estimator, method):
            new_result = getattr(estimator, method)(X_test)
            assert_allclose_dense_sparse(result[method], new_result)
            if np.issubdtype(new_result.dtype, np.floating):
                tol = np.finfo(new_result.dtype).eps
@rth (Member) commented Jun 16, 2019

No, actually for float64 this would be 2.22e-16; do we actually need such precision? I think the previous default of 1e-7 was enough. For other dtypes, I agree.

Maybe then,

    # previous defaults in `assert_allclose_dense_sparse`
    rtol = max(tol, 1e-7)
    atol = max(tol, 1e-9)

Also maybe take 2*np.finfo(dtype).eps just to be safe?

@oleksandr-pavlyk (Contributor, Author) replied

Thanks for the good feedback. I agree, and have pushed both proposed changes.

@rth dismissed their stale review June 25, 2019 11:52 (changes requested)

The tol parameter is set to 2*np.finfo(dtype).eps, rather than
np.finfo(dtype).eps.

The rtol and atol parameters of `assert_allclose_dense_sparse` are
set so as not to go below their default values.
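Combining the two commits, the resulting tolerances work out as sketched below (assuming tol = 2 * eps(dtype), clipped at the previous defaults, per the review discussion above):

    import numpy as np

    for dt in (np.float32, np.float64):
        tol = 2 * np.finfo(dt).eps
        rtol = max(tol, 1e-7)  # never below the previous default rtol
        atol = max(tol, 1e-9)  # never below the previous default atol
        print(np.dtype(dt).name, rtol, atol)
    # float32: rtol = atol = 2*eps (about 2.4e-07); 2*eps dominates
    # float64: rtol = 1e-7, atol = 1e-9; the old defaults dominate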
@rth (Member) commented Jun 26, 2019

Thanks!

@rth changed the title from "MAINT: Per #13977 set non-default absolute tolerance" to "TST dtype dependent rtol, atol in assert_allclose_dense_sparse" on Jun 26, 2019
@rth merged commit 3f25ea0 into scikit-learn:master Jun 26, 2019
koenvandevelde pushed a commit to koenvandevelde/scikit-learn that referenced this pull request Jul 12, 2019
Linked issue closed by this PR: Default atol value in assert_allclose_dense_sparse is too low (#13977)