BUG: assert_almost_equal fails on subclasses that cannot handle bool #8452

mhvk · 2017-01-06T22:17:38Z

gh-8410 breaks a large number of astropy tests, because it sets up a boolean array for values that should actually be compared (i.e., are not nan or inf) using zeros_like. The latter means that for subclasses, the boolean test array is not a plain ndarray but the subclass. But for astropy's Quantity, the all method is undefined.

This commit ensures the test arrays from isinf and isnan are used directly, and adds test cases to ensure this does not regress again.

mhvk · 2017-01-06T22:20:53Z

numpy/testing/utils.py

@@ -726,14 +725,13 @@ def chk_same_position(x_id, y_id, hasval='nan'):
                raise AssertionError(msg)

        if isnumber(x) and isnumber(y):
-            x_id, y_id = zeros_like(x, dtype=bool_), zeros_like(y, dtype=bool_)


A more trivial change would replace just this line with

x_id = zeros(x.shape, dtype=bool_) y_id = zeros(y.shape, dtype=bool_)

but it seems unnecessary to create these arrays in the first place -- and indeed the code becomes quite a bit shorter and, to me at least, clearer by just removing elements from the arrays.

shoyer · 2017-01-06T23:17:12Z

I am perfectly happy with the actual fix to the test functions here (indeed, the code is cleaner than before), but I think it is absolute insanity for numpy functions to support subclasses that explicitly disable built-in methods like all.

assert_almost_equal (and other numpy functions) should either coerce to base numpy arrays or work for well behaved subclasses, but it's not realistic for NumPy to cater to the quirks of every strange ndarray subclasses. AstroPy should use its own function for this.

My preference would be to make this change, but leave out the unit tests for the ndarray subclass without the all method. Yes, this means that this will likely break again at some point in the future. Such are the hazards of subclassing ndarray.

mhvk · 2017-01-07T15:22:06Z

@shoyer - I agree that the test at some level is rather odd, even though it is quite obvious that there will be other subclasses for which a boolean dtype simply does not make sense but for which comparisons are well defined; in a sense, the problem here really was that zeros_like allows one to override the dtype, which to me makes very little sense.

But while I don't quite like my own test either, it does seem this is a regression one might as well ensure does not recur; it doesn't seem that unlikely to me there are others than astropy that rely on this working. Would the test be less odd to you if the sample subclass had an explicit __array_finalize__ that raised an error if the dtype was boolean?

p.s. We do have our own assert_quantity_allclose since the numpy version does not work for non-dimensionless quantities. The fact that nonetheless many tests failed just goes to show how wonderfully well (but sometimes frustratingly so) duck-typing often works.

seberg · 2017-01-07T15:42:03Z

Well, not putting a test because we don't want to necessary take care to not break it is not too useful IMHO. Might as well just say in a comment that the test documents a behaviour but is not considered guaranteed. I realize that some people may be discouraged by a failing test immidiately, but my guess is you should always look what kind of test you are breaking and then you see the comment.

seberg · 2017-01-07T17:22:45Z

numpy/testing/utils.py

-                    x_id |= x_isnan
-                    y_id |= y_isnan
+                    x = x[~x_isnan]
+                    y = y[~y_isnan]


OK, I am lazy now, but do these assert functions check the shape exactly? Because otherwise, I think there may be a problem with broadcasting if x and y are only broadcastable to one another.

Yes, the chk_same_position ensures that, when broadcast, x_isnan and y_isnan take out the same elements. As a result, x and y will still broadcast against each other.

OK, shape is checked exactly in any case, so no worries.

mhvk · 2017-01-07T21:22:31Z

@shoyer - would a comment in the test along the lines of what @seberg suggested be OK with you?

shoyer · 2017-01-09T04:47:50Z

@mhvk

would a comment in the test along the lines of what @seberg suggested be OK with you?

Yes, that's fine.

To be clear my complaint here is really more theoretical than practical. As long as you are testing against numpy master and promptly supplying patches when things break, keeping numpy functions working for AstroPy's ndarray subclasses is totally reasonable (assuming no significant performance or code complexity costs).

I just don't want to hobble NumPy with a commitment to supporting this behavior in the long term. The nature of subclassing in Python is that without a clearly delimited public API for subclasses (which doesn't exist for NumPy) it is extremely difficult (in practice impossible) to avoid leaking implementation details in the API. So this sort of breakage happens almost inevitably.

numpygh-8410 breaks a large number of astropy tests, because it sets up a boolean array for values that should actually be compared (i.e., are not `nan` or `inf`) using `zeros_like`. The latter means that for subclasses, the boolean test array is not a plain `ndarray` but the subclass. But for astropy's `Quantity`, the `all` method is undefined. This commit ensures the test arrays from `isinf` and `isnan` are used directly.

mhvk · 2017-01-09T14:41:08Z

OK, all makes sense. I pushed a version with a revised comment. With that, I think this is ready to go in.

seberg · 2017-01-10T10:04:34Z

Hehe, the comment could be more obvious about not being too afraid of making the test fail, but OK. Thanks!

pv · 2017-01-11T09:04:56Z

This seems to break scipy: ``` python3 runtests.py -j2 -t scipy/sparse/tests/test_base.py:TestLIL.test_elementwise_divide ``` results to ``` ====================================================================== FAIL: test_base.TestLIL.test_elementwise_divide ---------------------------------------------------------------------- Traceback (most recent call last): File "/home/pauli/tmp/numpy/build/testenv/lib/python3.5/site-packages/numpy/testing/utils.py", line 707, in chk_same_position assert_array_equal(x_id, y_id) File "/home/pauli/tmp/numpy/build/testenv/lib/python3.5/site-packages/numpy/testing/utils.py", line 842, in assert_array_equal verbose=verbose, header='Arrays are not equal') File "/home/pauli/tmp/numpy/build/testenv/lib/python3.5/site-packages/numpy/testing/utils.py", line 725, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not equal (shapes (1, 6), (6,) mismatch) x: [repr failed for <matrix>: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()] y: array([False, False, False, True, False, False], dtype=bool) During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/usr/lib/python3/dist-packages/nose/case.py", line 198, in runTest self.test(*self.arg) File "/home/pauli/tmp/scipy/scipy/sparse/tests/test_base.py", line 1358, in test_elementwise_divide assert_array_equal(todense(self.datsp / denom), expected) File "/home/pauli/tmp/numpy/build/testenv/lib/python3.5/site-packages/numpy/testing/utils.py", line 842, in assert_array_equal verbose=verbose, header='Arrays are not equal') File "/home/pauli/tmp/numpy/build/testenv/lib/python3.5/site-packages/numpy/testing/utils.py", line 741, in assert_array_compare chk_same_position(x == +inf, y == +inf, hasval='+inf') File "/home/pauli/tmp/numpy/build/testenv/lib/python3.5/site-packages/numpy/testing/utils.py", line 713, in chk_same_position raise AssertionError(msg) AssertionError: Arrays are not equal x and y +inf location mismatch: x: [repr failed for <matrix>: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()] y: array([ 1. , 0.5 , -3. , inf, 0.25, 0. ]) ```

pv · 2017-01-11T09:09:28Z

Maybe problem with handling np.matrix

seberg · 2017-01-11T10:23:22Z

Yeah, surprising numpy tests did not notice. Possibly the always 2D stuff breaks things, matrices don't quite support the boolean indexing after all. @mhvk can you check it out?

jotasi · 2017-01-11T10:26:52Z

I guess this is the case. You can trigger the error easily by doing:

import numpy as np
np.assert_array_equal(np.matrix([[np.nan, np.inf]]), np.array([[np.nan, np.inf]]))

This will give the following error message:

---------------------------------------------------------------------------
AssertionError                            Traceback (most recent call last)
<ipython-input-2-32896f34a9e8> in <module>()
----> 1 np.testing.assert_array_equal(np.matrix([[np.nan, np.inf]]), np.array([[np.nan, np.inf]]))

XXX/numpy/testing/utils.pyc in assert_array_equal(x, y, err_msg, verbose)
    841     """
    842     assert_array_compare(operator.__eq__, x, y, err_msg=err_msg,
--> 843                          verbose=verbose, header='Arrays are not equal')
    844
    845

XXX/numpy/testing/utils.pyc in assert_array_compare(comparison, x, y, err_msg, verbose, header, precision, equal_nan, equal_inf)
    740                 if any(x_isinf) or any(y_isinf):
    741                     # Check +inf and -inf separately, since they are different
--> 742                     chk_same_position(x == +inf, y == +inf, hasval='+inf')
    743                     chk_same_position(x == -inf, y == -inf, hasval='-inf')
    744                     x = x[~x_isinf]

XXX/numpy/testing/utils.pyc in chk_same_position(x_id, y_id, hasval)
    711                                 % (hasval), verbose=verbose, header=header,
    712                                 names=('x', 'y'), precision=precision)
--> 713             raise AssertionError(msg)
    714
    715     try:

AssertionError:
Arrays are not equal

x and y +inf location mismatch:
 x: matrix([[ inf]])
 y: array([ inf])

I guess the reason is, that a matrix stays two dimensional even after removing the nan's and then when comparing the positions of the inf's, assert_array_equal is called and that calls assert_array_compare which first checks, that the shapes of the arrays match and that then fails.

mhvk · 2017-01-11T14:48:34Z

I'll have a look...

mhvk · 2017-01-11T15:39:33Z

Indeed, it was a problem with slicing and matrix keeping 2 dimensions. Possibly assert_array_equal should broadcast rather than strictly check the array shape, but that presumably is water under the bridge. Anyway, a fix in #8468 (with the above example as a regression check).

shoyer · 2017-01-11T17:27:03Z

Maybe I'm missing something, but why does assert_array_equal even work for duck-array types that are not ndarray subclasses? I guess we are stuck with it because scipy.sparse uses it, but I am surprised that this function does not at least use np.asanyarray to sanitize the inputs.

pv · 2017-01-11T17:28:59Z

scipy.sparse doesn't use it on sparse matrices, all the inputs are np.matrix which is an ndarray subtype.

mhvk · 2017-01-11T17:38:11Z

Just for reference: assert_array_compare does do x = array(x, copy=False, subok=True) and same for y.

mhvk commented Jan 6, 2017

View reviewed changes

This was referenced Jan 6, 2017

Numpy 1.13dev errors astropy/astropy#5674

Closed

BUG: Fixed behavior of assert_array_less for +/-inf #8410

Merged

seberg reviewed Jan 7, 2017

View reviewed changes

charris added 06 - Regression component: numpy.testing labels Jan 7, 2017

mhvk force-pushed the testing-avoid-subclass-bool-arrays branch from 06032a3 to fe46cd6 Compare January 9, 2017 14:40

seberg changed the title ~~BUG assert_almost_equal fails on subclasses that cannot handle bool~~ BUG: assert_almost_equal fails on subclasses that cannot handle bool Jan 9, 2017

seberg merged commit 124c3d8 into numpy:master Jan 10, 2017

homu mentioned this pull request Jan 10, 2017

ENH: Add isnat function and make comparison tests NAT specific #8421

Merged

1 task

mhvk deleted the testing-avoid-subclass-bool-arrays branch January 10, 2017 14:37

pv mentioned this pull request Jan 11, 2017

ENH: Minor user-friendliness cleanup in LowLevelCallable scipy/scipy#6952

Merged

mhvk mentioned this pull request Jan 11, 2017

BUG: Ensure inf/nan removal in assert_array_compare is matrix-safe. #8468

Merged

charris mentioned this pull request Jan 12, 2017

Drop NumPy 1.10.x conda-forge/staged-recipes#2177

Merged

Uh oh!

BUG: assert_almost_equal fails on subclasses that cannot handle bool #8452

BUG: assert_almost_equal fails on subclasses that cannot handle bool #8452

Uh oh!

Conversation

mhvk commented Jan 6, 2017

Uh oh!

mhvk Jan 6, 2017

Choose a reason for hiding this comment

Uh oh!

shoyer commented Jan 6, 2017

Uh oh!

mhvk commented Jan 7, 2017

Uh oh!

seberg commented Jan 7, 2017

Uh oh!

seberg Jan 7, 2017

Choose a reason for hiding this comment

Uh oh!

mhvk Jan 7, 2017

Choose a reason for hiding this comment

Uh oh!

seberg Jan 8, 2017

Choose a reason for hiding this comment

Uh oh!

mhvk commented Jan 7, 2017

Uh oh!

shoyer commented Jan 9, 2017

Uh oh!

mhvk commented Jan 9, 2017

Uh oh!

seberg commented Jan 10, 2017

Uh oh!

pv commented Jan 11, 2017 via email

Uh oh!

pv commented Jan 11, 2017 via email

Uh oh!

seberg commented Jan 11, 2017

Uh oh!

jotasi commented Jan 11, 2017

Uh oh!

mhvk commented Jan 11, 2017

Uh oh!

mhvk commented Jan 11, 2017

Uh oh!

shoyer commented Jan 11, 2017

Uh oh!

pv commented Jan 11, 2017 via email

Uh oh!

mhvk commented Jan 11, 2017

Uh oh!

Uh oh!