Fixup gh 5864 #5964

charris · 2015-06-13T20:15:34Z

Fix failing test in #5864 and make some style cleanups. Original message from Nathaniel follows.

We've long had consensus that returning NotImplemented from ufuncs was a bad and ugly hack, since it violates layering -- NotImplemented is the responsibility of op methods, not ufuncs. And one of the things that came out of the discussion in gh-5844 is that the NotImplemented handling inside the ufunc machinery was actually mostly useless and broken (see the first post on that report for analysis).

Of course, it turns out it's not quite that simple -- there were some places, esp. in the trainwreck that is array_richcompare, which were depending on the current behavior.

This patch series fixes the main pieces of code that used to rely on this behavior, and then removes as much of the NotImplemented handling as it can. And for what isn't removed, it adds deprecation or future warnings.

In particular, there are a bunch of cases where array_richcompare returns bad and wrong results, like where you can compare two arrays (which should produce a boolean array), some failure occurs, and then the error gets thrown away and we end up returning a boolean scalar. Some of these were deprecated in 9b8f6c7, but this deprecates the rest. Probably the most controversial part of this is that this patch deprecates scalar ordering comparisons between unlike types, like np.float64(1) < "foo", which are already an error on py3 but have traditionally returned some semi-arbitrary value on py2. To convince you that this is broken and we should stop supporting it, check this out:

Note: the name of this class affects the results below

In [1]: class quux(object):
...: pass
...:

In [2]: q = quux()

In [3]: np.string_("foo") < q
Out[3]: True

In [4]: np.asarray(np.string_("foo")) < q
Out[4]: False

For a full analysis of how we handle all the legacy comparison cases, see the long comment that starts here:

numpy/numpy/core/src/umath/ufunc_object.c

Line 867 in 19b829a

if (any_flexible && !any_flexible_userloops && !any_object) {

NB this PR is probably easier to review if you read the commits one at a time in order, instead of jumping straight to the big diff.

… case The ndarray richcompare function has a special case for handling string dtypes (which currently cannot be handled by ufuncs). Traditionally this was handled by the ufuncs returning NotImplemented, and then falling through to a special case. By moving the special case to the top of the richcompare function, it becomes unnecessary for the ufuncs to return NotImplemented in this case.

The ndarray richcompare function has special case code for handling void dtypes (esp. structured dtypes), since there are no ufuncs for this. Previously, we would attempt to call the relevant ufunc (e.g. np.equal), and then when this failed (as signaled by the ufunc returning NotImplemented), we would fall back on the special case code. This commit moves the special case code to before the regular code, so that it no longer requires ufuncs to return NotImplemented. Technically, it is possible to define ufunc loops for void dtypes using PyUFunc_RegisterLoopForDescr, so technically I think this commit changes behaviour: if someone had registered a ufunc loop for one of these operations, then previously it might have been found and pre-empted the special case fallback code; now, we use the special-case code without even checking for any ufunc. But the only possible use of this functionality would have been if someone wanted to redefine what == or != meant for a particular structured dtype -- like, they decided that equality for 2-tuples of float32's should be different from the obvious thing. This does not seem like an important capability to preserve. There were also several cases here where on error, an array comparison would return a scalar instead of raising. This is supposedly deprecated, but there were call paths that did this that had no deprecation warning. I added those warnings.

ndarray special methods like __add__ have a special case where if the right argument is not an ndarray or subclass, and it has higher __array_priority__ than the left argument, then we return NotImplemented and let the right argument handle the operation. ufuncs have traditionally had a similar but different special case, where if it's a 2 input - 1 output ufunc, and the right argument is not an ndarray (exactly, subclasses don't count), and when converted to an ndarray ends up as an object array (presumably b/c it doesn't have a meaningful coercion route, though who knows), and it has a higher __array_priority__ than the left argument AND it has a __r<operation>__ attribute, then they return NotImplemented. In practice this latter special case is not used by regular ndarrays, b/c anytime it would need to be triggered, the former special case triggers first and the ufunc is never called. However, numpy.ma did not have the former special case, and was thus relying on the ufunc special case. This commit adds the special case to the numpy.ma special methods directly, so that they no longer depend on the quirky ufunc behaviour. It also cleans up the relevant test to things that actually should be true in general, instead of just testing some implementation details.

This was redundant/broken anyway. See numpygh-5844 for discussion. See the massive comment added to ufunc_object.c:get_ufunc_arguments for a discussion of the deprecation strategy here -- it turns out that array_richcompare is such a disaster zone that we can't quite wholly eliminate NotImplemented quite yet. But this removes most NotImplementeds, and lays the groundwork for eliminating the rest in a release or two.

This is probably a good change in any case, and in particular might help with debugging whatever bizarro thing is going wrong on Travis.

The NotArray test class needs to define a `__ne__` method, otherwise the inherited Python 3 method will call `__eq__`, resulting in two rather than one DeprecationWarning.

charris · 2015-06-13T20:17:27Z

Another fixup in ma/core.py needs to be made before committing.

charris · 2015-06-13T22:25:08Z

See also #2091.

Also added back some extended error messages that were in original PR numpy#5864.

charris · 2015-06-13T23:32:42Z

Additional tests are still needed before this can go in.@pv suggests the following

The getpriority checks inside ufunc (which are removed here) are responsible for making array_priority be respected for inplace ops. Some copypasting of the getpriority check in number.c is required to make it work again. . Note also that the check in number.c has PyArray_Check(other), but the check in the ufunc has PyArray_CheckExact(other), so without me testing it it looks like there's a behavior change.

To be able to do this refactoring safely, it's probably necessary to exercise the various branches in the code, and to have additional tests. It seems as if to cover the different branches, checks for operations involving (i) ndarray & ndarray subclass, (ii) ndarray & non-ndarray object, (iii) ndarray subclass & different ndarray subclass, (iv) ndarray subclass & other object.

@pv I assume these tests involve the various affected operators. Could you expand a bit on the first comment?

The following functions now raise a TypeError instead of returning NotImplemented: np.power, np.add, np.subtract, np.multiply, np.divide, np.bitwise_and, np.bitwise_or, np.bitwise_xor, np.floor_divide, np.fmax, np.fmin, np.fmod, np.hypot, np.logaddexp, np.logaddexp2, np.logical_and, np.logical_or, np.logical_xor, np.maximum, np.minimum, np.mod Functions that remain to be fixed: np.greater, np.greater_equal, np.less, np.less_equal, np.not_equal

This checks the four classes with different __array_priority__ values against each other for the operators __add__, __radd__ __sub__, __rsub__ __mul__, __rmul__ __pow__, __rpow__ __div__, __rdiv__ __mod__, __rmod__ __truediv__, __rtruediv__ __floordiv__, __rfloordiv__ __and__, __rand__ __xor__, __rxor__ __or__, __ror__ __lshift__, __rlshift__ __rshift__, __rrshift__ __eq__ __ne__ __gt__ __ge__ __lt__ __le__ The classes are ndarray, two subclasses of ndarray, and a class unrelated to ndarray. The combinations checked are ndarray -- ndarray subclass ndarray -- unrelated class ndarray subclass -- ndarray subclass ndarray subclass -- unrelated class

charris · 2015-06-14T02:32:14Z

Tests for binary operators in classes of different types with differing values of __array_priority__ added. I think this PR is ready unless someone can recommend inplace operation tests.

Fixup gh 5864

Fixed by numpy#5964, but might as well keep the test from numpy#5960.

njsmith · 2015-06-15T09:28:03Z

Thanks, Chuck, for picking this up!

njsmith and others added 6 commits June 13, 2015 12:32

MAINT: make the deprecation warnings a little more distinctive

c65a751

This is probably a good change in any case, and in particular might help with debugging whatever bizarro thing is going wrong on Travis.

BUG: Fix test_void_dtype_equality_failures for Python 3.

df148e1

The NotArray test class needs to define a `__ne__` method, otherwise the inherited Python 3 method will call `__eq__`, resulting in two rather than one DeprecationWarning.

charris added 01 - Enhancement component: numpy._core component: numpy.ma masked arrays labels Jun 13, 2015

charris added this to the 1.10.0 release milestone Jun 13, 2015

charris mentioned this pull request Jun 13, 2015

Remove NotImplemented handling from the ufunc machinery (almost) #5864

Closed

STY: Minor style cleanups in tests and C code.

0aa3260

Also added back some extended error messages that were in original PR numpy#5864.

charris force-pushed the fixup-gh-5864 branch from 2bf660a to 0aa3260 Compare June 13, 2015 22:29

charris force-pushed the fixup-gh-5864 branch from 2f34614 to ee03f28 Compare June 14, 2015 00:01

charris added 2 commits June 13, 2015 18:52

charris force-pushed the fixup-gh-5864 branch from ee03f28 to ed031c2 Compare June 14, 2015 02:29

charris added a commit that referenced this pull request Jun 14, 2015

Merge pull request #5964 from charris/fixup-gh-5864

899a2a2

Fixup gh 5864

charris merged commit 899a2a2 into numpy:master Jun 14, 2015

charris deleted the fixup-gh-5864 branch June 14, 2015 15:54

charris added a commit to charris/numpy that referenced this pull request Jun 14, 2015

TST: Add test for segfault in richcompare of void type ==, !=.

7721d3d

Fixed by numpy#5964, but might as well keep the test from numpy#5960.

This was referenced Jun 14, 2015

TST: Add test for segfault in richcompare of void type ==, !=. #5965

Merged

BUG: segfault in array_richcompare #5960

Closed

BUG subclass with __numpy_ufunc__ segfaults with recarray comparison #4855

Closed

mhvk mentioned this pull request Jun 17, 2015

Quantity updates for numpy dev astropy/astropy#3860

Merged

This was referenced Jun 17, 2015

BUG Ensure masked object arrays can always return single items. #5962

Merged

MaskedArray ignores __array_priority__ #5227

Closed

This was referenced Jun 22, 2015

functions return NotImplemented instead of raising error (Trac #1494) #2091

Closed

In-place operators in 1.9 "respected" __array_priority__; in current master they don't #6020

Closed

BUG/ENH: Switch to simplified __numpy_ufunc__/binop interaction #6001

Closed

This was referenced Oct 12, 2015

np.datetime('NaT') == np.datetime('NaT', 'us') is no longer True on NumPy 1.10 #6452

Closed

np.equal can still return NotImplemented #6479

Closed

shoyer mentioned this pull request May 20, 2017

BUG: Operator equal supports void and np.equal does not #9147

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fixup gh 5864 #5964

Fixup gh 5864 #5964

Uh oh!

charris commented Jun 13, 2015

Uh oh!

charris commented Jun 13, 2015

Uh oh!

charris commented Jun 13, 2015

Uh oh!

charris commented Jun 13, 2015

Uh oh!

charris commented Jun 14, 2015

Uh oh!

njsmith commented Jun 15, 2015

Uh oh!

Uh oh!

Uh oh!

Fixup gh 5864 #5964

Fixup gh 5864 #5964

Uh oh!

Conversation

charris commented Jun 13, 2015

Note: the name of this class affects the results below

Uh oh!

charris commented Jun 13, 2015

Uh oh!

charris commented Jun 13, 2015

Uh oh!

charris commented Jun 13, 2015

Uh oh!

charris commented Jun 14, 2015

Uh oh!

njsmith commented Jun 15, 2015

Uh oh!

Uh oh!