
Add XArray compatibility features #102

Merged
hameerabbasi merged 9 commits into pydata:master from hameerabbasi:xarray-compat on Feb 22, 2018

Conversation

hameerabbasi
Collaborator

@hameerabbasi hameerabbasi commented Feb 19, 2018

Once the right broadcasting setup was in place, it was trivial to implement the three-argument version of where and NaN-skipping aggregations.

cc @mrocklin Feedback welcome.
cc @shoyer Feedback really welcome, as you know the ins and outs of XArray. Of course, "I can't" is okay, as always. :-)
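
For illustration, a minimal sketch of how these features might be used (not taken from the PR itself; it assumes the final API exposes sparse.where and sparse.nansum as module-level functions and supports elementwise comparisons like x > 1):

import numpy as np
import sparse

x = sparse.COO.from_numpy(np.array([[0.0, 1.0], [2.0, np.nan]]))
y = sparse.COO.from_numpy(np.array([[5.0, 0.0], [0.0, 7.0]]))

# Three-argument where: take elements from x where the condition holds, else from y.
z = sparse.where(x > 1, x, y)
print(z.todense())  # expected: [[5. 0.] [2. 7.]]

# NaN-skipping aggregation: NaNs are ignored in the sum.
print(sparse.nansum(x))  # expected: 3.0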

@hameerabbasi
Collaborator Author

Tests not yet added.

@hameerabbasi
Collaborator Author

hameerabbasi commented Feb 19, 2018

Unfortunately, there doesn't seem to be a way to override np.nansum. I added a print statement to our implementation, and:

>>> import numpy as np
>>> import sparse
>>> x = np.asarray([5, 6, np.nan])
>>> s = sparse.COO.from_numpy(x)
>>> s.nansum()
nanreduce
11.0
>>> np.nansum(s)
11.0

Edit: It doesn't fall back to our implementation even if we raise an error during coercion.

>>> class NoncoercibleCOO(sparse.COO):
...     def __array__(self, *args, **kwargs):
...         raise ValueError('Cannot coerce COO.')
...     
>>> s = NoncoercibleCOO.from_numpy(x)
>>> np.nansum(s)
Traceback (most recent call last):
  File "<input>", line 1, in <module>
  File "/Users/hameer/anaconda3/envs/sparse/lib/python3.6/site-packages/numpy/lib/nanfunctions.py", line 581, in nansum
    a, mask = _replace_nan(a, 0)
  File "/Users/hameer/anaconda3/envs/sparse/lib/python3.6/site-packages/numpy/lib/nanfunctions.py", line 64, in _replace_nan
    a = np.array(a, subok=True, copy=True)
  File "<input>", line 3, in __array__
ValueError: Cannot coerce COO.
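
The traceback above shows why: np.nansum densifies its argument via np.array(...) before reducing, so the sparse code path never runs. A hedged workaround sketch (assuming the NaN aggregations end up exposed as functions such as sparse.nansum, as the later commits in this PR do) is simply to call the library's own function instead of NumPy's:

import numpy as np
import sparse

x = np.array([5.0, 6.0, np.nan])
s = sparse.COO.from_numpy(x)

print(np.nansum(s))      # densifies s first, then reduces: 11.0
print(sparse.nansum(s))  # stays sparse throughout: 11.0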

@hameerabbasi
Collaborator Author

Hmm. It seems that, for nanmin and nanmax, we sometimes get -inf as a value (on an axis that contained JUST NaN and nothing else), whereas NumPy actually returns nan. I think the correct value of the reduction in this case is debatable (as a hobbyist mathematician, I, for one, think an empty min should be inf, and so on).

In any case, I'm willing to leave this as an xfail for now. Any input welcome.
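
To make the mismatch concrete, a small NumPy-only illustration (not from the PR): NumPy returns nan, with an "All-NaN slice encountered" RuntimeWarning, for an all-NaN axis, whereas a reduction that substitutes an infinite identity for NaN reports that infinity instead.

import numpy as np

x = np.array([[1.0, np.nan],
              [np.nan, np.nan]])

# NumPy's behaviour: nan for the all-NaN row, plus a RuntimeWarning.
print(np.nanmin(x, axis=1))  # [ 1. nan]

# A fill-based reduction reports the identity element for the all-NaN row.
print(np.min(np.where(np.isnan(x), np.inf, x), axis=1))   # [ 1. inf]
print(np.max(np.where(np.isnan(x), -np.inf, x), axis=1))  # [ 1. -inf]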

@hameerabbasi
Collaborator Author

hameerabbasi commented Feb 19, 2018

I added tests and matched the NumPy API. Looks review-ready to me. 💃 cc @mrocklin @shoyer

@hameerabbasi
Collaborator Author

hameerabbasi commented Feb 20, 2018

Do we want to support the edge case of NaNs in object arrays? It's proving difficult. It's certainly possible, but only with a bit of trickery.
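
For context, a quick NumPy-only illustration (not from the PR) of why object dtype is awkward here: np.isnan is not defined for object arrays, so NaN detection needs a detour such as the x != x trick (NaN is the only value that compares unequal to itself).

import numpy as np

obj = np.array([1.0, np.nan, "a"], dtype=object)

try:
    np.isnan(obj)
except TypeError as exc:
    print("np.isnan fails on object arrays:", exc)

print(obj != obj)  # [False  True False]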

@mrocklin
Contributor

mrocklin commented Feb 20, 2018 via email

@shoyer
Member

shoyer commented Feb 20, 2018 via email

@hameerabbasi
Collaborator Author

It seems we don't support object arrays in many cases at the moment. I've opened #104 to track this, but I'm inclined to ignore it here.

@hameerabbasi
Collaborator Author

hameerabbasi commented Feb 21, 2018

I fixed the NaN issue (by using fmin/fmax), tested for it, and matched the NumPy warning (and tested for that, too). This time around, I feel this really is ready for review.

Edit: I looked at the implementation of nanmin etc. in NumPy. Unfortunately, short of hooking into NumPy and replacing its functions with our own whenever the arguments are SparseArray, I don't see a way to solve this.
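
For reference, the fmin/fmax behaviour the fix relies on (plain NumPy, not PR code): unlike minimum/maximum, fmin/fmax ignore NaN when the other operand is a number and only return NaN when both operands are NaN, which matches nanmin/nanmax semantics, including the all-NaN case.

import numpy as np

print(np.minimum(np.nan, 5.0))  # nan (NaN propagates)
print(np.fmin(np.nan, 5.0))     # 5.0 (NaN ignored)
print(np.fmin(np.nan, np.nan))  # nan (all-NaN case preserved)

print(np.fmax.reduce([1.0, np.nan, 3.0]))  # 3.0
print(np.fmax.reduce([np.nan, np.nan]))    # nan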

@mrocklin
Contributor

This looks good to me :)

@hameerabbasi hameerabbasi merged commit 8965294 into pydata:master Feb 22, 2018
@hameerabbasi
Collaborator Author

Merged!

@hameerabbasi hameerabbasi deleted the xarray-compat branch February 22, 2018 09:00
hameerabbasi added a commit to hameerabbasi/sparse that referenced this pull request Feb 27, 2018
* Implement where.

* Implement NaN-skipping aggregations.

* Docs.

* Add tests, clarify docs a bit.

* Move NaN aggregations to be functions rather than methods to match Numpy

* Get rid of eval that was bothering me a lot.

* Fix NaN inequality issue.

* Remove object dtype code.

* Test for and fix warning code.