ENH: make np.where a ufunc #8994

eric-wieser · 2017-04-26T16:11:06Z

This should be geared very much towards a bXX->X inner loop (a boolean condition plus two inputs of type X, producing X), for every possible X.

This would offer:

an out argument
support for subclasses using __array_ufunc__
Not really desirable, but it comes with the package: a where argument (!), such that np.where(c, y, z, out=x, where=w) is a more efficient x = np.where(w, np.where(c, y, z), x)
A fix to BUG np.where half-initializes subclass of output #5095
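As a rough illustration of what the out and where arguments would mean, here is a minimal sketch that emulates them with today's NumPy. It assumes the standard ufunc convention that where=w writes results only at positions where w is True and leaves out untouched elsewhere. The helper name where_into is hypothetical and not part of NumPy.

```python
import numpy as np

def where_into(c, y, z, out, where=True):
    """Emulate the proposed np.where(c, y, z, out=out, where=where).

    Hypothetical helper: a real ufunc would avoid the temporary array.
    """
    selected = np.where(c, y, z)            # temporary that a ufunc loop would not need
    np.copyto(out, selected, where=where)   # write only where the mask is True
    return out

c = np.array([True, False, True, False])
y = np.array([1.0, 2.0, 3.0, 4.0])
z = np.array([-1.0, -2.0, -3.0, -4.0])
w = np.array([True, True, False, False])
x = np.zeros(4)

# Reference result using nested np.where, computed before x is modified.
expected = np.where(w, np.where(c, y, z), x)
where_into(c, y, z, out=x, where=w)
```

After the call, x holds the selected values where w is True and keeps its previous contents elsewhere, which is the same array as expected.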

Problems:

Are inner loops for void and other flexible types possible?


mhvk · 2017-04-26T22:01:23Z

Love this idea!

shoyer · 2017-04-28T04:58:43Z

Yes, this would be awesome! This would only work for the three-argument version of where -- the one-argument version isn't really ufunc-like. So we can keep the public API unchanged, and don't need to support the confusing where(... where=...) unless we really want to :).

hameerabbasi · 2018-02-24T07:12:28Z

I believe that they are possible. However, would it be possible to make the one-argument version work with this? Or would the three-argument version defer to this?

hameerabbasi · 2018-02-24T07:14:54Z

I'd be willing to work up a PR if we can decide on a new home for this ufunc so we support the one-argument where.

mhvk · 2018-02-24T17:24:02Z

@hameerabbasi - I think the idea would be that np.where calls the ufunc for its three-argument form (and for types of argument for which the ufunc works). For the one-argument form, no change would be made (since that behaviour cannot be captured by a ufunc).
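The dispatch described here could look roughly like the sketch below. It is only an illustration: _conditional is a hypothetical stand-in built with np.frompyfunc (so it has an object loop only and is slow), and the wrapper is named where_sketch to make clear it is not NumPy's implementation.

```python
import numpy as np

# Hypothetical stand-in for the proposed ufunc: picks a where c is truthy, else b.
_conditional = np.frompyfunc(lambda c, a, b: a if c else b, 3, 1)

def where_sketch(condition, x=None, y=None):
    """Three-argument form goes through the ufunc; one-argument form is unchanged."""
    if x is None and y is None:
        # One-argument form: not expressible as a ufunc, keep nonzero semantics.
        return np.asarray(condition).nonzero()
    if x is None or y is None:
        raise ValueError("either both or neither of x and y should be given")
    result = _conditional(condition, x, y)
    # The frompyfunc loop yields objects; cast back to the promoted dtype of x and y.
    return np.asarray(result).astype(np.result_type(x, y))

cond = np.array([True, False, True])
picked = where_sketch(cond, np.array([1, 2, 3]), np.array([10.0, 20.0, 30.0]))
indices = where_sketch(cond)
```

Here picked is a float64 array taking elements from the first input where cond is True, and indices is the usual tuple of index arrays.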

Questions to all: what should be the name of the ufunc? We already have np.select. Harking back to C, perhaps np.conditional(condition, a, b). Or, more Fortran-ish, np.merge(a, b, condition)? Or should it be a private function that only gets called by np.where?

p.s. On dealing with void and string: that is a bit trickier, as it needs passing on lengths, etc. Possibly it is best to just start with the regular dtypes...

hameerabbasi · 2018-02-24T18:42:01Z

> Questions to all: what should be the name of the ufunc? We already have np.select. Harking back to C, perhaps np.conditional(condition, a, b). Or, more Fortran-ish, np.merge(a, b, condition)? Or should it be a private function that only gets called by np.where?

In the issue I opened (before opening this one), I suggested if, but it's a Python keyword so not optimal. ternary would also work since it mimics the ternary operator. ifx from LaTeX would make sense too.

> p.s. On dealing with void and string: that is a bit trickier, as it needs passing on lengths, etc. Possibly it is best to just start with the regular dtypes...

Use np.promote_types? Doesn't work for all cases though.

>>> np.promote_types('S8', 'S11')
dtype('S11')
>>> dt1 = np.dtype([('f1', np.int16), ('f2', np.float32)])
>>> dt2 = np.dtype([('f5', np.int16), ('f6', np.float32)])
>>> np.promote_types(dt1, dt2)
Traceback (most recent call last):
  File "<input>", line 1, in <module>
TypeError: invalid type promotion
>>> np.promote_types(dt1, dt1)
dtype([('f1', '<i2'), ('f2', '<f4')])

mhvk · 2018-02-24T18:56:44Z

I don't like ternary as that's just the number of arguments, and one could think of other ufuncs with three arguments (e.g., the fused multiply and add discussed on the mailing list).

shoyer · 2018-02-24T18:57:08Z

I like the name np.conditional. "merge" suggests something database-like. "cond" matches the name from Lisp, but we don't need a short name when we will still have "where".

hameerabbasi · 2018-02-26T21:32:45Z

I also had another idea. I know numpy doesn't follow this too strictly, but how about if_ similar to operator.and_ et al.

hameerabbasi · 2018-05-18T05:05:41Z

I'm +1 on the name np.conditional. If someone can point me to similar ufunc implementations I'll try my hand at this one.

mhvk · 2018-05-18T13:10:16Z

The casting would, I think, be fairly similar to addition (except that of course the boolean does not influence the outcome). The regular ufuncs are all defined in core/src/umath/, in particular loops.c.src; you may want to look at recent additions; e.g., #8774. Though if you haven't done a ufunc before, it might make sense to start without the scripting/looping in a .src file, and just follow the tutorial for writing a ufunc: https://docs.scipy.org/doc/numpy/user/c-info.ufunc-tutorial.html (I found this fairly helpful).
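The remark about casting can be checked against current behaviour with a small example: for array inputs, the result dtype of the three-argument np.where matches that of np.add on the same two value arrays, and the boolean condition plays no part in it. The arrays below are arbitrary illustrations.

```python
import numpy as np

cond = np.array([True, False])
a = np.array([1, 2], dtype=np.int16)
b = np.array([0.5, 1.5], dtype=np.float32)

where_dtype = np.where(cond, a, b).dtype   # dtype chosen by np.where today
add_dtype = np.add(a, b).dtype             # dtype chosen by the add ufunc
promoted = np.result_type(a, b)            # promotion of the two value dtypes only
```

All three dtypes agree for these inputs, which is the behaviour a bXX->X loop with addition-like type resolution would reproduce.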

eric-wieser · 2021-03-01T10:26:55Z

Regarding naming, I think np.where.ufunc is possibly the simplest place to put the actual ufunc object that np.where dispatches to once it has special-cased the one-argument form. Only users implementing __array_ufunc__ need to know where it is, and they'd find it organically while writing tests for the ufuncs they care about.
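To make the np.where.ufunc idea concrete, here is a minimal sketch of how a class implementing __array_ufunc__ might recognise it. Everything named here is hypothetical: the module-level where function, its ufunc attribute, and the Tagged duck array are illustrations only, and the ufunc is a np.frompyfunc stand-in with an object loop rather than a compiled bXX->X loop.

```python
import numpy as np

# Hypothetical stand-in for the proposed ufunc (object loop only).
_ufunc = np.frompyfunc(lambda c, a, b: a if c else b, 3, 1)

def where(condition, x, y):
    """Three-argument form only; delegates straight to the ufunc."""
    return _ufunc(condition, x, y)

# Expose the ufunc as an attribute, mirroring the suggested np.where.ufunc.
where.ufunc = _ufunc

class Tagged:
    """Minimal duck array that intercepts calls to the where ufunc."""

    def __init__(self, data):
        self.data = np.asarray(data)

    def __array_ufunc__(self, ufunc, method, *inputs, **kwargs):
        if ufunc is where.ufunc and method == "__call__":
            # Unwrap Tagged inputs, run the ufunc, and rewrap the result.
            raw = [i.data if isinstance(i, Tagged) else i for i in inputs]
            return Tagged(ufunc(*raw, **kwargs))
        return NotImplemented

result = where(np.array([True, False]), Tagged([1, 2]), Tagged([10, 20]))
```

Because the call goes through a real ufunc object, NumPy's override machinery hands it to Tagged.__array_ufunc__, and result comes back as a Tagged instance instead of a plain ndarray.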

eric-wieser added the "01 - Enhancement" and "component: numpy._core" labels on Apr 26, 2017

This was referenced Apr 26, 2017

BUG: Applying np.fix on scalar returns 0-D array #8993

Closed

BUG/DEP: Make ufunclike functions more ufunc-like #8996

Merged

eric-wieser mentioned this issue Jun 3, 2017

MAINT: Don't internally use the one-argument where #9214

Merged

hameerabbasi mentioned this issue Feb 24, 2018

Introduce if ufunc and rebase the three-argument where on it #10654

Closed

eric-wieser mentioned this issue Apr 19, 2018

Functions select and where don't preserve subclasses #10933

Closed

hameerabbasi mentioned this issue May 18, 2018

A protocol for numpy.ones_like #11074

Closed

eric-wieser mentioned this issue Mar 1, 2021

ENH: add ufuncs additional kwargs like out, dtype etc.. for np.where (out is needed most) #18516

Open

rhshadrach mentioned this issue Apr 20, 2024

BUG: np.where called with ps.Series returns np.array instead of pd.Series pandas-dev/pandas#58329

Open
