MAINT: Improve speed of ufunc kwargs parsing. #11333

mhvk · 2018-06-14T14:26:17Z

This uses the realization from inspecting similar CPython code that as we intern string names anyway, comparing keys with possible names by pointer will generally just work, and is of course much faster than first converting a unicode object to char* and then using strcmp.

@eric-wieser - my C remains limited: is there a cleverer way than the enum I used, ideally one that allows on to type things just once (there are now five places one has to keep up to date, three for the interns, one enum, and one array of the interned strings; this seems too much!)

Reduces the overhead as follows:

import numpy as np
a = np.array(1)
b = np.empty_like(a)
%timeit np.positive(a)                                 # 352->348 ns
%timeit np.positive(a, subok=True)                     # 606->501 ns
%timeit np.positive(a, where=True)                     # 586->503 ns
%timeit np.positive(a, where=True, subok=True)         # 695->531 ns
%timeit np.positive(a, b)                              # 354->352 ns
%timeit np.positive(a, out=b)                          # 557->480 ns
%timeit np.positive(a, out=b, subok=True)              # 668->506 ns
%timeit np.positive(a, out=b, where=True, subok=True)  # 752->536 ns

Using the realization from inspecting similar CPython code that as we intern string names anyway, comparing keys with possible names by pointer will generally just work. Reduces the overhead as follows: ``` import numpy as np a = np.array(1) b = np.empty_like(a) %timeit np.positive(a) # 352->348 ns %timeit np.positive(a, subok=True) # 606->501 ns %timeit np.positive(a, where=True) # 586->503 ns %timeit np.positive(a, where=True, subok=True) # 695->531 ns %timeit np.positive(a, b) # 354->352 ns %timeit np.positive(a, out=b) # 557->480 ns %timeit np.positive(a, out=b, subok=True) # 668->506 ns %timeit np.positive(a, out=b, where=True, subok=True) # 752->536 ns ```

mhvk · 2018-06-14T14:27:05Z

p.s. Quite obviously, more to be gained by doing this elsewhere, and by looking for out just once instead of who-knows-how-many-times.

eric-wieser · 2018-06-14T15:51:52Z

numpy/core/src/umath/ufunc_object.c

+            for (kw_id = 0; kw_id < ufunc_badkey; kw_id++) {
+                int cmp = PyObject_RichCompareBool(key, ufunc_kwnames[kw_id], Py_EQ);
+                if (cmp > 0) {
+                    goto kw_found;


Instead of using a goto, just make the above a helper function returning kw_id

I followed CPython's ceval quite exactly! But with a helper function, this may become useful for GenericReduction as well.

Can you link to the code you're imitating?

https://github.com/python/cpython/blob/master/Python/ceval.c#L3759

p.s. Do think the helper function is a good idea, but was waiting with making changes as I hoped someone would have a better idea of how to handle those name arrays and the enum -- I like the names but not that multiple lists have to be maintained separately. Can one iterate over an enum?

eric-wieser · 2018-06-15T01:54:20Z

I'm curious how much the switch statement buys us over a bunch of if/else ifs comparing PyObject * pointers to interned strings, which would eliminate the need for an enum.

mhvk · 2018-06-15T12:01:35Z

The one issue would be that then the slow path becomes a whole set of goto statements... (and given that python has it, I do not quite dare to remove it...)

mhvk · 2018-06-15T15:11:01Z

I think the real solution here would be to mimic PyArg_ParseTupleAndKeywords, with all those bits of code interpreting the various keywords properly passed on as functions. But that may be better done as another PR...

mhvk · 2018-06-21T14:41:29Z

Closing in favour of #11351

mhvk added 03 - Maintenance component: numpy.ufunc labels Jun 14, 2018

mhvk requested a review from eric-wieser June 14, 2018 14:26

mhvk mentioned this pull request Jun 14, 2018

Adding keywords to ufunc calls seems slow #10301

Closed

eric-wieser reviewed Jun 14, 2018

View reviewed changes

mhvk mentioned this pull request Jun 14, 2018

MAINT: move comparison operator special-handling out of ufunc parsing. #11282

Merged

mhvk mentioned this pull request Jun 16, 2018

MAINT: Improve speed of ufunc kwargs parsing #11351

Merged

mhvk closed this Jun 21, 2018

mhvk deleted the ufunc-use-interned-strings-in-parsing branch June 21, 2018 14:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

MAINT: Improve speed of ufunc kwargs parsing. #11333

MAINT: Improve speed of ufunc kwargs parsing. #11333

Uh oh!

mhvk commented Jun 14, 2018

Uh oh!

mhvk commented Jun 14, 2018

Uh oh!

eric-wieser Jun 14, 2018

Uh oh!

mhvk Jun 14, 2018

Uh oh!

eric-wieser Jun 14, 2018

Uh oh!

mhvk Jun 14, 2018

Uh oh!

mhvk Jun 14, 2018

Uh oh!

eric-wieser commented Jun 15, 2018

Uh oh!

mhvk commented Jun 15, 2018

Uh oh!

mhvk commented Jun 15, 2018

Uh oh!

mhvk commented Jun 21, 2018

Uh oh!

Uh oh!

Uh oh!

MAINT: Improve speed of ufunc kwargs parsing. #11333

MAINT: Improve speed of ufunc kwargs parsing. #11333

Uh oh!

Conversation

mhvk commented Jun 14, 2018

Uh oh!

mhvk commented Jun 14, 2018

Uh oh!

eric-wieser Jun 14, 2018

Choose a reason for hiding this comment

Uh oh!

mhvk Jun 14, 2018

Choose a reason for hiding this comment

Uh oh!

eric-wieser Jun 14, 2018

Choose a reason for hiding this comment

Uh oh!

mhvk Jun 14, 2018

Choose a reason for hiding this comment

Uh oh!

mhvk Jun 14, 2018

Choose a reason for hiding this comment

Uh oh!

eric-wieser commented Jun 15, 2018

Uh oh!

mhvk commented Jun 15, 2018

Uh oh!

mhvk commented Jun 15, 2018

Uh oh!

mhvk commented Jun 21, 2018

Uh oh!

Uh oh!