ENH: Implement essential intrinsics required by the upcoming SIMD optimizations(0) #22306

seiko2plus · 2022-09-17T19:58:32Z

This pullrequest adds:

intrinsics to check true cross all vector lanes
npyv_any_##SFX: returns true if any of the elements is not equal to zero
npyv_all_##SFX: returns true if all elements are not equal to zero
max/min that reverse IEC 60559's NaN behavior(propagates NaNs) for float data types
npyv_maxn_##SFX
npyv_minn_##SFX
max/min reduction for all float and integer vector data types
npyv_reduce_max_##SFX
npyv_reduce_min_##SFX
max/min reduction supports IEC 60559 for float data types
npyv_reduce_maxp_##SFX
npyv_reduce_minp_##SFX
max/min reduction reverse IEC 60559's NaN behavior(propagates NaNs) for float data types
npyv_reduce_maxn_##SFX
npyv_reduce_minn_##SFX
intrinsics to extract the first vector lane:
npyv_extract0_##SFX
npyv_extract0_##SFX

And removes:

local implementation of max/min reduce intrinsics

max/min that reverse IEC 60559's NaN beahvior(propagates NaNs) for float data types npyv_maxn_##SFX npyv_minn_##SFX max/min reduction for all float and integer vector data types npyv_reduce_max_##SFX npyv_reduce_min_##SFX max/min reduction supports IEC 60559 for float data types npyv_reduce_maxp_##SFX npyv_reduce_minp_##SFX max/min reduction reverse IEC 60559's NaN beahvior(propagates NaNs) for float data types npyv_reduce_maxn_##SFX npyv_reduce_minn_##SFX also, this patch implements new intrinsics to extract the first vector lane: npyv_extract0_##SFX npyv_extract0_##SFX

npyv_any_##SFX: returns true if any of the elements is not equal to zero npyv_all_##SFX: returns true if all elements are not equal to zero

seiko2plus · 2022-09-19T07:05:05Z

cc @mattip

charris · 2022-09-19T21:01:03Z

The errors are unrelated, they are on account of log length limitations on travis. I made a conservative fix for that before, looks like it needs to be more drastic.

charris · 2022-09-20T00:36:43Z

close/reopen

seberg · 2022-09-22T11:35:34Z

numpy/core/tests/test_simd.py

-                continue
-            _min = self.min(vdata_a, vdata_b)
-            assert _min == data_min
+        chk_nan = {"xp": 1, "np": 1, "nn": 2, "xn": 2}.get(intrin[-2:], 0)


Would probably be clearer as part of the parametrize but doesn't matter. I am mostly curious what min/max implement for float values? Does the result depend on the order?

In any case, looked at the code and does look good to me (not that I am very fluid at simd). Not sure if the tests cover all the permutations they could for reductions, but I also trust our integration tests for that.

@mattip will you have another quick look?

I am mostly curious what min/max implement for float values? Does the result depend on the order?

no, it doesn't, just check the tail of the intrinsic name to determine the NaN behavior.

Not sure if the tests cover all the permutations they could for reductions

I'm positive about it.

I'm working on refactoring the whole testing unit starting from _simd module to count more on parametrizing rather than inheritance.
see the new numpy/core/tests/test_simd.py part of #21057

mattip · 2022-09-25T04:23:02Z

Thanks @seiko2plus

seiko2plus added 2 commits September 17, 2022 21:48

MAINT, SIMD: remove local implementation of max/min reduce intrinsics

a2697ca

seiko2plus added the component: SIMD Issues in SIMD (fast instruction sets) code or machinery label Sep 17, 2022

github-actions bot added the 01 - Enhancement label Sep 17, 2022

seiko2plus mentioned this pull request Sep 17, 2022

ENH: Add SIMD versions of bool logical_&&,||,! and absolute #22167

Merged

seiko2plus force-pushed the npyv_new_intrinsics_sep2022_vol0 branch 9 times, most recently from 57e0148 to e3ad145 Compare September 19, 2022 05:47

seiko2plus marked this pull request as ready for review September 19, 2022 05:48

SIMD: Add new intrinsics to check true cross all vector lanes

6ef4c8b

npyv_any_##SFX: returns true if any of the elements is not equal to zero npyv_all_##SFX: returns true if all elements are not equal to zero

seiko2plus force-pushed the npyv_new_intrinsics_sep2022_vol0 branch from e3ad145 to 6ef4c8b Compare September 19, 2022 06:27

charris closed this Sep 20, 2022

charris reopened this Sep 20, 2022

seberg reviewed Sep 22, 2022

View reviewed changes

mattip merged commit d66ca35 into numpy:main Sep 25, 2022

seiko2plus mentioned this pull request Dec 7, 2022

ENH: Implement SIMD versions of isnan,isinf, isfinite and signbit #22165

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ENH: Implement essential intrinsics required by the upcoming SIMD optimizations(0) #22306

ENH: Implement essential intrinsics required by the upcoming SIMD optimizations(0) #22306

Uh oh!

seiko2plus commented Sep 17, 2022

Uh oh!

seiko2plus commented Sep 19, 2022

Uh oh!

charris commented Sep 19, 2022 •

edited

Loading

Uh oh!

charris commented Sep 20, 2022

Uh oh!

seberg Sep 22, 2022

Uh oh!

seiko2plus Sep 25, 2022

Uh oh!

seiko2plus Sep 25, 2022 •

edited

Loading

Uh oh!

mattip commented Sep 25, 2022

Uh oh!

Uh oh!

Uh oh!

ENH: Implement essential intrinsics required by the upcoming SIMD optimizations(0) #22306

ENH: Implement essential intrinsics required by the upcoming SIMD optimizations(0) #22306

Uh oh!

Conversation

seiko2plus commented Sep 17, 2022

Uh oh!

seiko2plus commented Sep 19, 2022

Uh oh!

charris commented Sep 19, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

charris commented Sep 20, 2022

Uh oh!

seberg Sep 22, 2022

Choose a reason for hiding this comment

Uh oh!

seiko2plus Sep 25, 2022

Choose a reason for hiding this comment

Uh oh!

seiko2plus Sep 25, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mattip commented Sep 25, 2022

Uh oh!

Uh oh!

charris commented Sep 19, 2022 •

edited

Loading

seiko2plus Sep 25, 2022 •

edited

Loading