ENH: Add SIMD operation copysign #19770

howjmay · 2021-08-28T13:42:10Z

The benchmark run in CI gave the following result.

    before           after         delta
    [79a8986b]       [bdd62b3]
    <master>         <simd>

-   546±0μs          490±0μs        10.26%  bench_core.PackBits.time_copysign

rgommers · 2021-09-11T13:06:33Z

Thanks @howjmay! Would you be able to add the results from an asv benchmark to the PR description? For an example, see gh-17102. benchmarks/bench_ufunc.py seems to have a copysign benchmark; if it doesn't cover the case you are optimizing then maybe you can add a new one?

howjmay · 2021-10-04T09:31:04Z

Hi @rgommers I have added copysign to the benchmark!

rgommers · 2021-10-04T20:34:51Z

Thanks @howjmay! It would be good to add the results to the PR description, in the before after format like in gh-17102. That way it's immediately clear that this PR improves performance and to what extent.

howjmay · 2021-10-05T01:50:50Z

Hi @rgommers, I have appended the result of the benchmark run in CI. Please take a look. Thanks!

rgommers

Thanks @howjmay. The speedup is only ~10%, so that seems a bit marginal, but on the other hand the code complexity here isn't very high. So it may be a decent tradeoff; it's not quite clear to me. Any opinions @seiko2plus, @Qiyu8, @mattip?

rgommers · 2021-10-05T16:07:37Z

benchmarks/benchmarks/bench_ufunc.py

@@ -81,6 +81,8 @@ def setup(self):
        self.i = np.ones(150000, dtype=np.int32)
        self.f = np.zeros(150000, dtype=np.float32)
        self.d = np.zeros(75000, dtype=np.float64)
+
+        self.tf = np.ones(150000, dtype=np.int32)


Isn't this the same as self.i, so you can reuse that instead?

seberg · 2021-10-05

Just to point this out: you do not want to use an integer array in the test, since that will trigger casting. You need to use a homogeneous signature (i.e. both inputs are float32 or both are float64).
The speedup may still not be huge (this is probably memory-bound anyway), but adding a cast might hide most of it even if it is huge.
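
The casting behaviour described above can be checked directly. The following is a minimal sketch, not part of the PR; the array names are illustrative, and the exact list printed by `np.copysign.types` may differ between NumPy versions.

```python
import numpy as np

# copysign only provides floating-point inner loops, so other dtypes are cast first.
print(np.copysign.types)

ints = np.ones(8, dtype=np.int32)
f32 = np.ones(8, dtype=np.float32)
f64 = np.ones(8, dtype=np.float64)

# Integer inputs are cast to float64 before the loop runs.
cast_result = np.copysign(ints, -ints)

# Homogeneous float inputs use the matching loop without any cast.
f32_result = np.copysign(f32, -f32)
f64_result = np.copysign(f64, -f64)

print(cast_result.dtype, f32_result.dtype, f64_result.dtype)
```

With `int32` inputs the timing therefore includes the conversion to `float64`, which is why a benchmark built on `self.i`-style arrays can mask any gain in the float loops themselves.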

howjmay · 2021-10-06

Hi @rgommers, @seberg, do you know how I can run two benchmarks to compare the original implementation with the SIMD implementation? I have changed the benchmark to use float64, but the elapsed time increased. That seems quite odd, and the 10% speedup looks odd to me too. I expected a much larger speedup.

rgommers · 2021-10-06

For running the benchmark, you could reorder your commits so the benchmark one comes first, and put a new branch name pointing to that commit. Then you can compare that new branch with the simd-copysign one.

seberg · 2021-10-06

Reordering commits should not even be necessary (although maybe nice) if you use --bench-compare from asv through runtests: https://numpy.org/doc/stable/benchmarking.html

Not sure if the docs need updating; I feel I always have to run it from the benchmarks folder with ../runtests.py.
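
For the comparison to measure the float loops rather than a cast, the benchmark itself needs homogeneous float inputs. Below is a hedged sketch of what such an asv-style benchmark could look like. The class name `CopysignHomogeneous` and the `*_sign` attributes are hypothetical and do not come from the PR; the array sizes are taken from the existing `setup` shown in the diff above.

```python
import numpy as np


class CopysignHomogeneous:
    """Hypothetical asv-style benchmark; names are illustrative only."""

    def setup(self):
        # Same sizes as the existing float arrays in bench_ufunc.py's setup.
        self.f = np.zeros(150000, dtype=np.float32)
        self.d = np.zeros(75000, dtype=np.float64)
        # Sign sources with the same dtype, so no casting is triggered.
        self.f_sign = -np.ones(150000, dtype=np.float32)
        self.d_sign = -np.ones(75000, dtype=np.float64)

    def time_copysign_float32(self):
        np.copysign(self.f, self.f_sign)

    def time_copysign_float64(self):
        np.copysign(self.d, self.d_sign)


# Manual smoke run; asv itself calls setup() before timing each time_* method.
bench = CopysignHomogeneous()
bench.setup()
bench.time_copysign_float32()
bench.time_copysign_float64()
```

Keeping one method per dtype means a regression or speedup in either loop shows up as its own row in the comparison output.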

seiko2plus · 2021-11-03

There's a misunderstanding here: this patch only adds universal intrinsics for copysign, without actually using them in the inner loop of the ufunc, so there's no need for a benchmark.
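
As background on what a copysign primitive computes, the operation reduces to masking and combining sign bits. The sketch below illustrates this in pure Python for IEEE 754 binary64 values. It is not the PR's code (NumPy's universal intrinsics are written in C), and the names `copysign_bits`, `SIGN_MASK`, and `MAG_MASK` are made up for this example.

```python
import math
import struct

SIGN_MASK = 0x8000000000000000  # top bit of an IEEE 754 binary64 value
MAG_MASK = 0x7FFFFFFFFFFFFFFF   # all remaining bits (exponent and mantissa)


def copysign_bits(magnitude: float, sign: float) -> float:
    """Return `magnitude` carrying the sign bit of `sign`, using bitwise ops only."""
    m_bits = struct.unpack("<Q", struct.pack("<d", magnitude))[0]
    s_bits = struct.unpack("<Q", struct.pack("<d", sign))[0]
    # Keep the magnitude bits of the first operand and the sign bit of the second.
    out_bits = (m_bits & MAG_MASK) | (s_bits & SIGN_MASK)
    return struct.unpack("<d", struct.pack("<Q", out_bits))[0]


# Cross-check against the standard library on a few sample pairs.
for a, b in [(1.5, -2.0), (-3.0, 4.0), (2.0, -0.0)]:
    assert copysign_bits(a, b) == math.copysign(a, b)
```

A vectorized version applies the same AND/OR pair to every lane at once, which is cheap enough that memory bandwidth, not arithmetic, tends to dominate the runtime.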

seiko2plus · 2021-11-03T16:46:21Z

@howjmay, first of all, thank you for your interest in improving the performance of NumPy; we really appreciate your efforts. But again, there's no need to add new intrinsics for copysign, for the same reasons I mentioned in #19780 (review).

howjmay · 2021-11-16T18:09:54Z

Thank you for letting me know!

InessaPawson · 2022-08-10T01:10:39Z

@howjmay Did you mean to reopen this PR? If not, please close it once again when you get a chance.

charris · 2023-02-20T20:12:51Z

@seiko2plus Should this be closed (again)?

seiko2plus · 2023-02-21T01:04:03Z

@charris, yes, unless @howjmay decides to re-implement it along the lines of #19780

charris · 2023-02-21T02:23:57Z

Closing. @howjmay If you wish to pursue this, please make another PR.

github-actions bot added the 01 - Enhancement label Aug 28, 2021

howjmay force-pushed the simd-copysign branch 15 times, most recently from c466922 to 5bc0d7a Compare August 29, 2021 06:50

howjmay marked this pull request as ready for review August 29, 2021 08:04

howjmay force-pushed the simd-copysign branch from 5bc0d7a to 9acdd62 Compare August 29, 2021 08:04

howjmay changed the title ~~ENH: Add SIM copysign~~ ENH: Add SIMD operation copysign Aug 29, 2021

seberg added the component: SIMD Issues in SIMD (fast instruction sets) code or machinery label Sep 8, 2021

howjmay force-pushed the simd-copysign branch 4 times, most recently from ae737db to bde468f Compare September 12, 2021 10:39

howjmay force-pushed the simd-copysign branch from bde468f to bdd62b3 Compare October 4, 2021 08:50

rgommers reviewed Oct 5, 2021

howjmay force-pushed the simd-copysign branch from bdd62b3 to 75e1ded Compare October 6, 2021 03:20

ENH: Add SIMD operation copysign

fe4effa

howjmay force-pushed the simd-copysign branch from 75e1ded to fe4effa Compare October 6, 2021 03:21

howjmay closed this Nov 16, 2021

howjmay reopened this Nov 16, 2021

seiko2plus mentioned this pull request Dec 30, 2022

ENH: Add SIMD operation signbit #19748

Closed

charris added the triage review Issue/PR to be discussed at the next triage meeting label Feb 20, 2023

seiko2plus added the 57 - Close? Issues which may be closable unless discussion continued label Feb 21, 2023

charris closed this Feb 21, 2023
