BUG: Incorrect handling of not-equal comparison to nan #21685

WarrenWeckesser · 2022-06-07T13:45:50Z

Describe the issue:

For a sufficiently large array x, the comparison x != 0 returns False for elements of x that are nan. The result should be True. For example,

In [1]: import numpy as np

In [2]: x = np.zeros((4, 8))

In [3]: x[0,0] = np.nan

In [4]: x != 0  # The first element of the result should be True
Out[4]: 
array([[False, False, False, False, False, False, False, False],
       [False, False, False, False, False, False, False, False],
       [False, False, False, False, False, False, False, False],
       [False, False, False, False, False, False, False, False]])

In [5]: x[:2] != 0  # The result with a smaller array works as expected.
Out[5]: 
array([[ True, False, False, False, False, False, False, False],
       [False, False, False, False, False, False, False, False]])

The bug appears to have been introduced in gh-21483.

NumPy/Python version information:

In [6]: import sys, numpy; print(numpy.__version__, sys.version)
1.24.0.dev0+136.g15b92f7ab 3.10.1 (main, Jan 14 2022, 02:27:20) [GCC 11.2.0]

seberg · 2022-06-07T14:04:54Z

Ping @rafaelcfsousa, could you have a look, since it was likely your PR that introduced this?

rafaelcfsousa · 2022-06-07T14:31:13Z

@seberg: Yes, I am already checking the issue.

rafaelcfsousa · 2022-06-07T16:12:51Z

OK, I now know exactly what is causing this bug.

The comparison kernels implemented with AVX2 and AVX512 use the universal intrinsic npyv_cmpneq_f64 (see below) to compute np.not_equal for dtype=double.

numpy/numpy/core/src/common/simd/avx2/operators.h

Line 212 in 15b92f7

    #define npyv_cmpneq_f64(A, B) _mm256_castpd_si256(_mm256_cmp_pd(A, B, _CMP_NEQ_OQ))

The universal intrinsic npyv_cmpneq_f64 for both SIMD extensions listed above uses the flag _CMP_NEQ_OQ to indicate how the comparison with NaN has to be done (more info here: https://stackoverflow.com/questions/16988199/how-to-choose-avx-compare-predicate-variants).

For _CMP_NEQ_OQ (ordered comparison returns false for NaN operands):

nan != nan --> false
nan != 0 --> false

For _CMP_NEQ_UQ (unordered comparison returns true for NaN operands):

nan != nan --> true
nan != 0 --> true
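To make the difference between the two predicates concrete, here is a minimal pure-Python sketch of their truth tables. The helper names cmp_neq_oq and cmp_neq_uq are hypothetical and only mimic the ordered/unordered part of the AVX predicates on scalars; they do not model the quiet/signaling aspect or the SIMD lanes.

```python
import math


def cmp_neq_oq(a: float, b: float) -> bool:
    """Ordered not-equal (hypothetical helper): False if either operand is NaN."""
    if math.isnan(a) or math.isnan(b):
        return False
    return a != b


def cmp_neq_uq(a: float, b: float) -> bool:
    """Unordered not-equal (hypothetical helper): True if either operand is NaN."""
    if math.isnan(a) or math.isnan(b):
        return True
    return a != b


nan = float("nan")
pairs = [(nan, nan), (nan, 0.0), (1.0, 0.0), (0.0, 0.0)]

# Columns: a, b, ordered result, unordered result, Python's own `a != b`.
for a, b in pairs:
    print(a, b, cmp_neq_oq(a, b), cmp_neq_uq(a, b), a != b)
```

Only the unordered variant agrees with Python's != operator on every pair, which is the behavior np.not_equal is expected to have.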

charris · 2022-06-07T16:29:44Z

I assume this doesn't need a backport?

rafaelcfsousa · 2022-06-07T16:31:40Z

In my view, we could resolve this issue with one of the four options below:

1. Use _CMP_NEQ_UQ instead of _CMP_NEQ_OQ (but I do not yet know the impact of doing that)
2. Disable the optimization for AVX[2,512] so that SSE2 is used instead
3. Move the dtypes f32 and f64 to a new dispatchable source file (loops_comparison_fp.dispatch.c.src) and enable only SSE (and the other architectures) on it
4. Use scalar execution for AVX[2,512] when dtype=f32 or f64

seberg · 2022-06-07T16:46:15Z

Using _CMP_NEQ_OQ seems right to me at first sight, unless this is used in places where the unordered version is needed (in which case, I guess we may need both universal intrinsics?)

EDIT: Whoops, copied the wrong intrinsic there...

rafaelcfsousa · 2022-06-07T17:28:44Z

@seberg and @charris :

I think we should use _CMP_NEQ_UQ instead of _CMP_NEQ_OQ.

>>> import numpy as np
>>> np.nan != np.nan
True
>>> np.nan != 0
True

With !=, comparing anything with NaN gives True.

Btw, npyv_cmpneq_f32 and npyv_cmpneq_f64 are only used by loops_comparison.dispatch.c.src.

seberg · 2022-06-07T17:33:54Z

Sorry, yes, I named the wrong intrinsic. What I meant is that we should swap out the intrinsic/flag used; that seems like the right solution. Are you planning on making a PR?

rafaelcfsousa · 2022-06-07T17:36:56Z

Yes, I will submit a PR. Give me ~1h and I will submit it (I will test the other comparison functions as well).

WarrenWeckesser · 2022-06-07T17:48:20Z

FWIW: Locally, I just tried switching _CMP_NEQ_OQ to _CMP_NEQ_UQ for npyv_cmpneq_f32 and npyv_cmpneq_f64. The existing tests still passed, and the example that I gave above worked correctly for float32 and float64. @rafaelcfsousa, thanks for tracking down the issue so quickly!
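A check along these lines could be sketched as follows. This is not the test that was run or added to NumPy; the function name check_not_equal_nan and the size range are assumptions chosen for illustration. It sweeps array lengths so that both the vectorized loop and any scalar tail handling are exercised, for float32 and float64.

```python
import numpy as np


def check_not_equal_nan(dtype, max_size=64):
    """Return the array sizes for which `x != 0` misreports a NaN element.

    Hypothetical helper: builds a zero array with NaN in the first and last
    positions and compares `x != 0` against the expected mask.
    """
    bad_sizes = []
    for n in range(1, max_size + 1):
        x = np.zeros(n, dtype=dtype)
        x[0] = np.nan
        x[-1] = np.nan
        # Every other element is 0, so only the NaN positions differ from 0.
        expected = np.isnan(x)
        if not np.array_equal(x != 0, expected):
            bad_sizes.append(n)
    return bad_sizes


for dtype in (np.float32, np.float64):
    print(dtype.__name__, check_not_equal_nan(dtype))
```

On a build that uses the unordered predicate, both lists should be empty; on an affected build, the sizes large enough to take the AVX path would be expected to appear.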

WarrenWeckesser added the "00 - Bug" and "component: SIMD" labels Jun 7, 2022

WarrenWeckesser mentioned this issue Jun 7, 2022

BUG: regression in DIA sparse matrices with np.nan values scipy/scipy#16365

Closed

WarrenWeckesser mentioned this issue Jun 7, 2022

⚠️ CI failed on Linux_Nightly.pylatest_pip_scipy_dev ⚠️ scikit-learn/scikit-learn#23531

Closed

rafaelcfsousa mentioned this issue Jun 7, 2022

BUG: switch _CMP_NEQ_OQ to _CMP_NEQ_UQ for npyv_cmpneq_f[32,64] #21687

Merged

seberg closed this as completed in #21687 Jun 8, 2022
