MAINT: Return size_t from num_codepoints in string ufuncs Buffer class #25571

lysnikolaou · 2024-01-11T17:45:15Z

mhvk · 2024-01-11T18:18:54Z

numpy/_core/src/umath/string_ufuncs.cpp


    adjust_offsets(&start, &end, len1);
-    if (end - start < len2) {
+    if (end - start < static_cast<npy_int64>(len2)) {


Somewhat flyby, but could one write this as end < start + len2 and avoid the distracting static_cast?

My understanding is that the signedness of start + len2 is implementation-specific and depends on the size of size_t. For implementations where it's a 64-bit integer, the resulting type is also unsigned, so we end up with signed-unsigned comparison again.

If the unsigned type has conversion rank greater than or equal to the rank of the signed type, then the operand with the signed type is implicitly converted to the unsigned type.

the ranks of all signed integer types are different and increase with their precision: rank of signed char < rank of short < rank of int < rank of long int < rank of long long int

the ranks of all signed integer types equal the ranks of the corresponding unsigned integer types

Well, maybe explicit is better, then...

ngoldbaum · 2024-01-11T18:48:26Z

Looks like there's a heap buffer overflow in rstrip. Also woohoo that's the first time I've seen the compiler sanitizer job catch a bug like that.

mhvk · 2024-01-11T19:14:41Z

The sanitizer actually did a really nice job on my fft via gufunc PR (#25536). Though it still took me quite a while to figure out where I was writing a byte too many! But very nice that it found it, since everything else just passed fine.

lysnikolaou · 2024-01-12T12:34:00Z

The test failures are unrelated as far as I can tell.

mhvk · 2024-01-12T13:53:48Z

Rerunning the pypy job just in case. Is it failing also for other PRs?

lysnikolaou · 2024-01-12T13:58:36Z

I don't see any other failures, no, but I think it was a very early segfault in pypy code before the build even began. Could you maybe rerun the OpenSUSE Netlib BLAS/LAPACK one?

mhvk · 2024-01-12T14:07:40Z

Ah, sorry, missed that there was another failure. Let's see what a rerun does.

lysnikolaou · 2024-01-12T14:18:11Z

Yup, they're both okay now.

ngoldbaum · 2024-01-12T14:49:36Z

There seems to be a new 60 minute build time limit we're hitting in azure, it's unrelated to this PR.

ngoldbaum · 2024-01-12T14:50:56Z

Thanks for this!

MAINT: Return size_t from num_codepoints in string ufuncs Buffer class

4268aa4

Closes numpy#25565.

github-actions bot added the 03 - Maintenance label Jan 11, 2024

mhvk reviewed Jan 11, 2024

View reviewed changes

lysnikolaou force-pushed the num_codepoints-use-size_t branch 3 times, most recently from 339c6e5 to c347ca5 Compare January 12, 2024 10:59

Fix signedness bug in strip

67d932d

lysnikolaou force-pushed the num_codepoints-use-size_t branch from c347ca5 to 67d932d Compare January 12, 2024 11:14

ngoldbaum merged commit 05d3e81 into numpy:main Jan 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

MAINT: Return size_t from num_codepoints in string ufuncs Buffer class #25571

MAINT: Return size_t from num_codepoints in string ufuncs Buffer class #25571

Uh oh!

lysnikolaou commented Jan 11, 2024

Uh oh!

mhvk Jan 11, 2024

Uh oh!

lysnikolaou Jan 12, 2024

Uh oh!

mhvk Jan 12, 2024

Uh oh!

ngoldbaum commented Jan 11, 2024

Uh oh!

mhvk commented Jan 11, 2024

Uh oh!

lysnikolaou commented Jan 12, 2024

Uh oh!

mhvk commented Jan 12, 2024

Uh oh!

lysnikolaou commented Jan 12, 2024

Uh oh!

mhvk commented Jan 12, 2024

Uh oh!

lysnikolaou commented Jan 12, 2024

Uh oh!

ngoldbaum commented Jan 12, 2024

Uh oh!

ngoldbaum commented Jan 12, 2024

Uh oh!

Uh oh!

Uh oh!

MAINT: Return size_t from num_codepoints in string ufuncs Buffer class #25571

MAINT: Return size_t from num_codepoints in string ufuncs Buffer class #25571

Uh oh!

Conversation

lysnikolaou commented Jan 11, 2024

Uh oh!

mhvk Jan 11, 2024

Choose a reason for hiding this comment

Uh oh!

lysnikolaou Jan 12, 2024

Choose a reason for hiding this comment

Uh oh!

mhvk Jan 12, 2024

Choose a reason for hiding this comment

Uh oh!

ngoldbaum commented Jan 11, 2024

Uh oh!

mhvk commented Jan 11, 2024

Uh oh!

lysnikolaou commented Jan 12, 2024

Uh oh!

mhvk commented Jan 12, 2024

Uh oh!

lysnikolaou commented Jan 12, 2024

Uh oh!

mhvk commented Jan 12, 2024

Uh oh!

lysnikolaou commented Jan 12, 2024

Uh oh!

ngoldbaum commented Jan 12, 2024

Uh oh!

ngoldbaum commented Jan 12, 2024

Uh oh!

Uh oh!