BUG: revise string_from_pyobj/try_pyarr_from_string with respect to malloc and copy. #18759

pearu · 2021-04-12T20:26:28Z

charris · 2021-04-12T21:16:58Z

Note that you can simplify by using raw strings to dispense with most of the escaped characters.

In [14]: r""" "a"\t \ """
Out[14]: ' "a"\\t \\ '

Just avoid having three or more " in a row.
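
To make the suggestion concrete, here is a minimal sketch comparing the two spellings. The C fragment is a made-up illustration, not code taken from cfuncs.py.

```python
# Hypothetical C fragment of the kind embedded in Python source as a string.
# Escaped form: every quote and backslash needs its own escape.
escaped = "fprintf(stderr, \"a\\tb\\n\");"

# Raw triple-quoted form: quotes and backslashes are written as in C.
raw = r"""fprintf(stderr, "a\tb\n");"""

# Both spellings produce the same Python string.
assert escaped == raw

# The IPython example from this comment, checked the same way.
sample = r""" "a"\t \ """
assert sample == ' "a"\\t \\ '
```

The raw form is easier to compare against the generated C, which is the main benefit for template strings like these.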

charris · 2021-04-12T21:24:24Z

We also prefer the style

if (something) {
    foo;
}
else {
    bar;
}

Although it is hard to enforce with outside code coming in.

pearu · 2021-04-13T10:54:22Z

> We also prefer the style ...

Applied the style to the changeset of this PR.

pearu · 2021-04-14T07:32:43Z

I have marked this PR as a draft because the handling of nulls is still inconsistent. For instance, the second item in the output of #15311 (comment) is unexpected: all the nulls are replaced except the last one, resulting in a bytes object of length 4 when the length should be 5.

I am starting to think that the current approach of replacing nulls with spaces is wrong. IIRC, the approach tries to fix a Fortran I/O issue where nulls are not included in the output. But this appears to be a non-issue: the fixed-size character arrays have the correct width, and it is only on output that null values take zero width.
Please correct me if the reason for replacing nulls with spaces is something else.

So my plan is to use the same interpretation of nulls that numpy uses. For instance:

>>> np.array(b"\00a\00\00")
array(b'\x00a', dtype='|S4')
>>> np.array(b"\00a\00\00").tobytes()
b'\x00a\x00\x00'

The trailing nulls are not shown in the repr of the ndarray object, but they are still present, as the bytes buffer output shows.
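
The same interpretation can be sketched in plain Python, without NumPy. The helper names below are hypothetical and are not part of f2py or NumPy; this only illustrates the rule "keep the full-width buffer, ignore trailing nulls when reading the value".

```python
def pad_to_width(value: bytes, width: int) -> bytes:
    """Null-pad value to a fixed width (hypothetical helper)."""
    if len(value) > width:
        raise ValueError(f"value needs {len(value)} bytes, only {width} available")
    return value.ljust(width, b"\x00")


def strip_trailing_nulls(buf: bytes) -> bytes:
    """Drop trailing nulls only; leading and interior nulls are kept."""
    return buf.rstrip(b"\x00")


buf = pad_to_width(b"\x00a", 4)
assert buf == b"\x00a\x00\x00"                # full buffer, like tobytes()
assert strip_trailing_nulls(buf) == b"\x00a"  # value, like the array repr
assert len(buf) == 4                          # the width is unchanged

# For contrast, replacing every null with a space rewrites the data itself.
assert buf.replace(b"\x00", b" ") == b" a  "
```

Under this rule the leading null survives, which is exactly what replacing nulls with spaces would lose.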

This may be a backward-compatibility (BC) sensitive change, and it requires updating the f2py docs, but the docs are broken anyway: #15311

eric-wieser · 2021-04-14T12:04:50Z

I've very little experience with Fortran, and have no idea why replacing the nulls with spaces is a good idea. I'll defer to your judgement here.

seberg · 2021-04-14T18:28:49Z

In general, stripping all trailing nulls seems like the safest bet to me as well (including not replacing NULLs, although I have no clue about possible backward incompatibility). Using NULL bytes inside the string is fairly brittle to begin with, so I am willing to bet that the combination of NULL bytes and string wrapping just isn't a thing (and if it is, it will fail loudly).

pearu · 2021-05-26T10:20:23Z

If nobody objects and there is no further review activity, I'll merge this PR in two days.

eric-wieser

Looks good now, thanks - just a few minor style comments, and a possible follow-up

pearu · 2021-05-26T12:57:32Z

@eric-wieser thanks for the review!

Here's more follow-up:

Apply lint/flake8 to all f2py files at once, see #18759 (comment)
Use Python 3 CAPI features to cleanup PRINTPYOBJERR #19106

WarrenWeckesser · 2021-06-08T14:08:23Z

The scipy tests with the "nightly" numpy wheel are experiencing Python interpreter crashes in the code that uses the Fortran library L-BFGS-B (see scipy/scipy#14203). f2py is used to wrap that code. Running a git bisect on numpy shows that the crashes started with numpy commit 8992459, which is part of this PR. I haven't yet looked at that commit very closely, so I don't know if the problem is a numpy bug, or if the changes exposed a scipy bug. I'll look into it, but another set of eyes on the PR might find a problem that was missed in the initial review.

I created a separate issue for this: #19201

rgommers · 2021-06-13T13:01:00Z

@pearu @melissawm I'd like to revert this PR until gh-19201 is resolved. The bug is most likely in this PR. Even if it is not and the bug is in SciPy, we still have a problem, because leaving this in breaks the most recent SciPy release. It has also broken SciPy CI, which prevents working on a release blocker.

It would also be good to use the SciPy test suite as the f2py test suite. That will prevent this kind of thing showing up in CI afterwards.

pearu added the component: numpy.f2py label Apr 12, 2021

pearu self-assigned this Apr 12, 2021

github-actions bot added the 00 - Bug label Apr 12, 2021

pearu mentioned this pull request Apr 12, 2021

BUG: Fix invalid read in f2py string_from_pyobj #18646

Closed

eric-wieser reviewed Apr 12, 2021

numpy/f2py/cfuncs.py (outdated)

charris reviewed Apr 12, 2021

numpy/f2py/cfuncs.py (outdated)

eric-wieser reviewed Apr 13, 2021

numpy/f2py/cfuncs.py (outdated)

eric-wieser reviewed Apr 13, 2021

numpy/f2py/cfuncs.py (outdated)

eric-wieser reviewed Apr 13, 2021

numpy/f2py/cfuncs.py (outdated)

eric-wieser reviewed Apr 13, 2021

numpy/f2py/cfuncs.py (outdated)

eric-wieser reviewed Apr 13, 2021

numpy/f2py/tests/test_string.py (outdated)

pearu requested review from charris and eric-wieser April 13, 2021 15:17

eric-wieser reviewed Apr 13, 2021

numpy/f2py/cfuncs.py (outdated)

eric-wieser reviewed Apr 13, 2021

numpy/f2py/cfuncs.py

eric-wieser reviewed Apr 13, 2021

numpy/f2py/cfuncs.py (outdated)

pearu mentioned this pull request Apr 13, 2021

f2py should use npy_intp instead of int for buffer size types. #18767

Open

eric-wieser reviewed Apr 13, 2021

numpy/f2py/cfuncs.py (outdated)

eric-wieser reviewed Apr 13, 2021

numpy/f2py/cfuncs.py (outdated)

pearu requested review from seberg and eric-wieser April 13, 2021 19:24

pearu mentioned this pull request Apr 13, 2021

BUG: f2py not triggering ValueError with smaller-than-expected byte-size input in array #15311

Closed

pearu marked this pull request as draft April 13, 2021 20:33

pearu force-pushed the gh-18431-string_from_pyobj branch from e2a3f6e to 7f5523f Compare May 10, 2021 14:46

eric-wieser reviewed May 19, 2021

numpy/f2py/cfuncs.py (outdated)

eric-wieser reviewed May 19, 2021

numpy/f2py/cfuncs.py

pearu added 2 commits May 23, 2021 22:51

Apply reviewers comments. Thanks to @eric-wieser!

0930786

Fix lint

5942b33

pearu requested a review from eric-wieser May 23, 2021 20:04

eric-wieser reviewed May 23, 2021

numpy/f2py/cfuncs.py (outdated)

Apply reviewer nit

eae806c

pearu requested a review from eric-wieser May 23, 2021 20:38

pearu mentioned this pull request May 25, 2021

BUG: Raise ValueError on strings with smaller than expected byte-size #18427

Closed

eric-wieser reviewed May 26, 2021

numpy/f2py/cfuncs.py (outdated)

pearu added 3 commits May 26, 2021 14:54

MAINT: apply sizeof(char)==1

8c6e74d

Add internal check for array contiguity.

870bf6c

Fix lint

b30cf59

eric-wieser reviewed May 26, 2021

numpy/f2py/rules.py (outdated)

Update numpy/f2py/rules.py

6eeab25

Co-authored-by: Eric Wieser <wieser.eric@gmail.com>

eric-wieser approved these changes May 26, 2021

numpy/f2py/cfuncs.py

numpy/f2py/cfuncs.py (outdated)

numpy/f2py/tests/test_string.py (outdated)

pearu mentioned this pull request May 26, 2021

Use Python 3 CAPI features to cleanup PRINTPYOBJERR #19106

Closed

Minor fixes

09856ec

pearu merged commit 4c93c93 into numpy:main May 28, 2021

pearu deleted the gh-18431-string_from_pyobj branch May 28, 2021 04:29

WarrenWeckesser mentioned this pull request Jun 8, 2021

SciPy crash with nightly numpy wheel--possible f2py issue #19201

Closed

rgommers mentioned this pull request Jun 13, 2021

Revert "BUG: revise string_from_pyobj/try_pyarr_from_string with respect to malloc and copy." #19235

Merged

pearu restored the gh-18431-string_from_pyobj branch June 13, 2021 21:44

pearu deleted the gh-18431-string_from_pyobj branch June 13, 2021 22:47

pearu mentioned this pull request Jun 15, 2021

BUG: revise string_from_pyobj/try_pyarr_from_string with respect to malloc and copy (the second round) #19251

Merged

danbeibei mentioned this pull request Sep 7, 2023

BUG: f2py string optional inout argument produces try_pyarr_from_string error #24662

Closed
