Added 'prescan' option to loadtxt for array allocation prior to reading #144

dhomeier · 2011-08-23T15:55:52Z

ENH: implement 2-pass reading in loadtxt to avoid problems with
excessive memory usage on large data files. The extra parsing
typically takes 10% of the total read-in time; 30-50% for compressed
files.

Setting 'prescan=True' will parse the valid data lines of the input
file in a first pass, then allocate an array to read the data directly
into (row by row), bypassing the creation of an input list with the
associated high memory usage.
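
For readers who want to see the idea in isolation, here is a minimal sketch of the two-pass scheme described above. It is not the code from this PR: `load_two_pass` and `_data_fields` are hypothetical names, only whitespace- or single-delimiter-separated float columns are handled, and the input is assumed to be a seekable text file.

```python
import io
import numpy as np


def _data_fields(fh, comments="#", delimiter=None):
    """Yield the split fields of every non-empty, non-comment line."""
    for line in fh:
        line = line.split(comments, 1)[0].strip()
        if line:
            yield line.split(delimiter)


def load_two_pass(fh, comments="#", delimiter=None):
    """Hypothetical two-pass reader; `fh` must be a seekable text file."""
    # Pass 1: count the valid rows and take the column count from the first.
    nrows, ncols = 0, 0
    for fields in _data_fields(fh, comments, delimiter):
        if nrows == 0:
            ncols = len(fields)
        nrows += 1

    # Allocate the result once, then rewind for the second pass.
    out = np.empty((nrows, ncols), dtype=float)
    fh.seek(0)

    # Pass 2: convert each row directly into the preallocated array,
    # so no intermediate list of all rows is ever built.
    for i, fields in enumerate(_data_fields(fh, comments, delimiter)):
        out[i] = [float(x) for x in fields]
    return out


if __name__ == "__main__":
    sample = io.StringIO("# header\n1 2 3\n\n4 5 6  # trailing comment\n")
    print(load_two_pass(sample))
```

The `fh.seek(0)` call is what ties this sketch to seekable inputs; a pipe or socket cannot be rewound for the second pass.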

pv · 2011-09-02T01:32:38Z

I think this is not the correct approach (it will not work with streams, etc.). A cleaner one would be:

1. Read the first N lines of the file to determine the number of columns.
2. After that, resize the array dynamically using `.resize()` while loading.
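
A rough single-pass sketch of that suggestion follows. The function name `load_growing`, the `initial_rows` parameter, and the doubling growth policy are assumptions for illustration, not something proposed in this thread; for brevity the column count is taken from the first valid line rather than the first N lines. The only NumPy call relied on beyond `np.empty` is the in-place `ndarray.resize`.

```python
import numpy as np


def load_growing(lines, comments="#", delimiter=None, initial_rows=1024):
    """Hypothetical single-pass reader; `lines` is any iterable of str."""
    out = None
    nrows = 0
    for line in lines:
        line = line.split(comments, 1)[0].strip()
        if not line:
            continue  # skip blank and comment-only lines
        fields = line.split(delimiter)
        if out is None:
            # The first valid line fixes the number of columns.
            out = np.empty((initial_rows, len(fields)), dtype=float)
        elif nrows == out.shape[0]:
            # Buffer is full: double the row capacity in place. For a
            # C-contiguous array the rows already stored are preserved.
            out.resize((2 * nrows, out.shape[1]), refcheck=False)
        out[nrows] = [float(x) for x in fields]
        nrows += 1
    if out is None:
        return np.empty((0, 0), dtype=float)
    # Trim the unused capacity at the end.
    out.resize((nrows, out.shape[1]), refcheck=False)
    return out


if __name__ == "__main__":
    # A generator stands in for a non-seekable stream.
    stream = (f"{i} {2 * i}\n" for i in range(5))
    print(load_growing(stream, initial_rows=2))
```

Because the input is consumed exactly once, this variant also works on pipes and other sources that cannot be rewound. The cost is an occasional reallocation, plus up to twice the final array size in temporary capacity.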

charris · 2013-05-03T04:05:52Z

Looks like it is time to close this.

charris closed this May 3, 2013

njsmith mentioned this pull request May 2, 2016

Segmentation fault in PyArray_Item_INCREF #7595

Closed

roachlord mentioned this pull request Feb 18, 2019

Passing MagicMock to np.dtype causes a segmentation fault #12982

Closed

luyahan pushed a commit to plctlab/numpy that referenced this pull request Apr 25, 2024

Merge pull request numpy#144 from howjmay/vshr_n_s8

1395060

feat: Add vshr_n_s8
