1842: New xlsx option for ignoring certain nodes for improved performance #2132

hofnarwillie · 2022-09-09T16:53:34Z

Summary

This PR is to mitigate the issue described here: #1842
When the xlsx document contains a single dataValidation applied to the entire worksheet (or a very large range) using a cell address like A1:AAA1048000 then the DataValidationsXform.parseClose() function loops through each cell in the range and unless you have a super computer it runs out of memory. In some cases these validations are not relevant to the context in which the file is being parsed, so a simple performance improvement is to skip over specific XLSX nodes (in this case dataValidations).

Some unrelated linting issues were fixed automatically by the husky pre-commit. I've left them in this PR.

Test plan

Included a couple of integration tests and ensured all the other tests still pass. Also updated the README to describe the new option.

Related to source code (for typings update)

Added typescript typings for options passed into workbook.xlsx.readFile() and workbook.xlsx.load(). I have not made any changes to the stream.xslx.* interfaces. Would appreciate some input here from the maintainers of the package as I don't know if it is necessary (i.e. if the stream based implementations will use the same options and parsing techniques).

…ance

aonamrata · 2023-01-06T08:43:50Z

can someone from the maintainers group review this?
@guyonroche , @alubbe or @Siemienik

Siemienik · 2023-01-06T09:42:47Z

I would like, however currently I'm overloaded

Siemienik

Great idea on the topic of performance optimization. Thank you for your contribution! I added some proposals to update Readme and reduce unnecessary if. I hope that's not a problem for you 😄 @hofnarwillie, I appreciate your effort, the tests added, and your code is cool 👍

@skypesky or @zurmokeeper, would you like to update README_zh?

Will it merge on the next MergeFest

This PR was reviewed during the MergeFest session.

lib/xlsx/xform/sheet/worksheet-xform.js

spec/integration/data/.gitignore

README.md

zurmokeeper · 2023-05-06T01:59:33Z

Great idea on the topic of performance optimization. Thank you for your contribution! I added some proposals to update Readme and reduce unnecessary if. I hope that's not a problem for you 😄 @hofnarwillie, I appreciate your effort, the tests added, and your code is cool 👍

@skypesky or @zurmokeeper, would you like to update README_zh?

Will it merge on the next MergeFest

This PR was reviewed during the MergeFest session.

Sure, I'm happy to do that, and after this pr is merged into master, I'll raise a new PR to update README_zh.

hofnarwillie and others added 3 commits September 9, 2022 17:35

1842: New xlsx option for ignoring certain nodes for improved perform…

0d10573

…ance

Included test file

6f44152

Added typescript typings

e973737

hofnarwillie mentioned this pull request Sep 15, 2022

Out of Heap Memory #412

Open

Siemienik self-assigned this Apr 6, 2023

Merge branch 'master' into master

886857f

hofnarwillie mentioned this pull request May 2, 2023

ExcelJS not able to load file #1621

Open

Siemienik approved these changes May 5, 2023

View reviewed changes

lib/xlsx/xform/sheet/worksheet-xform.js Outdated Show resolved Hide resolved

spec/integration/data/.gitignore Outdated Show resolved Hide resolved

README.md Outdated Show resolved Hide resolved

zurmokeeper mentioned this pull request Aug 5, 2023

[F] New xlsx option for ignoring certain nodes for improved performance zurmokeeper/excelize#22

Closed

Siemienik and others added 4 commits September 21, 2023 22:39

optimizing ifs

0e2170b

new line EOF

5cd82db

Update README.md - available options

f88df56

Merge branch 'master' into master

050c418

Siemienik merged commit 3178efd into exceljs:master Sep 21, 2023

Vishwas1 mentioned this pull request Sep 8, 2024

[Snyk] Upgrade: cookie-parser, exceljs, express, express-validator, node-fetch, https-localhost, nodemailer, hypersign-auth-js-sdk, mongoose, url-parse, web3 hypersign-protocol/hyperfyre-frontend#1979

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

1842: New xlsx option for ignoring certain nodes for improved performance #2132

1842: New xlsx option for ignoring certain nodes for improved performance #2132

Uh oh!

hofnarwillie commented Sep 9, 2022 •

edited

Loading

Uh oh!

aonamrata commented Jan 6, 2023

Uh oh!

Siemienik commented Jan 6, 2023

Uh oh!

Siemienik left a comment •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zurmokeeper commented May 6, 2023

Will it merge on the next MergeFest

Uh oh!

Uh oh!

1842: New xlsx option for ignoring certain nodes for improved performance #2132

1842: New xlsx option for ignoring certain nodes for improved performance #2132

Uh oh!

Conversation

hofnarwillie commented Sep 9, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Related to source code (for typings update)

Uh oh!

aonamrata commented Jan 6, 2023

Uh oh!

Siemienik commented Jan 6, 2023

Uh oh!

Siemienik left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Will it merge on the next MergeFest

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zurmokeeper commented May 6, 2023

Will it merge on the next MergeFest

Uh oh!

Uh oh!

hofnarwillie commented Sep 9, 2022 •

edited

Loading

Siemienik left a comment •

edited

Loading