-
-
Notifications
You must be signed in to change notification settings - Fork 1.5k
Sort merge chunking opens an extra file #6944
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Labels
Comments
karlmcdowall
added a commit
to karlmcdowall/coreutils
that referenced
this issue
Dec 13, 2024
Fix bug uutils#6944 Rework the way batching is done with sort such that it doesn't open more input files than necessary. Previously, the code would always open one extra input file which causes problems in ulimit scenarios. Add additional test case.
karlmcdowall
added a commit
to karlmcdowall/coreutils
that referenced
this issue
Dec 13, 2024
Fix bug uutils#6944 Rework the way batching is done with sort such that it doesn't open more input files than necessary. Previously, the code would always open one extra input file which causes problems in ulimit scenarios. Add additional test case.
karlmcdowall
added a commit
to karlmcdowall/coreutils
that referenced
this issue
Dec 13, 2024
Fix bug uutils#6944 Rework the way batching is done with sort such that it doesn't open more input files than necessary. Previously, the code would always open one extra input file which causes problems in ulimit scenarios. Add additional test case.
karlmcdowall
added a commit
to karlmcdowall/coreutils
that referenced
this issue
Dec 13, 2024
Fix bug uutils#6944 Rework the way batching is done with sort such that it doesn't open more input files than necessary. Previously, the code would always open one extra input file which causes problems in ulimit scenarios. Add additional test case.
karlmcdowall
added a commit
to karlmcdowall/coreutils
that referenced
this issue
Dec 14, 2024
Fix bug uutils#6944 Rework the way batching is done with sort such that it doesn't open more input files than necessary. Previously, the code would always open one extra input file which causes problems in ulimit scenarios. Add additional test case.
karlmcdowall
added a commit
to karlmcdowall/coreutils
that referenced
this issue
Dec 14, 2024
Fix bug uutils#6944 Rework the way batching is done with sort such that it doesn't open more input files than necessary. Previously, the code would always open one extra input file which causes problems in ulimit scenarios. Add additional test case.
karlmcdowall
added a commit
to karlmcdowall/coreutils
that referenced
this issue
Dec 14, 2024
Fix bug uutils#6944 Rework the way batching is done with sort such that it doesn't open more input files than necessary. Previously, the code would always open one extra input file which causes problems in ulimit scenarios. Add additional test case.
karlmcdowall
added a commit
to karlmcdowall/coreutils
that referenced
this issue
Dec 15, 2024
Fix bug uutils#6944 Rework the way batching is done with sort such that it doesn't open more input files than necessary. Previously, the code would always open one extra input file which causes problems in ulimit scenarios. Add additional test case.
karlmcdowall
added a commit
to karlmcdowall/coreutils
that referenced
this issue
Dec 15, 2024
Fix bug uutils#6944 Rework the way batching is done with sort such that it doesn't open more input files than necessary. Previously, the code would always open one extra input file which causes problems in ulimit scenarios. Add additional test case.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
As part of investigating this issue, found that the chunking logic for merges opens one file more than necessary.
Issue is here...
So, for example, if you had a batch size of 2, the call to
batches.next()
above would open the first file, thenmerge_without_limit
would iterate two more times (i.e. batch size of two) resulting in three open input-files rather than just the two required. This might seem like a minor issue, but it's impacting GNU compatibility for tests that run under ulimit with limited file descriptors available.I have a fix for this that I plan to PR shortly.
The text was updated successfully, but these errors were encountered: