
gh-74028: Introduce a prefetch parameter to Executor.map to handle large iterators #114975

Closed
Jason-Y-Z wants to merge 2 commits from the fix-issue-29842 branch

Conversation

Jason-Y-Z
Contributor

@Jason-Y-Z Jason-Y-Z commented Feb 3, 2024

Introduce a prefetch parameter to Executor.map, so that large and even unbounded iterators can be handled.
This is a continuation of #18566, with backward compatibility: when the new prefetch parameter is not specified, we default to the current behaviour.
cc @graingert @rdarder @kumaraditya303 @brianquinlan


📚 Documentation preview 📚: https://cpython-previews--114975.org.readthedocs.build/
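
With this branch applied, usage would look roughly like the following sketch (the prefetch value of 4 and the simulate function are purely illustrative, not part of the patch):

```python
import itertools
from concurrent.futures import ProcessPoolExecutor

def simulate(seed):
    return seed * seed  # stand-in for an expensive computation

if __name__ == "__main__":
    with ProcessPoolExecutor() as executor:
        # Only a bounded window of tasks is submitted ahead of consumption,
        # so the unbounded input below cannot exhaust memory.
        for result in executor.map(simulate, itertools.count(), prefetch=4):
            if result >= 1_000_000:
                break
```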

@Jason-Y-Z Jason-Y-Z force-pushed the fix-issue-29842 branch 7 times, most recently from 46ea84e to 67b7b0f on February 4, 2024 at 11:24
@Jason-Y-Z Jason-Y-Z changed the title bpo-29842: Introduce a prefetch parameter to Executor.map to handle l… gh-114948: Introduce a prefetch parameter to Executor.map to handle l… Feb 6, 2024
@gaogaotiantian
Member

Code itself aside, I don't think this PR solves the issue at hand.

Yes, with the prefetch argument, the executor will only schedule a certain number of tasks at the beginning. However, if you then enumerate the result iterator, it has exactly the same effect: the manager thread gets blocked by a large number of submits.

Also, this design means the worker processes do no work until the user requests results, which is very counter-intuitive. If the user maps a large amount of data, they would expect the executor to process it in the background, not sit idle until results are asked for.

So, from my personal perspective, this design is not what we want for the executor. If you want to fix this, you will need more than this.

Member

@gaogaotiantian gaogaotiantian left a comment

I read the docs again and they clearly state:

the iterables are collected immediately rather than lazily;

So this could break backward compatibility. Also, the original issue is solvable with chunksize. Unfortunately, I don't think this PR is worth more effort.
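
For reference, the chunksize route on ProcessPoolExecutor looks like this (the chunk size of 1000 is arbitrary):

```python
from concurrent.futures import ProcessPoolExecutor

def square(x):
    return x * x

if __name__ == "__main__":
    with ProcessPoolExecutor() as executor:
        # Each submission now carries 1000 items instead of one, which cuts
        # the per-item scheduling and IPC overhead by roughly that factor.
        results = list(executor.map(square, range(1_000_000), chunksize=1000))
```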

@vinismarques

Also the original issue is solvable by chunksize

Unfortunately chunksize is not available for ThreadPoolExecutor, which is best suited for I/O operations. Do you know of any better alternatives that can handle large iterators?

@gaogaotiantian
Member

Unfortunately chunksize is not available for ThreadPoolExecutor, which is best suited for I/O operations. Do you know of any better alternatives that can handle large iterators?

I believe the reason ThreadPoolExecutor does not have that argument is that thread pools do not suffer from the problem. There is no expensive wakeup pipe or queue involved in submit, so you can handle large iterators just fine.
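
That said, if you do want to bound how much of a huge or unbounded iterator is submitted to a ThreadPoolExecutor at once, one workaround today is to feed the pool manually. A minimal sketch follows (the bounded_map name and the limits are illustrative, and results are yielded out of order):

```python
import itertools
from concurrent.futures import FIRST_COMPLETED, ThreadPoolExecutor, wait

def bounded_map(func, iterable, max_workers=8, in_flight=32):
    """Yield results (unordered) while keeping at most in_flight tasks pending."""
    args = iter(iterable)
    with ThreadPoolExecutor(max_workers=max_workers) as executor:
        pending = {executor.submit(func, a)
                   for a in itertools.islice(args, in_flight)}
        while pending:
            done, pending = wait(pending, return_when=FIRST_COMPLETED)
            # Top the pool back up: one new submission per completed task.
            for a in itertools.islice(args, len(done)):
                pending.add(executor.submit(func, a))
            for fut in done:
                yield fut.result()

# Example: consume an unbounded input without submitting everything up front.
for value in bounded_map(lambda n: n * n, itertools.count(), in_flight=16):
    if value > 10_000:
        break
```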

@Jason-Y-Z Jason-Y-Z changed the title gh-114948: Introduce a prefetch parameter to Executor.map to handle l… bpo-29842: Introduce a prefetch parameter to Executor.map to handle l… Feb 10, 2024
@Jason-Y-Z
Contributor Author

Thanks for the discussion! @graingert @rdarder @kumaraditya303 @brianquinlan, would love your thoughts as well.

@Jason-Y-Z
Contributor Author

Jason-Y-Z commented Feb 10, 2024

So this could break backward compatibility.

I'm not sure I'm fully following here, but my intention was that when prefetch is left at its default of None, we submit all tasks at the beginning. I'm fairly sure that's how I implemented it, but I may well have missed something, so I'm very open to feedback here. A concrete example would be really useful. @gaogaotiantian
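
For context, the general shape of this dispatch logic is roughly the following simplified sketch (not the actual diff in this PR; names and defaults are illustrative): prefetch=None keeps today's eager submission, while an integer bounds how many futures are outstanding.

```python
import collections
import itertools

def lazy_map(executor, fn, iterable, prefetch=None):
    args = iter(iterable)
    if prefetch is None:
        # Default: submit everything up front, exactly like the current map.
        fs = collections.deque(executor.submit(fn, a) for a in args)
    else:
        # Submit only the first `prefetch` items to begin with.
        fs = collections.deque(executor.submit(fn, a)
                               for a in itertools.islice(args, prefetch))

    def result_iterator():
        try:
            while fs:
                result = fs.popleft().result()
                if prefetch is not None:
                    # Replace the consumed task with the next input, if any.
                    for a in itertools.islice(args, 1):
                        fs.append(executor.submit(fn, a))
                yield result
        finally:
            for f in fs:
                f.cancel()

    return result_iterator()
```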

Also the original issue is solvable by chunksize.

Do we mean the issue in bpo-29842 as well?

Unfortunately, I don't think this PR is worth more effort.

No worries at all, less work for me if that's the case :)

@gaogaotiantian
Member

I'm not sure I'm fully following here, but my intention was that when prefetch is left at its default of None, we submit all tasks at the beginning. I'm fairly sure that's how I implemented it, but I may well have missed something, so I'm very open to feedback here. A concrete example would be really useful. @gaogaotiantian

My wording in that comment was inaccurate. What I really meant is: with the prefetch argument set, the iterables will not be resolved immediately, contrary to what the current docs state. That is definitely not the end of the world, especially since prefetch itself conveys the idea of "lazily". However, I would still be concerned, because it makes the behavior more complicated than what is currently documented.

Do we mean the issue in bpo-29842 as well?

No, that one will not be solved by chunksize.

I think the fundamental factor we need to weigh before proceeding with this is the gain versus the cost. What do we get from the change?

  • Making Executor.map behave like the built-in map sounds great and consistent, but we can't have that. What we can have is an optional "less eager" mode; that's not the consistency we hoped for.
  • When will prefetch be used? Is there any benefit to using it when the input is not infinite (compared to chunksize)? The current implementation only submits a task when a result is requested, which causes a delay in communication, so doing it lazily might make the whole iterable take much longer to finish (see the sketch after this comment).
  • If this only helps when the iterable is infinite, how common is that? What is the workaround, and is it acceptable?

I'm not saying no; this is just how I think about the issue. It might be why the original issue has been sitting there for a couple of years - it may not be a serious issue for most users.
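
To make the trade-off in the second bullet concrete, here is a small runnable sketch of the current eager behaviour (the sleep durations are arbitrary). Today's Executor.map submits everything up front, so the workers keep running even while the consumer is busy; a prefetch-style lazy map would delay the later submissions until results are consumed, so the same slow consumer stretches the total run time.

```python
import time
from concurrent.futures import ThreadPoolExecutor

def work(i):
    time.sleep(0.1)
    return i

with ThreadPoolExecutor(max_workers=4) as executor:
    start = time.monotonic()
    results = executor.map(work, range(8))  # all 8 tasks submitted eagerly
    time.sleep(1)      # the consumer is busy, but the workers keep running
    list(results)      # the results are already available, no extra wait
    print(f"total: {time.monotonic() - start:.1f}s")  # ~1.0s with eager map
```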

@Jason-Y-Z
Contributor Author

Thanks for the comments! @gaogaotiantian

However, I would still be concerned, because it makes the behavior more complicated than what is currently documented.

Happy to update the documentation if helpful.

When will prefetch be used? Is there any benefit to using it when the input is not infinite (compared to chunksize)? The current implementation only submits a task when a result is requested, which causes a delay in communication, so doing it lazily might make the whole iterable take much longer to finish.

I guess the main rationale is to give users the flexibility to choose how many items from the input iterator they would like to have in flight at a time. This is certainly useful in the infinite-iterator case, and I imagine it is also useful whenever I don't want too many tasks processed at once.
For example, I might be sending requests to downstream services and I don't want those services to be overloaded, so prefetch would come in handy as a simple rate-limiting mechanism. (Might be a poor example, but I hope you get my point.)

@gaogaotiantian
Member

Sorry, but I still do not get the rationale for it. I understand your statements, but they did not convince me that this feature is worth the effort. The main concern is the usage: beyond the prefetched tasks, the current implementation will not submit a task until a result is requested, and that just does not feel right to me.

You'll need some core dev behind this anyway (I'm not one). If you can find someone who likes the idea, you might be able to make some progress on this PR.

@Jason-Y-Z Jason-Y-Z changed the title bpo-29842: Introduce a prefetch parameter to Executor.map to handle l… gh-74028: Introduce a prefetch parameter to Executor.map to handle l… Feb 18, 2024
@Jason-Y-Z
Contributor Author

Jason-Y-Z commented Feb 19, 2024

Hey @gpshead, sorry for tagging, but since this is concurrency-related, I thought you might be interested.

KangOl added a commit to odoo-dev/upgrade-util that referenced this pull request Jun 6, 2024
TLDR: RTFM

Once upon a time, in a countryside farm in Belgium...

At first, the upgrade of databases was straightforward. But, as time
passed, the size of the databases grew, and some CPU-intensive
computations took so much time that a solution needed to be found.
Fortunately, the Python standard library has the perfect module for this
task: `concurrent.futures`.
Then, Python 3.10 appeared, and the usage of `ProcessPoolExecutor`
started to sometimes hang for no apparent reason. Soon, our hero found
out he wasn't the only one to suffer from this issue[^1].
Unfortunately, the proposed solution looked like overkill. Still, it
revealed that the issue had already been known[^2] for a few years.
Although an official patch wasn't ready to be committed, the
discussion about its legitimacy[^3] led our hero to a nicer solution.

By default, `ProcessPoolExecutor.map` submits elements one by one to the
pool. This is pretty inefficient when there are a lot of elements to
process. This can be changed by using a large value for the *chunksize*
argument.

Who would have thought that a bigger chunk size would solve a
performance issue?
As always, the response was in the documentation[^4].

[^1]: https://stackoverflow.com/questions/74633896/processpoolexecutor-using-map-hang-on-large-load
[^2]: python/cpython#74028
[^3]: python/cpython#114975 (review)
[^4]: https://docs.python.org/3/library/concurrent.futures.html#concurrent.futures.Executor.map
robodoo pushed a commit to odoo/upgrade-util that referenced this pull request Jun 6, 2024

closes #94

Signed-off-by: Nicolas Seinlet (nse) <nse@odoo.com>
@hugovk hugovk changed the title gh-74028: Introduce a prefetch parameter to Executor.map to handle l… gh-74028: Introduce a prefetch parameter to Executor.map to handle large iterators Jun 14, 2024
@Jason-Y-Z Jason-Y-Z closed this Aug 3, 2024
@Jason-Y-Z Jason-Y-Z deleted the fix-issue-29842 branch August 3, 2024 19:40
@ebonnal
Contributor

ebonnal commented Oct 17, 2024

Hi, FYI here is a follow-up PR: #125663 🙏🏻
