Document that cache() and lru_cache() do not have a "call once" guarantee #103475


Closed
okrcma opened this issue Apr 12, 2023 · 8 comments
Labels
docs Documentation in the Doc dir

Comments

@okrcma

okrcma commented Apr 12, 2023

Documentation

The documentation for @functools.cache states that

The cache is threadsafe so the wrapped function can be used in multiple threads.

but it is not threadsafe before the value is cached: two threads calling the wrapped function concurrently can each execute it and receive different results.

In the following example, the wrapped function will return two different list instances to the two threads.

import functools
from threading import Thread
from time import sleep


@functools.cache
def get_cached_object():
    sleep(1)  # stand-in for some logic which takes some time to execute
    return list()


def run():
    print(f"{id(get_cached_object())}\n")


if __name__ == "__main__":
    first_thread = Thread(target=run)
    second_thread = Thread(target=run)

    first_thread.start()
    second_thread.start()

    sleep(2)  # wait for both threads to finish
    run()

This is an issue, for example, when @functools.cache is used to implement singletons (we can leave the debate about singletons being bad practice out of this).

The documentation should either not claim that the cache is threadsafe, or it should include an explicit warning about this situation.


okrcma added the docs (Documentation in the Doc dir) label on Apr 12, 2023
@sunmy2019
Member

From the source code, it seems that only
cached_property() - computed once per instance, cached as an attribute
provides the "call once" guarantee.

cache is about "saving computation" rather than a "call once" guarantee, since it is built on top of lru_cache.
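
For illustration, a minimal sketch of the per-instance attribute caching that cached_property performs (the Config class and settings property are invented for this example); whether concurrent first accesses are also serialized into a single call is a separate, version-dependent question (see the reference to #101890 further down):

import functools


class Config:
    @functools.cached_property
    def settings(self):
        print("loading settings")  # runs once per instance in single-threaded use
        return {"debug": False}


c = Config()
c.settings  # prints "loading settings" and stores the dict on the instance
c.settings  # served from the instance attribute, no recomputation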

@rhettinger
Contributor

The term threadsafe means different things to different people. Here, it is correctly used in the narrow sense to indicate that the underlying data structure updates take place with a lock (or GIL) held. That makes it safe to use in a multithreaded environment without fear that the data structure will become incoherent. This stands in marked contrast to structures like the pure Python version of OrderedDict or random.gauss() where the invariants get broken during multi-threaded updates. We're mostly consistent about this in the docs where we say that random.random, dict.setdefault, and deque.append are threadsafe.

I'll add a clarifying note to the cache() and lru_cache() docs to note that there is no "call once" guarantee. We've never made that claim and have intentionally called the underlying function outside of the lock. In general, it isn't reliable or safe to leave a lock open across a call to arbitrary user code.
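
For callers who do need at-most-one execution today, a minimal workaround sketch (not part of functools; get_settings and _load_settings are invented names) is to serialize calls to the cached function behind an explicit lock:

import functools
import threading

_settings_lock = threading.Lock()


@functools.cache
def _load_settings():
    # expensive work that must run at most once
    return {"debug": False}


def get_settings():
    # The lock serializes concurrent first calls, so only one thread executes
    # _load_settings(); afterwards the cached value is returned and the lock
    # is held only for the cheap cache lookup.
    with _settings_lock:
        return _load_settings()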

rhettinger changed the title from '@functools.cache is NOT thread safe' to 'Document that cache() and lru_cache() do not have a "call once" guarantee' on Apr 13, 2023
rhettinger added a commit to rhettinger/cpython that referenced this issue Apr 21, 2023
miss-islington pushed a commit to miss-islington/cpython that referenced this issue Apr 22, 2023
GH-103475: cache() and lru_cache() do not have a "call once" guarantee (pythonGH-103669)

(cherry picked from commit e5eaac6)

Co-authored-by: Raymond Hettinger <rhettinger@users.noreply.github.com>
AlexWaygood pushed a commit that referenced this issue Apr 22, 2023
GH-103475: cache() and lru_cache() do not have a "call once" guarantee (GH-103669) (#103682)
(cherry picked from commit e5eaac6)

Co-authored-by: Raymond Hettinger <rhettinger@users.noreply.github.com>
@petergaultney

I'll add a clarifying note to the cache() and lru_cache() docs to note that there is no "call once" guarantee. We've never made that claim and have intentionally called the underlying function outside of the lock. In general, it isn't reliable or safe to leave a lock open across a call to arbitrary user code.

I'm curious about this statement. I understand that locks, in general, do not compose. But the current implementation uses a reentrant lock for the caching bits, and I can't come up with a reason we couldn't do an efficient double-checked lock with that reentrant lock even in the case that the user code (perversely) chooses to recurse.

Assuming I'm wrong about the above... would the CPython team be open to a PR that would include a conditional in the lru_cache decorator (perhaps lock_call: bool = False) that would conditionally apply the existing reentrant lock to the call to user_function? It would slightly complicate the code, but my coworkers and I run into the need for something like this (essentially, the computation or underlying 'side effect' is so expensive that we cannot afford to have it called twice in a multithreading context) so frequently that we really wish it could eventually be part of the stdlib.

@sunmy2019
Member

sunmy2019 commented May 12, 2023

would the CPython team be open to a PR

You can open a PR, but it is hard to handle this correctly.

Note that you cannot simply use one lock to guard the whole function: the lru_cache may be re-entered during the call to the user function. Consider this:

from functools import lru_cache
from concurrent.futures import ThreadPoolExecutor


def heavy_things(n: int):
    # Called in a worker thread; calls back into the cached function f()
    # while an outer call to f() is still in progress.
    return f(n - 1) + f(n - 2)


@lru_cache
def f(n):
    if n > 1:
        # Part of the computation happens in another thread, so f() is
        # re-entered before this call has returned.
        with ThreadPoolExecutor() as p:
            result = p.map(heavy_things, [n])
        return sum(result)
    return 1


print(f(10))

This demonstrates that some of the computation may happen in another thread.

Your implementation must not cause a deadlock in this case.

My recommendation would be to write your own custom implementation.

@petergaultney

yes, this makes sense.

for what it's worth, we do have our own implementation. it's just unfortunate that we have to have it.

An updated suggestion for a possible API that would be backward-compatible and put as much power as possible in the hands of the user: allow lru_cache to take a func_lock argument, which defaults to None. If provided, it is a user-supplied lock (ideally nothing more than a context manager wrapping whatever locking implementation they wish to provide), and user_function is called inside that context.

I feel confident this would not break existing tests, and it should not be terribly difficult to write a threaded test (although I am not sure what would constitute sufficient 'negative proof' of race conditions).

@petergaultney

petergaultney commented May 13, 2023

Actually, I'd revise my proposal. The parameter would be key_lock, and its type would be key_lock: Callable[[Hashable], ContextManager] (a callable that takes the cache key and returns a context manager).

Then, if this were not None, _lru_wrapper would call user_function inside that context, as the following pseudocode demonstrates:

key = make_key(...)

def _check_cache(key):
    ...  # same cache-lookup code as currently exists

cache_hit = _check_cache(key)

if not cache_hit:
    # double-checked: re-check under the per-key lock before calling user_function
    with key_lock(key):
        cache_hit = _check_cache(key)
        if not cache_hit:
            cache_hit = add_result_to_cache(key, user_function(*args, **kwds))

return cache_hit.result

This allows the user to opt into granular locking per unique call. There are various ways to implement this; our implementation uses threading.Event to release waiting threads after the winning thread has performed the work.
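
As a rough sketch of that kind of implementation (not the stdlib's; call_once_per_key is an invented name, only hashable positional arguments are supported, and exceptions are cached alongside results), a per-key "call once" wrapper built on threading.Event might look like this:

import threading
from functools import wraps


def call_once_per_key(func):
    outcomes = {}              # key -> (exception or None, result)
    events = {}                # key -> Event, set once the outcome is stored
    guard = threading.Lock()   # protects the dicts; never held around func()

    @wraps(func)
    def wrapper(*args):
        key = args
        with guard:
            event = events.get(key)
            winner = event is None
            if winner:
                event = events[key] = threading.Event()
        if winner:
            try:
                outcomes[key] = (None, func(*args))   # called outside the guard
            except BaseException as exc:
                outcomes[key] = (exc, None)
            finally:
                event.set()                           # release waiting threads
        else:
            event.wait()
        exc, result = outcomes[key]
        if exc is not None:
            raise exc
        return result

    return wrapper

Unlike lru_cache, a thread that requests a key while another thread is still computing that same key will block until the winner finishes, so the re-entrancy scenario shown above has to be kept in mind.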

@rhettinger
Contributor

would the CPython team be open to a PR that would include a conditional in the lru_cache decorator (perhaps lock_call: bool = False) that would conditionally apply the existing reentrant lock to the call to user_function?

I don't think so. This would be feature creep for the cache to handle a niche use case. IIRC other cache implementations I looked at did not have this feature.

Also note that the C version doesn't even have its own lock, so there is nothing to attach this to. And for the Python version, I'm wary of ever leaving an open lock across an arbitrary user function call — that is a recipe for hard to find bugs.

Perhaps make your own package for implementing call-once behavior and post it to the Python Package Index. There, it can get a thorough shake-down and we can see if there is any uptake by the user community. FWIW, OrderedDict makes it easy to implement your own lru_cache variants.
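
As a sketch of that suggestion (simple_lru_cache is an invented name; it supports only hashable positional arguments and keeps no hit/miss statistics or locking), an lru_cache variant built on OrderedDict could look like this:

from collections import OrderedDict
from functools import wraps


def simple_lru_cache(maxsize=128):
    def decorator(func):
        cache = OrderedDict()

        @wraps(func)
        def wrapper(*args):
            if args in cache:
                cache.move_to_end(args)    # mark as most recently used
                return cache[args]
            result = func(*args)
            cache[args] = result
            if len(cache) > maxsize:
                cache.popitem(last=False)  # evict the least recently used entry
            return result

        return wrapper
    return decorator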

@sunmy2019
Member

would the CPython team be open to a PR that would include a conditional in the lru_cache decorator (perhaps lock_call: bool = False) that would conditionally apply the existing reentrant lock to the call to user_function?

I don't think so now. We are moving the other way. See #101890
