[ESM] Add configurable poll frequency and log shard info #12415

gregfurman · 2025-03-19T23:08:04Z

Motivation

To address performance shortcomings, users should be able to configure the poller frequency as well as the maximum backoff on errors and on empty polls.

Changes

StreamPollers no longer back off when a GetRecords call is empty.
StreamPollers back off when no shards have been initialised.
More debug logs have been added to show the shard count.
Adds the following config variables:
- LAMBDA_EVENT_SOURCE_MAPPING_POLL_INTERVAL_SEC -> sets the poll interval (in seconds) for all pollers.
- LAMBDA_EVENT_SOURCE_MAPPING_MAX_BACKOFF_ON_ERROR_SEC -> sets the maximum backoff on poller exceptions.
- LAMBDA_EVENT_SOURCE_MAPPING_MAX_BACKOFF_ON_EMPTY_POLL_SEC -> sets the maximum backoff on empty polls.
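
A minimal sketch of how settings like these could be read and applied, assuming they are plain environment variables parsed as floats. The helper names (`env_float`, `next_backoff`), the doubling strategy, and every default except the 60-second error cap are illustrative assumptions, not the code in this PR.

```python
import os


def env_float(name: str, default: float) -> float:
    """Read an environment variable as a float, falling back to a default."""
    raw = os.environ.get(name, "").strip()
    return float(raw) if raw else default


# Defaults other than the 60 s error cap are placeholders for illustration only.
POLL_INTERVAL_SEC = env_float("LAMBDA_EVENT_SOURCE_MAPPING_POLL_INTERVAL_SEC", 1.0)
MAX_BACKOFF_ON_ERROR_SEC = env_float(
    "LAMBDA_EVENT_SOURCE_MAPPING_MAX_BACKOFF_ON_ERROR_SEC", 60.0
)
MAX_BACKOFF_ON_EMPTY_POLL_SEC = env_float(
    "LAMBDA_EVENT_SOURCE_MAPPING_MAX_BACKOFF_ON_EMPTY_POLL_SEC", 10.0
)


def next_backoff(current: float, base: float, maximum: float) -> float:
    """Double the wait after a failed or empty poll, capped at `maximum`.

    Doubling is an assumed strategy; the PR only states that a maximum exists.
    """
    return min(max(current, base) * 2, maximum)
```

With this shape, the cap is the only value a user needs to tune: the wait grows from the poll interval until it reaches the configured maximum and then stays there.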

github-actions · 2025-03-19T23:17:58Z

S3 Image Test Results (AMD64 / ARM64)

2 files ±0 2 suites ±0 9m 4s ⏱️ +12s
486 tests ±0 436 ✅ ±0 50 💤 ±0 0 ❌ ±0
972 runs ±0 872 ✅ ±0 100 💤 ±0 0 ❌ ±0

Results for commit c964218. ± Comparison against base commit 00e64fd.

♻️ This comment has been updated with latest results.

github-actions · 2025-03-20T00:13:27Z

LocalStack Community integration with Pro

2 files ±0 2 suites ±0 1h 51m 15s ⏱️ -29s
4 304 tests +2 3 983 ✅ ±0 321 💤 +2 0 ❌ ±0
4 306 runs +2 3 983 ✅ ±0 323 💤 +2 0 ❌ ±0

Results for commit c964218. ± Comparison against base commit 00e64fd.

♻️ This comment has been updated with latest results.

joe4dev · 2025-03-20T09:31:10Z

localstack-core/localstack/services/lambda_/event_source_mapping/pollers/stream_poller.py

@@ -164,7 +170,7 @@ def poll_events_from_shard(self, shard_id: str, shard_iterator: str):
        records = get_records_response.get("Records", [])
        if not records:
            self.shards[shard_id] = get_records_response["NextShardIterator"]
-            raise EmptyPollResultsException(service=self.event_source(), source_arn=self.source_arn)


That's going to help a lot with responsiveness; good that we caught this accidental delay introducer 😬

This will effectively prevent backoff when querying empty streams, right?

Yeah, unfortunately. A GetRecords call may return no records while iterating across a shard.

We should rethink how best to approach this. With Kinesis, for example, we have MillisBehindLatest, which is 0 when the iterator is caught up. So a 0 value with no new records returned could warrant some backoff.

joe4dev · 2025-03-20T09:35:54Z

localstack-core/localstack/config.py

+
+# INTERNAL: 60 (default)
+# Maximum duration (in seconds) to wait between retries when an event source poll fails.
+LAMBDA_EVENT_SOURCE_MAPPING_MAX_BACKOFF_ON_ERROR_SEC = float(


Thanks for marking this as internal 👍

I understand that it can be helpful for quick experimentation with some customers, but we should be generally cautious making everything configurable, which introduces extra complexity. These are candidates to remove if there is no strong need in favor of choosing sensible defaults.

Yeah, agreed. Since these are not internal AWS behaviour though, I think we should make these configurable.

dfangl

Looks good, just some minor comments

dfangl · 2025-03-20T09:42:37Z

localstack-core/localstack/services/lambda_/event_source_mapping/esm_worker.py

-MAX_BACKOFF_POLL_ERROR_SEC: float = 60
+POLL_INTERVAL_SEC: float = LAMBDA_EVENT_SOURCE_MAPPING_POLL_INTERVAL_SEC
+MAX_BACKOFF_POLL_EMPTY_SEC: float = LAMBDA_EVENT_SOURCE_MAPPING_MAX_BACKOFF_ON_EMPTY_POLL_SEC
+MAX_BACKOFF_POLL_ERROR_SEC: float = LAMBDA_EVENT_SOURCE_MAPPING_MAX_BACKOFF_ON_ERROR_SEC


Importing them like this and then assigning them to static variables prevents us from monkeypatching easily, which we might want to do for tests. Would it be a lot of work to access the config directly where it is used?
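
To illustrate the concern, here is a small self-contained sketch. The `config` object is a stand-in for the real config module, and the value and function names are hypothetical. It shows that a value copied at import time does not change when the config attribute is patched later, while a lookup at the point of use does.

```python
from types import SimpleNamespace

# Stand-in for the config module; the value is illustrative.
config = SimpleNamespace(LAMBDA_EVENT_SOURCE_MAPPING_POLL_INTERVAL_SEC=1.0)

# Pattern from the diff: the value is copied once, when the module is imported.
POLL_INTERVAL_SEC: float = config.LAMBDA_EVENT_SOURCE_MAPPING_POLL_INTERVAL_SEC


def interval_from_copy() -> float:
    # Reads the module-level copy, which was frozen at import time.
    return POLL_INTERVAL_SEC


def interval_from_config() -> float:
    # Looks the value up on every call, so a patched config is picked up.
    return config.LAMBDA_EVENT_SOURCE_MAPPING_POLL_INTERVAL_SEC


# Simulate what a test would do when patching the config attribute.
config.LAMBDA_EVENT_SOURCE_MAPPING_POLL_INTERVAL_SEC = 0.1

print(interval_from_copy(), interval_from_config())
```

With the copy pattern, a test would have to patch every module-level constant separately instead of patching the config once.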

dfangl · 2025-03-20T10:34:48Z

localstack-core/localstack/services/lambda_/event_source_mapping/pollers/stream_poller.py

@@ -164,7 +170,7 @@ def poll_events_from_shard(self, shard_id: str, shard_iterator: str):
        records = get_records_response.get("Records", [])
        if not records:
            self.shards[shard_id] = get_records_response["NextShardIterator"]
-            raise EmptyPollResultsException(service=self.event_source(), source_arn=self.source_arn)


This will effectively prevent backoff when querying empty streams, right?

gregfurman · 2025-03-20T12:03:51Z

@dfangl Unfortunately we can't reliably back off when a GetRecords call is empty, since this could be expected behaviour. I'll make a note of this as a comment, but here's the relevant snippet from the docs page "GetRecords returns an empty records array even when there is data in the stream":

Consuming, or getting records is a pull model. Developers are expected to call GetRecords in a continuous loop with no back-offs. Every call to GetRecords also returns a ShardIterator value, which must be used in the next iteration of the loop.

Which goes on to say:

[...] An empty Records element is returned under two conditions:

1. There is no more data currently in the shard.
2. There is no data near the part of the shard pointed to by the ShardIterator.

And we can't reliably tell whether this is (1) or (2) which makes this back-off incorrect if we're doing pull-based retrieval.

However.... we could actually use the MillisBehindLatest 🤔 since:

The number of milliseconds the GetRecords response is from the tip of the stream, indicating how far behind current time the consumer is. A value of zero indicates that record processing is caught up, and there are no new records to process at this moment.

So a zero value of MillisBehindLatest with no new records should indicate we can backoff 💡
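
A minimal sketch of that check, assuming the GetRecords response is available as a dict with the `Records` and `MillisBehindLatest` keys. The function name `should_back_off` is hypothetical, and treating a missing `MillisBehindLatest` as "cannot tell, do not back off" is an assumption for sources that may not return the field.

```python
from typing import Any, Mapping


def should_back_off(get_records_response: Mapping[str, Any]) -> bool:
    """Return True only when the poll was empty AND the iterator is caught up."""
    if get_records_response.get("Records"):
        # Records were returned, so keep polling at the normal interval.
        return False
    millis_behind = get_records_response.get("MillisBehindLatest")
    # If the field is missing we cannot tell condition (1) from (2): no backoff.
    return millis_behind == 0
```

An empty response with a non-zero `MillisBehindLatest` corresponds to condition (2) in the quoted docs, where the poller should continue with the next shard iterator without delay.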

dfangl · 2025-03-20T12:08:34Z

@gregfurman Thank you for linking the docs! Yes - the empty get records call is something we also discussed yesterday. I would like to bring up two points for this one:

1. Against AWS this makes perfect sense. However, did we check whether kinesis-mock also has this behavior? Are we fixing a potential problem in theory, or did we actually experience the issue? To be clear, I am fine with removing this for now, and the ESM should really work against AWS as well; I am just asking out of interest.
2. This will again increase the number of requests against the gateway. That is fine, since it is not higher than a couple of weeks ago, but do we have data about what improvements we actually lose this way?

Just using this PR as discussion point for this, please do not let it hold back the merge :)

[ESM] Add configurable poll frequency and log shard info

124f132

gregfurman added the "area: performance" and "aws:lambda:event-source-mapping" labels Mar 19, 2025

gregfurman added this to the 4.3 milestone Mar 19, 2025

gregfurman self-assigned this Mar 19, 2025

gregfurman added the "semver: patch" and "semver: minor" labels and removed the "semver: patch" label Mar 20, 2025

gregfurman marked this pull request as ready for review March 20, 2025 09:07

gregfurman requested review from joe4dev, dominikschubert, dfangl and thrau as code owners March 20, 2025 09:07

joe4dev approved these changes Mar 20, 2025

dfangl approved these changes Mar 20, 2025

address comments

c964218

gregfurman merged commit e70207f into master Mar 20, 2025
38 checks passed

gregfurman deleted the add/esm/configurable-polling branch March 20, 2025 13:16

gregfurman mentioned this pull request Mar 20, 2025

feature request: Empty Kinesis stream backing off configuration #12363

Open

1 task
