
[WIP] Step Functions: Telemetry #12581


Draft
MEPalma wants to merge 2 commits into master

Conversation

@MEPalma (Contributor) commented May 5, 2025

Motivation

The mocked service integrations feature for Step Functions testing was recently introduced, but no telemetry was added to track its usage. These changes address that gap by recording usage of the associated environment variable, SFN_MOCK_CONFIG, without capturing the file path it points to.
Moreover, these changes introduce a new analytics usage counter to capture the number of executions by type. For now, this covers the is_mock_test_case execution type, which records whether an execution is initiated as part of a mocked service integrations test case. The counter may later be extended with additional labels, for example to distinguish between standard and express executions.

Changes

  • added SFN_MOCK_CONFIG to the analytics PRESENCE_ENV_VAR list
  • added a new execution_type_counter usage counter
  • added counting of is_mock_test_case upon start_execution calls (see the sketch below)
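For context, a minimal sketch of how these pieces could fit together, based on the counter snippets quoted in the review below; the import path and the UsageMetrics/record_execution_type names are assumptions, not code from this PR:

# Sketch only: the Counter API mirrors the snippets quoted in this PR;
# the import path below is an assumption.
from localstack.utils.analytics.metrics import Counter

class UsageMetrics:
    # Counts executions, labeled by whether they run as part of a
    # mocked service integrations test case.
    execution_type_counter = Counter(
        namespace="stepfunctions", name="execution_type", labels=["is_mock_test_case"]
    )

def record_execution_type(mock_test_case_name):
    # Hypothetical helper standing in for the start_execution call site.
    # Only a boolean is reported -- never the test case name or the
    # SFN_MOCK_CONFIG file path it derives from.
    is_mock_test_case = mock_test_case_name is not None
    UsageMetrics.execution_type_counter.labels(
        is_mock_test_case=is_mock_test_case
    ).increment()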

@MEPalma MEPalma added this to the 4.4 milestone May 5, 2025
@MEPalma MEPalma requested a review from joe4dev May 5, 2025 15:02
@MEPalma MEPalma self-assigned this May 5, 2025
@MEPalma MEPalma added the semver: minor label (Non-breaking changes which can be included in minor releases, but not in patch releases) May 5, 2025

github-actions bot commented May 5, 2025

S3 Image Test Results (AMD64 / ARM64)

  2 files  ±0    2 suites  ±0   8m 39s ⏱️ -17s
488 tests ±0  438 ✅ ±0   50 💤 ±0  0 ❌ ±0 
976 runs  ±0  876 ✅ ±0  100 💤 ±0  0 ❌ ±0 

Results for commit a50fcba. ± Comparison against base commit 9b55514.

♻️ This comment has been updated with latest results.

@MEPalma MEPalma changed the title Step Functions: Add Telemetry for SFN_MOCK_CONFIG Usage Step Functions: Add Telemetry for Mocked Integrations May 5, 2025

github-actions bot commented May 5, 2025

LocalStack Community integration with Pro

    2 files  ±0      2 suites  ±0   1h 42m 27s ⏱️ +44s
4 399 tests ±0  4 037 ✅ ±0  362 💤 ±0  0 ❌ ±0 
4 401 runs  ±0  4 037 ✅ ±0  364 💤 ±0  0 ❌ ±0 

Results for commit a50fcba. ± Comparison against base commit 9b55514.

@MEPalma MEPalma marked this pull request as ready for review May 5, 2025 17:40
@MEPalma MEPalma requested review from gregfurman and thrau as code owners May 5, 2025 17:40
@MEPalma MEPalma removed request for thrau and gregfurman May 5, 2025 17:40
@joe4dev (Member) left a comment


I added some questions and discussion items regarding the optimal place of instrumentation and the trade-off between feature/parameter and operation tracking.

@@ -6,6 +6,7 @@
import time

@joe4dev (Member): nit: I think the new standard name is analytics.py (following the Notion guide and samples in API Gateway, Lambda, etc.)

@@ -789,6 +790,10 @@ def start_execution(
         state_machine_arn_parts[1] if len(state_machine_arn_parts) == 2 else None
     )
+
+    # Count metrics about the execution type.
+    is_mock_test_case: bool = mock_test_case_name is not None
+    UsageMetrics.execution_type_counter.labels(is_mock_test_case=is_mock_test_case).increment()

@joe4dev (Member):

question: Is this the right place to increment the counter given we could raise multiple exceptions following this counter (e.g., _raise_state_machine_does_not_exist or InvalidExecutionInput)?
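
One hypothetical way to resolve this would be to count only after validation has succeeded, so that failed StartExecution calls are not recorded; the helper names below are illustrative, not the provider's actual API:

# Illustrative rearrangement: validate first, then count, so calls that
# fail with StateMachineDoesNotExist or InvalidExecutionInput are skipped.
def start_execution(self, context, request):
    state_machine = self._resolve_state_machine(request)  # may raise: state machine does not exist
    execution_input = self._parse_execution_input(request)  # may raise: InvalidExecutionInput

    is_mock_test_case = self._mock_test_case_name(request) is not None  # illustrative helper
    UsageMetrics.execution_type_counter.labels(
        is_mock_test_case=is_mock_test_case
    ).increment()
    # ... proceed to start the execution ...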


# Initialize a counter to record the use of each execution type.
execution_type_counter = Counter(
    namespace="stepfunctions", name="execution_type", labels=["is_mock_test_case"]
)

@joe4dev (Member):

naming: Isn't execution sufficient? The _type suffix sounds like an aspect that a label should cover. See "Metric & Label Best Practices" in Notion.

@joe4dev (Member):

suggestion/idea: Following the Lambda example in localstack-core/localstack/services/lambda_/analytics.py:function_counter, what about the following, focusing on the main operation(s) rather than having feature-specific counters (e.g., JSONata, is_mock_test_case, workflow_type, invocation_type, ...):

execution_counter = Counter(
    namespace="stepfunctions",
    name="execution",
    labels=[
        "is_mock_test_case",
        "workflow_type",  # standard | express
        "invocation_type",  # async | sync
    ],
)

Is it worthwhile (i.e., can we derive something actionable from it) to capture more metadata linked to executions?
In Lambda, we even use an operation field to capture create and invoke activities separately. Feel free to weigh in on what's the best model for Step Functions here.

sidenote: The dimension status might be interesting for learning about unexpected/unhandled errors (i.e., try/catch) as we collect more metadata about executions. However, if all executions happen through the API, unhandled exceptions should be tracked generically by our ASF-based telemetry (including actionable stack traces in DEBUG mode).
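
For illustration, a call site for such a multi-label counter might look as follows; the workflow_type and invocation_type predicates are assumptions, not existing variables:

# Sketch of a single increment covering the proposed labels.
execution_counter.labels(
    is_mock_test_case=mock_test_case_name is not None,
    workflow_type="express" if state_machine_is_express else "standard",
    invocation_type="sync" if is_sync_invocation else "async",
).increment()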

@joe4dev (Member):

What is your recommendation/guidance from the data side @vittoriopolverino, especially regarding the trade-off: counters for features/parameters (e.g., JSONata, is_mock_test_case, workflow_type, invocation_type) vs. for operations (e.g., execute, create) with features as labels?

@@ -789,6 +790,10 @@ def start_execution(
         state_machine_arn_parts[1] if len(state_machine_arn_parts) == 2 else None
     )
+
+    # Count metrics about the execution type.
+    is_mock_test_case: bool = mock_test_case_name is not None
+    UsageMetrics.execution_type_counter.labels(is_mock_test_case=is_mock_test_case).increment()

@joe4dev (Member):

question: Are we missing other entry points, such as start_sync_execution? Are they worth tracking?
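
If start_sync_execution were tracked as well, a sketch reusing the multi-label counter proposed above (the method signature and the is_mock_test_case value are illustrative assumptions) could be:

def start_sync_execution(self, context, request, **kwargs):
    # Hypothetical: count sync executions too, distinguished from async
    # ones via the invocation_type label proposed above.
    UsageMetrics.execution_counter.labels(
        is_mock_test_case=False,  # assumption: mock test cases start via StartExecution only
        invocation_type="sync",
    ).increment()
    # ... existing start_sync_execution logic ...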

@MEPalma MEPalma modified the milestones: 4.4, Playground May 6, 2025
@MEPalma (Contributor, Author) commented May 6, 2025

Thanks @joe4dev for sharing your thoughts!
The objective here was to quickly introduce a basic mechanism to track usage of this new feature in time for the upcoming release. Hence, optimality was not a goal; the aim was to minimize changes ahead of the release. That said, I agree that a more robust tracking strategy is worth exploring when introducing a new counter.
For now, I’m opening a separate PR that only tracks the use of the environment variable, and I’ll continue the broader discussion in this thread.

@MEPalma MEPalma marked this pull request as draft May 6, 2025 09:34
@MEPalma MEPalma changed the title Step Functions: Add Telemetry for Mocked Integrations [WIP] Step Functions: Telemetry May 6, 2025
@MEPalma (Contributor, Author) commented May 6, 2025

Follow-up PR: #12584

Labels
semver: minor (Non-breaking changes which can be included in minor releases, but not in patch releases)

2 participants