fix put-metric-alarm test failure rate #12598

pinzon · 2025-05-08T16:33:01Z

Motivation

As discussed internally, the test_put_metric_alarm test currently has a failure rate of around 14%, which makes it frustrating during full test suite runs. The root cause appears to be that the alarm-triggering value can vary in LocalStack—sometimes evaluating to 21 instead of the expected average 21.5 (which is consistently correct on AWS). This is likely due to timing differences or slight inconsistencies in how LocalStack handles metric ingestion and evaluation windows.

Changes

Increase the threshold of the CW Alarm to make the Alarm triggering value consistent between LS and AWS.

Testing

updated snapshots

github-actions · 2025-05-08T17:12:00Z

LocalStack Community integration with Pro

2 files ± 0 2 suites ±0 43m 16s ⏱️ - 1h 0m 22s
1 039 tests - 3 386 980 ✅ - 3 062 59 💤 - 324 0 ❌ ±0
1 041 runs - 3 386 980 ✅ - 3 062 61 💤 - 324 0 ❌ ±0

Results for commit 9c95b14. ± Comparison against base commit a7b4250.

This pull request removes 3386 tests.

tests.aws.scenario.bookstore.test_bookstore.TestBookstoreApplication ‑ test_lambda_dynamodb
tests.aws.scenario.bookstore.test_bookstore.TestBookstoreApplication ‑ test_opensearch_crud
tests.aws.scenario.bookstore.test_bookstore.TestBookstoreApplication ‑ test_search_books
tests.aws.scenario.bookstore.test_bookstore.TestBookstoreApplication ‑ test_setup
tests.aws.scenario.kinesis_firehose.test_kinesis_firehose.TestKinesisFirehoseScenario ‑ test_kinesis_firehose_s3
tests.aws.scenario.lambda_destination.test_lambda_destination_scenario.TestLambdaDestinationScenario ‑ test_destination_sns
tests.aws.scenario.lambda_destination.test_lambda_destination_scenario.TestLambdaDestinationScenario ‑ test_infra
tests.aws.scenario.loan_broker.test_loan_broker.TestLoanBrokerScenario ‑ test_prefill_dynamodb_table
tests.aws.scenario.loan_broker.test_loan_broker.TestLoanBrokerScenario ‑ test_stepfunctions_input_recipient_list[step_function_input0-SUCCEEDED]
tests.aws.scenario.loan_broker.test_loan_broker.TestLoanBrokerScenario ‑ test_stepfunctions_input_recipient_list[step_function_input1-SUCCEEDED]
…

♻️ This comment has been updated with latest results.

tiurin · 2025-05-12T15:47:57Z

tests/aws/services/cloudwatch/test_cloudwatch.snapshot.json

@@ -403,7 +403,7 @@
            "StateUpdatedTimestamp": "timestamp",
            "StateValue": "INSUFFICIENT_DATA",
            "Statistic": "Average",
-            "Threshold": 2.0,
+            "Threshold": 21.0,


Good adjustment of threshold to be equal to one of the data points! 👍 Indeed since metric data is added sequentially in a loop and alarm evaluation happens in a separate thread it is likely that in those 14% of cases evaluation happened after 21.0 was added but before 22.0. Since 21 was already bigger than 2 alarm was triggered. Now it shouldn't be the case - alarm needs both data points to be triggered.

tiurin · 2025-05-12T16:11:45Z

tests/aws/services/cloudwatch/test_cloudwatch.py

@@ -1338,8 +1338,8 @@ def test_put_metric_alarm(
        retry(
            _sqs_messages_snapshot,
            retries=60,
-            sleep=3 if is_aws_cloud() else 1,
-            sleep_before=5 if is_aws_cloud() else 0,
+            sleep=3,


Here retries are applied to receiving messages from SQS. Increasing sleep won't change the resulting alarm. Currently alarm has period of 30 seconds, 60 retries wit sleep 1 covers it more than enough. Locally test succeeds exactly after 30 seconds.

In fact, this made me think that we can reduce period to a minimum value of 10 and therefore shave 20 seconds off test execution!

I've pushed changes to the branch - reverted sleeps and reduced period to 10. Test now consistently executes successfully locally in just over 10 seconds. If you're happy with those changes please merge the PR! Otherwise happy to revert if there is any consideration.

pinzon · 2025-05-13T14:07:24Z

The changing of the period is a better change than the edition of retry. Thanks @tiurin. 👍

fix put-metric-alarm test failure rate

03f834a

pinzon added the semver: patch Non-breaking changes which can be included in patch releases label May 8, 2025

pinzon added this to the 4.5 milestone May 8, 2025

extend seconds

2443f16

pinzon marked this pull request as ready for review May 8, 2025 18:53

pinzon requested a review from steffyP as a code owner May 8, 2025 18:53

pinzon marked this pull request as draft May 8, 2025 18:53

pinzon added 2 commits May 8, 2025 13:55

trigger

63f8577

trigger

2031e0a

pinzon marked this pull request as ready for review May 9, 2025 15:37

pinzon assigned tiurin May 9, 2025

tiurin added 2 commits May 12, 2025 18:23

Revert local sleep increase

57b9b01

Reduce alarm evaluation period to 10 seconds

9c95b14

tiurin approved these changes May 12, 2025

View reviewed changes

pinzon merged commit 05f6746 into master May 13, 2025
32 checks passed

pinzon deleted the cw/fix/test-metric-alarm branch May 13, 2025 14:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix put-metric-alarm test failure rate #12598

fix put-metric-alarm test failure rate #12598

Uh oh!

pinzon commented May 8, 2025

Uh oh!

github-actions bot commented May 8, 2025 •

edited

Loading

Uh oh!

tiurin May 12, 2025

Uh oh!

tiurin May 12, 2025

Uh oh!

tiurin May 12, 2025

Uh oh!

tiurin May 12, 2025

Uh oh!

pinzon commented May 13, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fix put-metric-alarm test failure rate #12598

fix put-metric-alarm test failure rate #12598

Uh oh!

Conversation

pinzon commented May 8, 2025

Motivation

Changes

Testing

Uh oh!

github-actions bot commented May 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

LocalStack Community integration with Pro

Uh oh!

tiurin May 12, 2025

Choose a reason for hiding this comment

Uh oh!

tiurin May 12, 2025

Choose a reason for hiding this comment

Uh oh!

tiurin May 12, 2025

Choose a reason for hiding this comment

Uh oh!

tiurin May 12, 2025

Choose a reason for hiding this comment

Uh oh!

pinzon commented May 13, 2025

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented May 8, 2025 •

edited

Loading