Add structured metrics instrumentation #12230

vittoriopolverino · 2025-02-05T11:35:21Z

Motivation

The existing Counter implementation follows a parsing-based approach, relying on metric name patterns like label1:label2:* and inferring implicit relationships through JSON structures, dictionaries, or other nested data formats. Since these structures can vary depending on the feature or how each feature owner decides to implement them, querying, filtering, and aggregating metrics become inconsistent and complex. This lack of explicit structure makes it harder to standardize analysis across different use cases.

This refactor introduces a structured and extensible approach to metric collection, enforcing clear label standards and dimensional analysis while making all relationships and classifications explicit, rather than relying on naming conventions, positional placement, or nested JSON structures. By ensuring consistency, we can simplify querying, filtering, and aggregations, making analytics more scalable and reducing the need for custom metric handling.

Changes

Moved metrics to metrics.py, making them a standalone concept, not strictly tied to usage.py.
Introduced MetricRegistry as a singleton for managing all registered metrics (Counters, Gauges, etc.).
Introduced Metric as the base interface for metrics, enforcing the implementation of the collect() method in all metric types (e.g., Counters, Gauges).
Renamed the published event from ls:usage to ls_metrics.
Set a limit of 8 labels per metric to ensure labels are used effectively as dimensions for time-series analysis, filtering, and aggregations, preventing misuse or unnecessary complexity. A dedicated guide will follow to outline best practices.
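
To make the relationship between Metric and MetricRegistry concrete, here is a minimal sketch of how a base interface with an enforced collect() method and a singleton registry could fit together. It is not the code in this PR: only the names Metric, MetricRegistry, and collect() come from the description above. CounterSketch, register(), the payload shape, and the duplicate check are assumptions made for illustration.

```python
from __future__ import annotations

import threading
from abc import ABC, abstractmethod
from typing import Any


class Metric(ABC):
    """Base interface: every concrete metric type must implement collect()."""

    def __init__(self, namespace: str, name: str) -> None:
        if not namespace or not name:
            raise ValueError("namespace and name must be non-empty")
        self.namespace = namespace
        self.name = name

    @abstractmethod
    def collect(self) -> list[dict[str, Any]]:
        """Return the metric's current data points."""


class MetricRegistry:
    """Process-wide singleton that holds every registered metric."""

    _instance: MetricRegistry | None = None
    _lock = threading.Lock()

    def __new__(cls) -> MetricRegistry:
        with cls._lock:
            if cls._instance is None:
                instance = super().__new__(cls)
                instance._metrics = {}
                cls._instance = instance
        return cls._instance

    def register(self, metric: Metric) -> None:
        # Hypothetical rule: (namespace, name) must be unique.
        key = (metric.namespace, metric.name)
        with self._lock:
            if key in self._metrics:
                raise ValueError(f"metric {key} is already registered")
            self._metrics[key] = metric

    def collect(self) -> list[dict[str, Any]]:
        # Flatten the data points of all registered metrics into one list.
        with self._lock:
            metrics = list(self._metrics.values())
        return [point for metric in metrics for point in metric.collect()]


class CounterSketch(Metric):
    """Hypothetical unlabeled counter that registers itself on creation."""

    def __init__(self, namespace: str, name: str) -> None:
        super().__init__(namespace, name)
        self._count = 0
        MetricRegistry().register(self)

    def increment(self, value: int = 1) -> None:
        if value <= 0:
            raise ValueError("increment value must be positive")
        self._count += value

    def collect(self) -> list[dict[str, Any]]:
        # Assumption: a counter that was never incremented emits nothing.
        if self._count == 0:
            return []
        return [{"namespace": self.namespace, "name": self.name, "value": self._count}]
```

In this sketch, instantiating Metric directly raises a TypeError because collect() is abstract, which is one way to enforce the method on every metric type. Calling MetricRegistry() anywhere in the process returns the same object, so a publisher can call collect() once to gather all data points.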

Usage Examples

Basic Counter
A Counter tracks the occurrences of an event. It can be used without labels for simple counting:

from localstack.utils.analytics.metrics import Counter

# Define a counter
http_requests = Counter(namespace="http", name="requests")

# Increment by 1 (default), then by 3
http_requests.increment()
http_requests.increment(value=3)

# Reset
http_requests.reset()

Counters with Labels
Counters can include labels to categorize events more effectively:

from localstack.utils.analytics.metrics import Counter

# Define a counter with labels
chaos_invocations = Counter(namespace="chaos", name="invocations", labels=["operation"])

# Increment
chaos_invocations.labels(operation="fault").increment(value=3)
chaos_invocations.labels(operation="network-effect").increment(value=4)

# Reset
chaos_invocations.labels(operation="fault").reset()
chaos_invocations.labels(operation="network-effect").reset()
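
The labels(...) call above can be read as selecting one time series per combination of label values. The following is a rough sketch of that idea under stated assumptions, not the implementation in this PR: LabeledCounterSketch, _BoundCounter, the collect() payload shape, and the validation rules are hypothetical. Only the limit of 8 labels and the labels(...).increment() / reset() call pattern come from the description.

```python
from __future__ import annotations

from collections import defaultdict
from typing import Any

MAX_LABELS = 8  # limit stated in the PR description


class _BoundCounter:
    """Hypothetical view of a counter bound to one combination of label values."""

    def __init__(self, counts: dict[tuple, int], key: tuple) -> None:
        self._counts = counts
        self._key = key

    def increment(self, value: int = 1) -> None:
        if value <= 0:
            raise ValueError("increment value must be positive")
        self._counts[self._key] += value

    def reset(self) -> None:
        self._counts[self._key] = 0


class LabeledCounterSketch:
    """Hypothetical counter that keeps one count per label-value combination."""

    def __init__(self, namespace: str, name: str, labels: list[str]) -> None:
        if len(labels) > MAX_LABELS:
            raise ValueError(f"at most {MAX_LABELS} labels are allowed")
        if len(set(labels)) != len(labels):
            raise ValueError("label names must be unique")
        self.namespace = namespace
        self.name = name
        self._label_names = tuple(labels)
        self._counts: dict[tuple, int] = defaultdict(int)

    def labels(self, **label_values: Any) -> _BoundCounter:
        # Every declared label must be supplied, and nothing else.
        if set(label_values) != set(self._label_names):
            raise ValueError(f"expected labels {self._label_names}")
        key = tuple(label_values[name] for name in self._label_names)
        return _BoundCounter(self._counts, key)

    def collect(self) -> list[dict[str, Any]]:
        # One data point per label combination; zero counts are skipped
        # (an assumption of this sketch).
        return [
            {
                "namespace": self.namespace,
                "name": self.name,
                "value": count,
                "labels": dict(zip(self._label_names, key)),
            }
            for key, count in self._counts.items()
            if count > 0
        ]
```

Because each data point carries its labels as explicit key-value pairs, a consumer can filter or group by operation without parsing the metric name.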

Testing

Unit Tests (No External Services Required)

The test suite includes unit tests that validate metric collection and registry behavior without requiring event publishing to the analytics backend.

End-to-End Testing with the Analytics Backend

For a more in-depth test, follow these steps:

Run the Analytics Backend Locally

Start the analytics backend to capture and process metric events.
Ensure that environment variables are correctly set to send events to the data platform.

Start LocalStack Core with Analytics Enabled

Start LocalStack Core with the appropriate environment variables to enable analytics and send events to the local backend:

ANALYTICS_API="http://localhost:8000/v1" \
DEBUG=1 \
DEBUG_ANALYTICS=1 \
python -m localstack.cli.main start --host

TODO

What's left to do:

Instrument Chaos API - PR.
Complete internal documentation on best practices for metrics and label naming.

localstack-bot

Welcome to LocalStack! Thanks for raising your first Pull Request and landing your contribution. Our team will reach out shortly with any reviews or feedback. We recommend joining our Slack Community and sharing your PR on the #community channel. Please make sure you are following our contributing guidelines and our Code of Conduct.

localstack-bot · 2025-02-05T11:35:35Z

All contributors have signed the CLA ✍️ ✅
Posted by the CLA Assistant Lite bot.

github-actions · 2025-02-05T12:47:03Z

LocalStack Community integration with Pro

Files: 2 (±0) | Suites: 2 (±0) | Duration: 1h 54m 20s (+4m 21s)
Tests: 4 104 (+1) | ✅ 3 772 (+2) | 💤 332 (-1) | ❌ 0 (±0)
Runs: 4 106 (+1) | ✅ 3 772 (+2) | 💤 334 (-1) | ❌ 0 (±0)

Results for commit 0691017. ± Comparison against base commit f5b247b.

♻️ This comment has been updated with latest results.

vittoriopolverino · 2025-02-07T11:35:21Z

I have read the CLA Document and I hereby sign the CLA