Skip to content

Commit a6bd85d

Browse files
authored
feat: scaletest: add Grafana dashboard for scale testing (#8274)
* feat: scaletest: add Grafana dashboard for scale testing Fixes #7600. * make fmt
1 parent c569528 commit a6bd85d

File tree

2 files changed

+3904
-0
lines changed

2 files changed

+3904
-0
lines changed

scaletest/README.md

Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -88,3 +88,22 @@ A number of utility scripts are provided in `lib`, and are used by `scaletest.sh
8888
- `coder_workspacetraffic.sh`: Runs traffic generation against the loadtest cluster and creates
8989
a monitoring manifest for the traffic generation pod. This pod will restart automatically
9090
after the traffic generation has completed.
91+
92+
## Grafana Dashboard
93+
94+
A sample Grafana dashboard is provided in `scaletest_dashboard.json`. This dashboard is intended
95+
to be imported into an existing Grafana instance. It provides a number of useful metrics:
96+
97+
- **Control Plane Resources**: CPU, memory, and network usage for the Coder deployment, as well as the number of pod restarts.
98+
- **Database**: Rows inserted/updated/deleted/returned, active connections, and transactions per second. Fine-grained `sqlQuerier` metrics are provided for Coder's database as well, broken down my query method.
99+
- **HTTP requests**: Number of HTTP requests per second, broken down by status code and path.
100+
- **Workspace Resources**: CPU, memory, and network usage for all workspaces.
101+
- **Workspace Agents**: Workspace agent network usage, connection latency, and number of active connections.
102+
- **Workspace Traffic**: Statistics related to workspace traffic generation.
103+
- **Internals**: Provisioner job timings, concurrency, workspace builds, and AuthZ duration.
104+
105+
A subset of these metrics may be useful for a production deployment, but some are only useful
106+
for load testing.
107+
108+
> **Note:** in particular, `sqlQuerier` metrics produce a large number of time series and may cause
109+
> increased charges in your metrics provider.

0 commit comments

Comments
 (0)