feat: scaletest: scale down nodegroups by default #8276


Merged
johnstcn merged 4 commits into main from cj/scaletest-scaledown on Jun 30, 2023

Conversation

@johnstcn (Member) commented on Jun 30, 2023

This PR modifies the scaletest Terraform to allow scaling down the cluster by setting -var state=stopped. This should save some time when performing scaletests, as we no longer have to wait for the cloudsql database and other resources to be created.

By default, scaletest.sh will scale down the nodepools unless the --destroy argument is passed.
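
For illustration, a minimal sketch of how a state variable can gate the node pool size; the node pool resource, its name, and the "started" node count below are assumptions for this sketch, not the PR's actual code:

variable "state" {
  description = "Scaletest cluster state."
  type        = string
  default     = "started"

  validation {
    condition     = contains(["started", "stopped"], var.state)
    error_message = "state must be either \"started\" or \"stopped\"."
  }
}

# Hypothetical node pool: scale to zero when stopped, so the cluster and
# cloudsql database are kept but no nodes (and no node cost) remain.
resource "google_container_node_pool" "workspaces" {
  name       = "workspaces"
  cluster    = google_container_cluster.primary.name
  node_count = var.state == "stopped" ? 0 : 3
}

With something like this in place, scaletest.sh can apply -var state=stopped to scale down, rather than destroying everything, whenever --destroy is not passed.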

@johnstcn self-assigned this on Jun 30, 2023
@mtojek (Member) left a comment

Why do I have a feeling that it performs more than just starting with nodecount = 0 :)

@johnstcn (Member, Author) replied:

> Why do I have a feeling that it performs more than just starting with nodecount = 0 :)

There were some... interesting gotchas. The main thing I found is that when you try to delete a namespace in GKE with no active nodepools, the namespace will hang in the Deleting state, probably due to some finalizers that haven't run.

I spent a while chasing the 'right' way to ignore the resource deletion in Terraform before deciding to just use a null_resource for creating the namespaces and moving on with my life.
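
A minimal sketch of that null_resource approach; the resource label, namespace name, and kubeconfig wiring are assumptions for illustration, not the PR's exact code:

# Create the namespace imperatively so Terraform never owns its deletion;
# with no active nodepools, a provider-driven delete would hang in the
# Deleting state waiting on finalizers.
resource "null_resource" "coder_namespace" {
  depends_on = [null_resource.cluster_kubeconfig]

  triggers = {
    namespace  = "coder"
    kubeconfig = local.cluster_kubeconfig_path
  }

  provisioner "local-exec" {
    command = <<EOF
KUBECONFIG=${self.triggers.kubeconfig} kubectl create namespace ${self.triggers.namespace} --dry-run=client -o yaml | KUBECONFIG=${self.triggers.kubeconfig} kubectl apply -f -
EOF
  }

  # Deliberately no destroy-time provisioner: the namespace disappears with
  # the nodepools/cluster, so Terraform never tries (and fails) to delete it.
}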

Comment on lines +159 to +173
max_attempts=10
for attempt in $(seq 1 $max_attempts); do
  maybedryrun "$DRY_RUN" curl --silent --fail --output /dev/null "${SCALETEST_CODER_URL}/api/v2/buildinfo"
  curl_status=$?
  if [[ $curl_status -eq 0 ]]; then
    break
  fi
  if [[ $attempt -eq $max_attempts ]]; then
    echo
    echo "Coder deployment failed to become ready in time!"
    exit 1
  fi
  echo "Coder deployment not ready yet (${attempt}/${max_attempts}), sleeping 3 seconds"
  maybedryrun "$DRY_RUN" sleep 3
done
@johnstcn (Member, Author)

review: there is a race condition between the rollout status returning true and the service actually becoming ready; so I'm just going back to curl :-)

Comment on lines +132 to +152
resource "null_resource" "cluster_kubeconfig" {
depends_on = [google_container_cluster.primary]
triggers = {
path = local.cluster_kubeconfig_path
name = google_container_cluster.primary.name
project_id = var.project_id
zone = var.zone
}
provisioner "local-exec" {
command = <<EOF
KUBECONFIG=${self.triggers.path} gcloud container clusters get-credentials ${self.triggers.name} --project=${self.triggers.project_id} --zone=${self.triggers.zone}
EOF
}

provisioner "local-exec" {
when = destroy
command = <<EOF
rm -f ${self.triggers.path}
EOF
}
}
@johnstcn (Member, Author)

review: this was previously being created when applying prom monitoring manifests; moved it to its own resource declaration here.

@johnstcn merged commit 1e8cc2c into main on Jun 30, 2023
@johnstcn deleted the cj/scaletest-scaledown branch on June 30, 2023 15:07
@github-actions bot locked and limited conversation to collaborators on Jun 30, 2023