Research and document how to gracefully shut down agents using different terraform providers (update templates)

To ensure workspaces (and agents) are stopped gracefully, we want to make sure Terraform providers are configured in a way that tries (best-effort) to respect the agent shutdown procedure.

What does graceful shutdown mean? It means the provider issues a `kill -INT [coder_agent_pid]` (one of `HUP`, `INT`, `TERM`) when the resource is stopped. After a specified timeout, it may issue `kill -KILL [coder_agent_pid]` to ensure de-provision progresses.

**Note:** This will not be the primary form of ensuring an agent is stopped gracefully. In the ideal scenario, this happens in a pre-provision step and this acts as a safety net (https://github.com/coder/coder/issues/6175).

We need to figure out how to perform graceful shutdown with (or verify that it works):

- [x] [kreuzwerker/terraform-provider-docker](https://github.com/kreuzwerker/terraform-provider-docker) (see below)
- [ ] [hashicorp/terraform-provider-aws](https://github.com/hashicorp/terraform-provider-aws)
	- Is there a better option than trying to gracefully interrupt in user_data script?
	- Container [should work out-of-the-box](https://aws.amazon.com/blogs/containers/graceful-shutdowns-with-ecs/), timeout can be controlled via `stopTimeout`/ `ECS_CONTAINER_STOP_TIMEOUT`
	- Related: #5901
- [ ] [hashicorp/terraform-provider-azurerm](https://github.com/hashicorp/terraform-provider-azurerm)
- [ ] [digitalocean/terraform-provider-digitalocean](https://github.com/digitalocean/terraform-provider-digitalocean)
	- Droplets
	- K8S
- [ ] [hashicorp/terraform-provider-google](https://github.com/hashicorp/terraform-provider-google)
- [x] [hashicorp/terraform-provider-kubernetes](https://github.com/hashicorp/terraform-provider-kubernetes)

Eventually:
- [ ] Update documentation and example templates

Related: #4677, #5914, #6139

---

**Docker:**
```tf
resource "docker_container" "workspace" {
  destroy_grace_seconds = 30
  stop_timeout          = 30
  stop_signal           = "SIGINT"
}
```

**EDIT:** It looks like `destroy_grace_seconds` is required to trigger a graceful stop, `stop_{timeout,signal}` is insufficient alone. It may be that `stop_{timeout,signal}` can be omitted in normal use-cases though (needs verification).

- [`destroy_grace_seconds`](https://registry.terraform.io/providers/kreuzwerker/docker/latest/docs/resources/container#destroy_grace_seconds)
- [`stop_signal`](https://registry.terraform.io/providers/kreuzwerker/docker/latest/docs/resources/container#stop_signal)
- [`stop_timeout`](https://registry.terraform.io/providers/kreuzwerker/docker/latest/docs/resources/container#stop_timeout)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Research and document how to gracefully shut down agents using different terraform providers (update templates) #6174

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Research and document how to gracefully shut down agents using different terraform providers (update templates) #6174

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions