feat(agent/agentcontainers): recreate devcontainers concurrently #18042

mafredri · 2025-05-26T11:43:36Z

This change refactors the devcontainer recreation logic, which is now
handled asynchronously rather than being request scoped. The response
was consequently changed from 204 "No Content" to 202 "Accepted" to
reflect this.

A new `Status` field was introduced to the devcontainer struct which
replaces `Running` (bool). This reflects that the devcontainer can now
be in various states (starting, running, stopped or errored).

The status field also protects against multiple concurrent recreations,
as long as they are initiated via the API.

Updates #16424

johnstcn · 2025-05-26T12:46:45Z

coderd/workspaceagents.go

 // @Param workspaceagent path string true "Workspace agent ID" format(uuid)
 // @Param container path string true "Container ID or name"
-// @Success 204
+// @Success 202 {object} codersdk.Response


Note: the response code change here does not appear to affect any endpoints currently in use by the FE.

Correct, not yet.

codersdk/workspaceagents.go

johnstcn · 2025-05-26T12:50:35Z

agent/agentcontainers/api_test.go

+				err = json.NewDecoder(rec.Body).Decode(&resp)
+				require.NoError(t, err, "unmarshal response failed after recreation")
+				require.Len(t, resp.Devcontainers, 1, "expected one devcontainer in response after recreation")
+				assert.Equal(t, codersdk.WorkspaceAgentDevcontainerStatusRunning, resp.Devcontainers[0].Status, "devcontainer is not stopped after recreation")


nit: this could also be a require?

johnstcn · 2025-05-26T12:50:43Z

agent/agentcontainers/api_test.go

+					err = json.NewDecoder(rec.Body).Decode(&resp)
+					require.NoError(t, err, "unmarshal response failed after error")
+					require.Len(t, resp.Devcontainers, 1, "expected one devcontainer in response after error")
+					assert.Equal(t, codersdk.WorkspaceAgentDevcontainerStatusError, resp.Devcontainers[0].Status, "devcontainer is not in an error state after up failure")


nit: this could also be a require?

To what benefit? Assert is useful in that it can allow multiple conditions to fail and give you a larger picture of what's wrong. Require is ofc needed whenever you need that condition to be true, like nil and length checks, so that the rest of the code doesn't panic.

My nit is very much stylistic here; having an assert on the last line below one or more require doesn't help us against panics or any of the above you mentioned. Feel free to ignore!
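
To make the distinction in this thread concrete, here is a stdlib-only model of the two behaviors. It does not use testify: `recorder`, `assertEqual`, `requireEqual` and `run` are hypothetical helpers that mimic how testify's `assert` reports through `Errorf` and continues, while `require` additionally calls `FailNow`, which stops the test goroutine.

```go
package main

import (
	"fmt"
	"runtime"
)

// recorder is a hypothetical stand-in for *testing.T.
type recorder struct{ failures []string }

func (r *recorder) Errorf(format string, args ...any) {
	r.failures = append(r.failures, fmt.Sprintf(format, args...))
}

// FailNow stops the calling goroutine, as testing.T.FailNow does.
func (r *recorder) FailNow() { runtime.Goexit() }

// assertEqual records a failure and lets the test body continue.
func assertEqual(r *recorder, want, got any, msg string) bool {
	if want != got {
		r.Errorf("%s: want %v, got %v", msg, want, got)
		return false
	}
	return true
}

// requireEqual records a failure and stops the test body immediately.
func requireEqual(r *recorder, want, got any, msg string) {
	if !assertEqual(r, want, got, msg) {
		r.FailNow()
	}
}

// run executes body in its own goroutine and returns the recorded failures.
func run(body func(r *recorder)) []string {
	r := &recorder{}
	done := make(chan struct{})
	go func() {
		defer close(done) // still runs when FailNow calls runtime.Goexit
		body(r)
	}()
	<-done
	return r.failures
}

func main() {
	statuses := []string{} // an empty response, so both checks below fail

	asserted := run(func(r *recorder) {
		assertEqual(r, 1, len(statuses), "length")
		assertEqual(r, "running", "error", "status") // still evaluated
	})
	required := run(func(r *recorder) {
		requireEqual(r, 1, len(statuses), "length")
		_ = statuses[0] // would panic, but is never reached
	})

	fmt.Println(len(asserted), len(required)) // 2 1
}
```

With the length check as a require, a trailing check on the last line behaves the same either way, which is why the nit is purely stylistic.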

johnstcn · 2025-05-26T12:56:13Z

agent/agentcontainers/api.go

+// The devcontainer state must be set to starting and the recreateWg must be
+// incremented before calling this function.
+func (api *API) recreateDevcontainer(dc codersdk.WorkspaceAgentDevcontainer, configPath string) {
+	defer api.recreateWg.Done()


I'm nervous that a hanging request to recreateDevcontainer could make closing the devcontainers API hang on api.wg.Wait(). I wonder if it makes sense to have a 'graceful' shutdown context and a 'hard' shutdown context that will forcefully cancel all in-flight recreation waitgroups after a certain period of time elapses?

This uses api.ctx which is essentially that "hard" shutdown context, there isn't any reason to have a graceful one AFAICT.

github-actions bot assigned mafredri May 26, 2025

mafredri force-pushed the mafredri/feat-agent-agentcontainers-recreate-async branch 4 times, most recently from d17bca2 to 0ebc262 Compare May 26, 2025 12:02

mafredri force-pushed the mafredri/feat-agent-agentcontainers-recreate-async branch from 0ebc262 to 9328228 Compare May 26, 2025 12:02

mafredri requested a review from Copilot May 26, 2025 12:04

appease linter and standardize slog fields

78b0159

mafredri requested review from johnstcn and DanielleMaywood May 26, 2025 12:24

johnstcn reviewed May 26, 2025

mafredri marked this pull request as ready for review May 26, 2025 14:07

johnstcn approved these changes May 26, 2025

mafredri merged commit 0731304 into main May 26, 2025
34 of 36 checks passed

mafredri deleted the mafredri/feat-agent-agentcontainers-recreate-async branch May 26, 2025 15:30

github-actions bot locked and limited conversation to collaborators May 26, 2025
