fix(cli): port-forward: update workspace last_used_at #12659

Merged: 21 commits merged into main from cj/cli-workspace-heartbeat on Mar 20, 2024

Conversation

@johnstcn (Member) commented Mar 19, 2024:

Fixes #12431

For the CLI to report workspace usage, we currently depend on the agent sending a stats report with a non-zero connection count; this is sourced from SetConnStatsCallback (tailnet/conn.go:621).
The problem with this approach is that Tailscale's connection tracking appears to track only "connection established" events, and is not equivalent to running e.g. ss -plunt. This means:

  • Having an open port-forward on its own will not be detected as an 'active connection'
  • Having an open port-forward with a single long-lived connection (e.g. a websocket) will be counted as only one activity datapoint, and subsequent stats-collection intervals will ignore that still-active connection.

This PR updates the coder port-forward command to periodically inform coderd that the workspace is being used:

  • Adds workspaceusage.Tracker which periodically batch-updates workspace LastUsedAt (see the sketch after this list)
  • Adds coderd endpoint to signal workspace usage
  • Updates coder port-forward to periodically hit this endpoint
  • Modifies BatchUpdateWorkspacesLastUsedAt to avoid overwriting with stale data
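
As a minimal sketch of the mark-and-flush pattern (not the PR's exact code: batchUpdater stands in for coderd's database store, and the field names are illustrative assumptions), the tracker looks roughly like this:

package workspaceusage

import (
	"context"
	"sync"
	"time"

	"github.com/google/uuid"
)

// batchUpdater stands in for the real database store.
type batchUpdater interface {
	BatchUpdateWorkspacesLastUsedAt(ctx context.Context, ids []uuid.UUID, lastUsedAt time.Time) error
}

// Tracker accumulates "this workspace was used" marks in memory and
// flushes them to the database in a single batch on every tick.
type Tracker struct {
	mu     sync.Mutex
	ids    map[uuid.UUID]struct{} // marked since the last flush
	db     batchUpdater
	tickCh <-chan time.Time
	doneCh chan struct{}
}

// Add marks a workspace as used. It is cheap enough to call on every
// request that signals usage.
func (t *Tracker) Add(id uuid.UUID) {
	t.mu.Lock()
	defer t.mu.Unlock()
	t.ids[id] = struct{}{}
}

// flush writes all marked workspaces in one batch and resets the set.
// A failed batch is dropped rather than retried: workspaces still in
// use will simply be re-marked before the next tick.
func (t *Tracker) flush(ctx context.Context, now time.Time) {
	t.mu.Lock()
	ids := make([]uuid.UUID, 0, len(t.ids))
	for id := range t.ids {
		ids = append(ids, id)
	}
	t.ids = make(map[uuid.UUID]struct{})
	t.mu.Unlock()
	if len(ids) == 0 {
		return
	}
	// The real code logs this error instead of discarding it.
	_ = t.db.BatchUpdateWorkspacesLastUsedAt(ctx, ids, now)
}

// loop flushes on every tick until the tracker is closed.
func (t *Tracker) loop(ctx context.Context) {
	for {
		select {
		case <-t.doneCh:
			return
		case now := <-t.tickCh:
			t.flush(ctx, now)
		}
	}
}

The real BatchUpdateWorkspacesLastUsedAt query additionally guards against stale data, presumably by only updating rows whose last_used_at is older than the new timestamp.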

In later PRs, we can:

  • Update coder ssh to also use this behaviour,
  • Update the workspace activity tracking package to also call ActivityBumpWorkspace so that last_used_at and deadline are handled in the same place,
  • Remove the existing behaviour of updating the workspace when stats are received from the agent.

Follow-ups:

  • We may need to ensure that multiple port-forwards to the same workspace only result in one coder process on a user's workstation updating the workspace at a time. My assumption here is that most users will have at most one or two port-forwards running at a time.
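
For context, the client side of this is a per-process heartbeat loop along these lines (a rough sketch; the codersdk method name, its signature, and the interval are assumptions, not necessarily the PR's exact code):

// Inside `coder port-forward`, assuming ctx, client, workspace, and
// logger from the surrounding command.
ticker := time.NewTicker(time.Minute)
defer ticker.Stop()
go func() {
	for {
		select {
		case <-ctx.Done():
			return
		case <-ticker.C:
			// Tell coderd the workspace is in use; failures are logged
			// and the next tick simply tries again.
			if err := client.PostWorkspaceUsage(ctx, workspace.ID); err != nil {
				logger.Warn(ctx, "update workspace usage", slog.Error(err))
			}
		}
	}
}()

Each port-forward process would run its own loop, which is why deduplicating updates across processes is left as a follow-up.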

@johnstcn self-assigned this Mar 19, 2024
@johnstcn changed the title from "feat: tunnel: update workspace last_used_at" to "feat(cli): port-forward: update workspace last_used_at" Mar 19, 2024
@johnstcn force-pushed the cj/cli-workspace-heartbeat branch 2 times, most recently from 17cb933 to 2412fb5 March 19, 2024 13:20
@johnstcn changed the title from "feat(cli): port-forward: update workspace last_used_at" to "fix(cli): port-forward: update workspace last_used_at" Mar 19, 2024
@johnstcn force-pushed the cj/cli-workspace-heartbeat branch from 2412fb5 to c99327c March 19, 2024 15:16
@johnstcn marked this pull request as ready for review March 19, 2024 15:32
@dannykopping (Contributor) left a comment:

Mostly nits & small {que,sugge}stions
Very clean! LGTM

johnstcn and others added 2 commits March 20, 2024 09:10
Co-authored-by: Danny Kopping <danny@coder.com>
@mtojek (Member) left a comment:

High-level review first

	LastUsedAt: now,
	IDs:        ids,
}); err != nil {
	wut.log.Error(ctx, "failed updating workspaces last_used_at", slog.F("count", count), slog.Error(err))
Member:

Is there any retry mechanism if the batch fails?

Member Author (@johnstcn):

There is a built-in assumption here that any failure of a single batch can be ignored as it gets retried frequently. I suppose consecutive failures should surface some form of error somewhere. I wonder if the database health page could show a summary of queries that have failed multiple consecutive executions?

Member:

> There is a built-in assumption here that any failure of a single batch can be ignored as it gets retried frequently. I suppose consecutive failures should surface some form of error somewhere

I might have missed it, but it would be cool to leave a comment stating this.

Member Author (@johnstcn):

Done
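
(For the record, the documented assumption presumably boils down to something like the following; paraphrased, not the exact comment text from the PR:)

// A single failed flush is acceptable: usage is tracked continuously
// and flushed frequently, so the last_used_at bump is only delayed
// until a subsequent flush succeeds.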

@mtojek (Member) left a comment:

Nice!

@dannykopping (Contributor) left a comment:

Awesome 👍

db, ps = dbtestutil.NewDB(t)
wuTick = make(chan time.Time)
wuFlush = make(chan int, 1)
wut = workspaceusage.New(db, workspaceusage.WithFlushChannel(wuFlush), workspaceusage.WithTickChannel(wuTick))
Member:

jiffy

Member Author (@johnstcn):

wat

// WithFlushChannel allows passing a channel that receives
// the number of marked workspaces every time Tracker flushes.
// For testing only and will panic if used outside of tests.
func WithFlushChannel(c chan int) Option {
	if flag.Lookup("test.v") == nil {
Member:

Nice strictness check! Feels like this could be a testutil func, even if obvious.
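
Something like this hypothetical testutil helper would capture it (the name and location are assumptions):

// MustBeInTest panics unless the binary is running under `go test`.
// The testing package registers the -test.v flag, so its absence
// means we are in a production binary.
func MustBeInTest(name string) {
	if flag.Lookup("test.v") == nil {
		panic("developer error: " + name + " must only be used in tests")
	}
}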

// Calling Loop after Close() is an error.
select {
case <-wut.doneCh:
	panic("developer error: Loop called after Close")
Member:

If we need to do this instead of just returning, I'd prefer returning an error here; I wouldn't want this to panic in a production instance after some future change to the code.

Feels like this mostly exists for tests (otherwise it could be launched from New), so all the more reason not to add the possibility of a panic.

Member Author (@johnstcn):

If we return an error, then we have two options:

  1. Exit with the error, which brings us back to potentially impacting prod, but just in a nicer-looking way.
  2. Just log the error and do nothing, which means the error will likely pass unnoticed until folks wonder why their workspace usage isn't being tracked.

I don't think 2) is a valid option. We're not guaranteed to catch it in tests, especially given the possibility of ignoring errors in slogtest.

There's an alternative possibility 1a) where we check the error and recreate the tracker if non-nil. WDYT?

Member Author (@johnstcn):

I forgot about option 3): have New start the loop and unexport the function.
Now it's un-possible!
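
For reference, option 3) amounts to having New own the goroutine, adapting the earlier sketch (field names and the default interval are still assumptions, not the PR's exact code):

func New(db batchUpdater, opts ...Option) *Tracker {
	t := &Tracker{
		ids:    make(map[uuid.UUID]struct{}),
		db:     db,
		doneCh: make(chan struct{}),
		tickCh: time.NewTicker(time.Minute).C, // default; WithTickChannel overrides in tests
	}
	for _, opt := range opts {
		opt(t)
	}
	// loop is unexported and started exactly once, here; callers can
	// no longer call it at all, let alone after Close.
	go t.loop(context.Background())
	return t
}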

Comment on lines 312 to 329
// Did last_used_at not update? Scratching your noggin? Here's why.
// Workspace usage tracking must be triggered manually in tests.
// The vast majority of existing tests do not depend on last_used_at
// and adding an extra background goroutine to all existing tests may
// lead to future flakes and goleak complaints.
// To do this, pass in your own WorkspaceUsageTracker like so:
//
// db, ps = dbtestutil.NewDB(t)
// wuTick = make(chan time.Time)
// wuFlush = make(chan int, 1)
// wut = workspaceusage.New(db, workspaceusage.WithFlushChannel(wuFlush), workspaceusage.WithTickChannel(wuTick))
// client = coderdtest.New(t, &coderdtest.Options{
//   WorkspaceUsageTracker: wut,
//   Database:              db,
//   Pubsub:                ps,
// })
//
// See TestPortForward for how this works in practice.
Member:

Lol, love the personality in this comment 😄

@johnstcn merged commit 92aa1eb into main Mar 20, 2024
@johnstcn deleted the cj/cli-workspace-heartbeat branch March 20, 2024 16:44
github-actions bot locked and limited conversation to collaborators Mar 20, 2024
Successfully merging this pull request may close these issues.

coder tunnel does not update workspace last_used with long-lived connections