feat(agent): add connection reporting for SSH and reconnecing PTY #16652

mafredri · 2025-02-21T13:23:15Z

To allow merging, I've placed the reporting behind an experimental flag on the agent (--experimental-connection-reports-enable).

See #15139 for how these are represented in the UI (and plans for updating the UI).

Updates #15139

johnstcn · 2025-02-24T13:27:41Z

agent/agent.go

+			a.reportConnectionsMu.Lock()
+			if len(a.reportConnections) == 0 {
+				a.reportConnectionsMu.Unlock()
+				break


Do we need a label here for clarity? break will always break the innermost loop but I always have trouble remembering that personally.

If you review in VS Code, the syntax highlighting can be helpful here!

I only use labels when I need to. I don't think we have examples in our code for labels breaking the inner loop. Do we?

You wrote one a while back ;-) https://github.com/coder/coder/pull/14578/files#diff-f8b1ec0d615f1374c7f54e81c7871b337f92f8749e5608a155227c71160fafc8R48

You got me 😄 ❤️

agent/agent.go

johnstcn · 2025-02-24T13:36:35Z

agent/agentssh/agentssh.go

+}
+
+func (s *sessionCloseTracker) Close() error {
+	s.track(1)


What does 1 mean in this instance?

Same as a shell script exiting with generic error (1). Calling Close instead of Exit on the session indicates an "error" state (not error as in log, but error as in session didn't go right).

johnstcn · 2025-02-24T13:37:19Z

cli/agent.go

+		{
+			Flag:        "experimental-connection-reports-enable",
+			Hidden:      true,
+			Default:     "false",
+			Env:         "CODER_AGENT_EXPERIMENTAL_CONNECTION_REPORTS_ENABLE",
+			Description: "Enable experimental connection reports.",
+			Value:       serpent.BoolOf(&experimentalConnectionReports),
+		},


Should this be something that gets set as a deployment-wide experiment and is automatically set in the agent manifest?

The reason this is an (temporary) experiment is simply to not mess up the audit log UI until it is updated. I'm not aware that this needs to be placed behind an experiment deployment wide.

Just an alternative suggestion!

Emyrk · 2025-02-24T14:41:57Z

agent/agent.go

+			a.reportConnectionsMu.Lock()
+			if len(a.reportConnections) == 0 {
+				a.reportConnectionsMu.Unlock()
+				break


I only use labels when I need to. I don't think we have examples in our code for labels breaking the inner loop. Do we?

Emyrk · 2025-02-24T14:44:59Z

agent/agent.go

+
+			// Remove the payload we sent.
+			a.reportConnectionsMu.Lock()
+			a.reportConnections = a.reportConnections[1:]


Why not just make reportConnections channel? If it's an append only slice that is only read in this function.

This slice behavior is correct, just feels like a weaker implementation of a channel.

I guess it could be a channel, but how big to make it? How many in-flight reports is "too many"?

That's fair, a channel would not be a terrible option here. It requires upfront allocation which can either be a good or a bad thing in memory constrained systems. For now I've limited this to 2048 reports pending, or about 300KB. We can revisit this later if needed.

Emyrk · 2025-02-24T14:45:21Z

agent/agent.go

+	a.reportConnections = append(a.reportConnections, &proto.ReportConnectionRequest{
+		Connection: &proto.Connection{
+			Id:         id[:],
+			Action:     proto.Connection_CONNECT,
+			Type:       connectionType,
+			Timestamp:  timestamppb.New(time.Now()),
+			Ip:         ip,
+			StatusCode: 0,
+			Reason:     nil,
+		},
+	})


Do we have to worry about the size of this slice?

Typically no. The only way these accumulate is if we have lost connection to coders. I could add some safe-guards around this to drop messages, but since it's part of auditing that feels a bit wrong. WDYT?

I'll defer to your judegment. If you feel it is relatively bounded, then it's good with me. Maybe leave a comment of the assumptions?

I did not spend that long understanding the full context of the code.

johnstcn

Approving to unblock. I'm not particularly pushed on channel versus slice since this is behind an experimental flag and not enabled by default.

github-actions bot assigned mafredri Feb 21, 2025

This comment was marked as resolved.

Sign in to view

mafredri force-pushed the mafredri/feat-add-agent-connection-reporting branch from 1b8c15c to cd31d7e Compare February 21, 2025 13:28

feat(agent): add connection reporting for SSH and reconnecing PTY

58463c6

Updates #15139

mafredri force-pushed the mafredri/feat-add-agent-connection-reporting branch from cd31d7e to 58463c6 Compare February 21, 2025 14:07

mafredri mentioned this pull request Feb 24, 2025

Audit IDE connections and app opens #15139

Closed

mafredri marked this pull request as ready for review February 24, 2025 13:18

mafredri requested review from johnstcn, spikecurtis and Emyrk and removed request for johnstcn February 24, 2025 13:20

chore: put connection reports behind experimental flag

a77ceac

mafredri force-pushed the mafredri/feat-add-agent-connection-reporting branch from 4f934c8 to a77ceac Compare February 24, 2025 13:25

johnstcn reviewed Feb 24, 2025

View reviewed changes

mafredri added 2 commits February 24, 2025 13:59

simplify

17ddb8e

net.SplitHostPort

82b6fab

Emyrk reviewed Feb 24, 2025

View reviewed changes

johnstcn approved these changes Feb 24, 2025

View reviewed changes

mafredri mentioned this pull request Feb 25, 2025

feat(agent): wire up agentssh server to allow exec into container #16638

Merged

mafredri added 2 commits February 27, 2025 09:50

Merge branch 'main' into mafredri/feat-add-agent-connection-reporting

4166c29

add buffer limit

0318212

mafredri force-pushed the mafredri/feat-add-agent-connection-reporting branch from c77b24e to 0318212 Compare February 27, 2025 10:20

mafredri added 2 commits February 27, 2025 10:27

release pointer

d47e8a4

typo

601b6f4

mafredri enabled auto-merge (squash) February 27, 2025 10:37

mafredri merged commit 4ba5a8a into main Feb 27, 2025
30 checks passed

mafredri deleted the mafredri/feat-add-agent-connection-reporting branch February 27, 2025 10:45

github-actions bot locked and limited conversation to collaborators Feb 27, 2025

feat(agent): add connection reporting for SSH and reconnecing PTY #16652

feat(agent): add connection reporting for SSH and reconnecing PTY #16652

Uh oh!

Conversation

mafredri commented Feb 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as resolved.

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

johnstcn left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

mafredri commented Feb 21, 2025 •

edited

Loading