feat: adds callback call when connection ends #4

spikecurtis · 2023-06-20T12:43:55Z

I want to add logging to the Coder Agent when ssh connections disconnect, including the reason for disconnection. c.f. coder/coder#7961

Currently, our ssh library doesn't expose or log this information, as it only calls the ConnectionFailedCallback if the connection fails during the initial SSH handshake, not if it fails later.

This PR makes it so that if the initial ssh handshake succeeds, then we start a goroutine that waits for the connection to end, and calls the callback. We already log calls to this callback in our server code, but in a separate PR we might want to adjust so that we only log at Info or Debug for "expected" disconnection errors like EOF.

mafredri

I think the purpose of ConnectionFailedCallback is different from what we're trying to do here and repurposing it for established connection errors seems like it could lead to confusion.

Is there a reason we wouldn't want to handle this in agentssh instead? (E.g. get sshConn via ContextKeyConn).

Or perhaps ultimately, a new kind of callback named ConnectionErrorCallback could be added.

mafredri · 2023-06-20T13:01:20Z

server.go

+	if srv.ConnectionFailedCallback != nil {
+		go func() {
+			wErr := sshConn.Wait()
+			srv.ConnectionFailedCallback(conn, wErr)


Instead of spawning a goroutine that can theoretically be left running after the server has closed, we could move this check to after the for loop for ch := range chans { ... }. That channel will be closed anyway before .Wait() here returns.

mtojek

Or perhaps ultimately, a new kind of callback named ConnectionErrorCallback could be added.

As I have spent some time recently diving in coder/ssh, I would recommend researching that approach.

BTW Sometimes the connection doesn't end, but will timeout eventually if there is no network activity (connection read/write timeout). With TCP keep alive enabled it might be hard to determine if the connection ended or if it is a transient network issue.

spikecurtis · 2023-06-21T05:18:45Z

Is there a reason we wouldn't want to handle this in agentssh instead? (E.g. get sshConn via ContextKeyConn).

I do want to handle the logging in agentssh, but there are currently no callbacks at the scope I need. I want to log the connection lifetime & errors, so doing it in a session handler is the wrong scope: a connection could have 0, 1, or several sessions, so we can't easily arrange to log exactly once. There is a "connection" callback, but that has a net.Conn and what I want is the ssh.Conn (net.Conn doesn't have any method to wait for close, nor would it expose the error from the SSH layer).

spikecurtis · 2023-06-21T05:21:26Z

Or perhaps ultimately, a new kind of callback named ConnectionErrorCallback could be added.

As I have spent some time recently diving in coder/ssh, I would recommend researching that approach.

I like this idea. I'll refactor to introduce a new callback.

Signed-off-by: Spike Curtis <spike@coder.com>

spikecurtis · 2023-06-21T05:42:41Z

I went with ConnectionCompleteCallback since "error" sounds like something that gets called for abnormal termination, but I want this to get called for every connection.

mtojek

👍

mafredri

Nice, thanks for making my suggestion even better and also documenting the always err behavior!

spikecurtis requested review from mafredri and mtojek June 20, 2023 12:43

mafredri reviewed Jun 20, 2023

View reviewed changes

mtojek reviewed Jun 20, 2023

View reviewed changes

feat: add ConnectionCompleteCallback

c92d705

Signed-off-by: Spike Curtis <spike@coder.com>

spikecurtis force-pushed the spike/conn-wait-callback branch from 3aef9e1 to c92d705 Compare June 21, 2023 05:40

spikecurtis requested review from mafredri and mtojek June 21, 2023 05:40

mtojek approved these changes Jun 21, 2023

View reviewed changes

mafredri approved these changes Jun 21, 2023

View reviewed changes

spikecurtis merged commit 9a7e234 into master Jun 21, 2023

spikecurtis deleted the spike/conn-wait-callback branch June 21, 2023 09:54

spikecurtis mentioned this pull request Jun 22, 2023

chore: add ssh disconnect log with errors coder/coder#8143

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: adds callback call when connection ends #4

feat: adds callback call when connection ends #4

Uh oh!

spikecurtis commented Jun 20, 2023 •

edited

Loading

Uh oh!

mafredri left a comment

Uh oh!

mafredri Jun 20, 2023

Uh oh!

mtojek left a comment

Uh oh!

spikecurtis commented Jun 21, 2023

Uh oh!

spikecurtis commented Jun 21, 2023

Uh oh!

spikecurtis commented Jun 21, 2023

Uh oh!

mtojek left a comment

Uh oh!

mafredri left a comment

Uh oh!

Uh oh!

feat: adds callback call when connection ends #4

feat: adds callback call when connection ends #4

Uh oh!

Conversation

spikecurtis commented Jun 20, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mafredri left a comment

Choose a reason for hiding this comment

Uh oh!

mafredri Jun 20, 2023

Choose a reason for hiding this comment

Uh oh!

mtojek left a comment

Choose a reason for hiding this comment

Uh oh!

spikecurtis commented Jun 21, 2023

Uh oh!

spikecurtis commented Jun 21, 2023

Uh oh!

spikecurtis commented Jun 21, 2023

Uh oh!

mtojek left a comment

Choose a reason for hiding this comment

Uh oh!

mafredri left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

spikecurtis commented Jun 20, 2023 •

edited

Loading