-
Notifications
You must be signed in to change notification settings - Fork 876
Define: clear path to debugging a broken workspace #1321
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Leaving this one in Community MVP, but removing from switch blocker. This is an issue that causes confusion and hurts UX and resolving this will make it easier to debug and troubleshoot failed provisions, but is also a nice to have for flipping the switch to public in May. |
@tjcran have you explored potential solutions to this problem? Maybe there's something simple we can do that makes it a lot better. Keep in mind this doesn't exist in v1 since once you get the infrastructure working, the configs don't change so it will probably keep working. |
Does being able to dump the Terraform template/state from the CLI seem sufficient for this, or is there more to do here? FWIW, I also recently ran into some issues with my dev environment where this would have been helpful. |
@dwahler State push/pull is a necessary addition to the product, but we have to be careful about leaking secrets. |
Good point. Maybe something that's equivalent to |
We do have
pulls the deployment name for my workspace. This is cumbersome, but perhaps sufficient. |
Setting an agent connection timeout through terraform was a suggestion by @kylecarbs in grooming today. |
Wait for the agent to be up since a user cannot do anything anyway until they are connected was a suggestion from @dwahler |
The timeout should be set via our Terraform provider. Here's an example: resource "coder-agent" "dev" {
timeout = "1m"
} Upon SSH, the user could be notified with a message that the agent timed out which isn't perfect, but is better than spinning forever. |
Moving this to an Epic after our grooming discussion. Tickets needed: Time out on agent connect after a successful build (2 min, 10 min?). This one needs fixed before Community MVP launch.
|
Ok, so I have an old workspace called
ding
, but I can never SSH into it. I've tried restarting it to no avail.The natural thing to do is disassemble the terraform, find the underlying resource, and then check on the agent. While I can list templates, I have no way of presenting the underlying terraform or configuration values.
Lame workaround
The only workaround is asking the Coder admin where the terraform source is and what values they configured the template with. This makes the coder admin a bottleneck, and even they have to use recall instead of information from the Coder product.
The text was updated successfully, but these errors were encountered: