Skip to content

Conversation

spikecurtis
Copy link
Contributor

@spikecurtis spikecurtis commented Sep 11, 2024

fixes #14365

I bet what's going on is that in connectToCoordinatorAndFetchResumeToken() we call Coordinate(), send a message on the Coordinate client and then close it in rapid succession. We don't wait around for a response from the coordinator, so dRPC is likely aborting the call Coordinate() in the backend because the stream is closed before it even gets a chance.

Instead of using the Coordinator to record the peer ID assigned on the API call, we can wrap the resume token provider, since we call that API and wait for a response. This also affords the opportunity to directly assert we get called with the right token.

Copy link
Contributor Author

This stack of pull requests is managed by Graphite. Learn more about stacking.

Join @spikecurtis and the rest of your teammates on Graphite Graphite

@spikecurtis spikecurtis marked this pull request as ready for review September 11, 2024 11:47
@spikecurtis spikecurtis force-pushed the spike/fix-resume-token-flake branch from c2df472 to 9893031 Compare September 11, 2024 11:57
@spikecurtis spikecurtis merged commit 5bd19f8 into main Sep 11, 2024
27 checks passed
@spikecurtis spikecurtis deleted the spike/fix-resume-token-flake branch September 11, 2024 12:32
@github-actions github-actions bot locked and limited conversation to collaborators Sep 11, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

test flake: TestWorkspaceAgentClientCoordinate_ResumeToken
2 participants