test: Handle Filter flake with ctx errors #7119

Emyrk · 2023-04-13T13:58:23Z

johnstcn

This looks right to me, I think this is a tricky flake to hit.
Can we add a test to assert this behaviour?

spikecurtis

A variation on this bug probably exists in all the places in the code we call Authorize

spikecurtis · 2023-04-13T17:21:55Z

coderd/rbac/authz.go

@@ -137,6 +137,9 @@ func Filter[O Objecter](ctx context.Context, auth Authorizer, subject Subject, a
 			err := auth.Authorize(ctx, subject, action, o.RBACObject())
 			if err == nil {
 				filtered = append(filtered, o)
+			} else if ctx.Err() != nil {
+				// Exit early if the error comes from the context
+				return nil, ctx.Err()


This code implicitly assumes that there are only 2 classes of errors that could be returned from auth.Authorize

the authorization completed, but the subject is not authorized

the context is Done

Prior to this change, this code implicitly assumed that there was only 1 class of errors.

It is difficult to be sure that the right answer is 2. Could it be 3 classes? 4?

Honestly, I think a better design would be for auth.Authorize to return two values --- one that tells you whether the subject is authorized, and another to tell you whether or not there was an error checking the authorization.

Failing that, it's generally better for error checking to be explicit and specific:

err := auth.Authorize(...) if xerrors.Is(err, rbac.Unauthorized) { continue } if err != nil { return nil, err } filtered = append(filtered, o)

Actually, thinking about this more, it's not about implicit vs explicit. It's about how the code reacts if Authorize() returns an unexpected error. By only handling the context case, we'd be the in the same boat again of swallowing the unexpected error and treating it like an unauthorized response.

If we instead handle the expected errors, and return any others, we're less likely to incorrectly ignore them.

Here are the 2 classes of error returned by authorize:

coder/coderd/rbac/authz.go

Lines 326 to 333 in 6d41edc

results, err := a.query.Eval(ctx, rego.EvalParsedInput(astV))

if err != nil {

return ForbiddenWithInternal(xerrors.Errorf("eval rego: %w", err), subject, action, object, results)

}

if !results.Allowed() {

return ForbiddenWithInternal(xerrors.Errorf("policy disallows request"), subject, action, object, results)

}

Right now both are ForbiddenWithInternal. Maybe the correct solution here is to return something different on the Eval error.

Then I can check for rbac.IsUnauthorizedError() on the Filter function loop

Actually it gets a bit more annoying in the prepared one.

coder/coderd/rbac/authz.go

Line 434 in 6d41edc

EachQueryLoop:

🤔

spikecurtis · 2023-04-13T17:25:56Z

This looks right to me, I think this is a tricky flake to hit. Can we add a test to assert this behaviour?

Yeah, a test would be good. We generate a list of say, 3 Objecters --- but the middle one is a little bomb that, as a side effect to it's RBACObject() call, cancels the context. Assert that we get the error back instead of a list of 2.

Emyrk

Unit test added.
Correctly made 2 categories of errors.

I also added some code to clean up cancelled errors to be more uniform with expected context.Cancelled

Emyrk · 2023-04-14T15:36:37Z

coderd/rbac/authz.go

+		err = correctCancelError(err)
+		return xerrors.Errorf("evaluate rego: %w", err)


For authorize single items.

Emyrk · 2023-04-14T15:36:43Z

coderd/rbac/authz.go

+			err = correctCancelError(err)
+			return xerrors.Errorf("eval error: %w", err)


For prepared.

Emyrk · 2023-04-14T15:57:38Z

Ugh a unit test is really hard to make for this.

The ctx is checked for cancel in a go routine: https://github.com/open-policy-agent/opa/blob/main/rego/rego.go#L2002-L2004

If the policy executes fast enough, then the ctx is ignored. Which means this is racy.

spikecurtis · 2023-04-18T08:20:42Z

Ugh a unit test is really hard to make for this.

The ctx is checked for cancel in a go routine: https://github.com/open-policy-agent/opa/blob/main/rego/rego.go#L2002-L2004

If the policy executes fast enough, then the ctx is ignored. Which means this is racy.

The standard procedure for when your code needs to call some 3rd party thing that is non-deterministic, is to mock it out and have a test for each case/class of what the non-deterministic thing can return.

Emyrk · 2023-04-18T15:09:06Z

@spikecurtis yea, I thought mocking out the rego would be tough because they are all structs. But I just need to mock out our authz interface. I'll make this test work in another PR and open it again.

test: Handle Fitler flake with ctx errors

6d41edc

github-actions bot assigned Emyrk Apr 13, 2023

Emyrk requested a review from johnstcn April 13, 2023 15:55

johnstcn requested a review from spikecurtis April 13, 2023 16:26

johnstcn reviewed Apr 13, 2023

View reviewed changes

spikecurtis reviewed Apr 13, 2023

View reviewed changes

Emyrk added 2 commits April 14, 2023 10:24

Add unit test to check filter for proper error

633fab3

Correctly return category of errors

46b5099

Emyrk commented Apr 14, 2023

View reviewed changes

Emyrk requested review from johnstcn and spikecurtis April 14, 2023 15:37

Emyrk added 2 commits April 14, 2023 11:02

Add skip msg

c150c53

Fix

0c52c87

johnstcn approved these changes Apr 14, 2023

View reviewed changes

Emyrk added 3 commits April 14, 2023 11:13

Fix?

1d509f8

Fix typos

cddbafb

Fmt

75b840a

Emyrk changed the title ~~test: Handle Fitler flake with ctx errors~~ test: Handle Filter flake with ctx errors Apr 14, 2023

Emyrk merged commit 2137db0 into main Apr 14, 2023

Emyrk deleted the stevenmasley/rbac_context_err branch April 14, 2023 17:30

github-actions bot locked and limited conversation to collaborators Apr 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

test: Handle Filter flake with ctx errors #7119

test: Handle Filter flake with ctx errors #7119

Uh oh!

Emyrk commented Apr 13, 2023

Uh oh!

johnstcn left a comment

Uh oh!

spikecurtis left a comment

Uh oh!

spikecurtis Apr 13, 2023

Uh oh!

spikecurtis Apr 13, 2023

Uh oh!

Emyrk Apr 13, 2023

Uh oh!

Emyrk Apr 13, 2023

Uh oh!

Emyrk Apr 14, 2023

Uh oh!

spikecurtis commented Apr 13, 2023

Uh oh!

Emyrk left a comment

Uh oh!

Emyrk Apr 14, 2023

Uh oh!

Emyrk Apr 14, 2023

Uh oh!

Emyrk commented Apr 14, 2023

Uh oh!

spikecurtis commented Apr 18, 2023

Uh oh!

Emyrk commented Apr 18, 2023

Uh oh!

Uh oh!

	results, err := a.query.Eval(ctx, rego.EvalParsedInput(astV))
	if err != nil {
	return ForbiddenWithInternal(xerrors.Errorf("eval rego: %w", err), subject, action, object, results)
	}

	if !results.Allowed() {
	return ForbiddenWithInternal(xerrors.Errorf("policy disallows request"), subject, action, object, results)
	}

		err = correctCancelError(err)
		return xerrors.Errorf("evaluate rego: %w", err)

		err = correctCancelError(err)
		return xerrors.Errorf("eval error: %w", err)

test: Handle Filter flake with ctx errors #7119

test: Handle Filter flake with ctx errors #7119

Uh oh!

Conversation

Emyrk commented Apr 13, 2023

Uh oh!

johnstcn left a comment

Choose a reason for hiding this comment

Uh oh!

spikecurtis left a comment

Choose a reason for hiding this comment

Uh oh!

spikecurtis Apr 13, 2023

Choose a reason for hiding this comment

Uh oh!

spikecurtis Apr 13, 2023

Choose a reason for hiding this comment

Uh oh!

Emyrk Apr 13, 2023

Choose a reason for hiding this comment

Uh oh!

Emyrk Apr 13, 2023

Choose a reason for hiding this comment

Uh oh!

Emyrk Apr 14, 2023

Choose a reason for hiding this comment

Uh oh!

spikecurtis commented Apr 13, 2023

Uh oh!

Emyrk left a comment

Choose a reason for hiding this comment

Uh oh!

Emyrk Apr 14, 2023

Choose a reason for hiding this comment

Uh oh!

Emyrk Apr 14, 2023

Choose a reason for hiding this comment

Uh oh!

Emyrk commented Apr 14, 2023

Uh oh!

spikecurtis commented Apr 18, 2023

Uh oh!

Emyrk commented Apr 18, 2023

Uh oh!

Uh oh!