Java: Improve modelling of Spring requests, flow steps and XSS sinks #3653

lcartey · 2020-06-09T10:08:41Z

As part of some recent customer engagements I spent some time improving our modelling of Spring sources, flow steps and XSS sinks.

The identification of trusted/untrusted data and XSS sinks is primarily taken from the Spring reference documentation for request mapping handler methods, which has a detailed breakdown of how parameters and return values are treated based on their types and annotations.

Review highlights:

This adds some general taint steps for flow out of Maps and Lists. This is required because the tainted parameters to a Spring request method may be either of these types, and we need to track flow out of them. One concern here is whether this can result in spurious flow through these types, although I believe this will not happen in practice because we generally don't track flow into them (i.e. we don't have List.add etc. as a taint step).
I've also added a taint step for String.replace, as I had a customer-provided benchmark which required flow through an inadequate replace call. If we retain this step, we may need to add some sanitizers to the XSS (and other) queries for "safe"-looking replaces.
Spring has a mechanism whereby complex datatypes can automatically be populated from the request parameters or the request body (and thus contain user-provided data). For example, if parameters are annotated with @RequestBody:
```
@PostMapping
public void createUser(@RequestBody User user) {
  ...
}
```
The User class will be populated from JSON provided in the request body, with fields populated recursively. To capture this aspect I have introduced the concept of SpringUntrustedDataType, and added taint flow steps through getter methods on these classes. I also wrote a helper (stripType) to unwrap generics (so that we can identify type parameters on lists etc. as also untrusted).
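To make the last two highlights concrete, here is a minimal plain-Java sketch of the kind of code they are meant to cover. It is an illustration only, not code from the PR or from the customer benchmark: the `RequestBody` annotation is a hypothetical stand-in declared locally so the example compiles without Spring, and `stripScriptTags` is an invented example of an inadequate replace call.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;

class UntrustedGetterSketch {
    // Hypothetical stand-in for Spring's @RequestBody (assumption: declared
    // here only so the sketch compiles without Spring on the classpath).
    @Retention(RetentionPolicy.RUNTIME)
    @Target(ElementType.PARAMETER)
    @interface RequestBody {}

    // A type that a framework would populate from the request body,
    // so every field may hold user-provided data.
    static class User {
        private final String name;

        User(String name) {
            this.name = name;
        }

        String getName() {
            return name;
        }
    }

    // Inadequate sanitizer (invented for illustration): a single pass of
    // replace does not rescan the result, so nested tags survive.
    static String stripScriptTags(String s) {
        return s.replace("<script>", "");
    }

    // Handler-shaped method: untrusted data leaves `user` through a getter,
    // passes through the weak replace, and reaches the returned HTML.
    static String createUser(@RequestBody User user) {
        String name = user.getName();
        String cleaned = stripScriptTags(name);
        return "<p>Created " + cleaned + "</p>";
    }

    public static void main(String[] args) {
        User attacker = new User("<scr<script>ipt>alert(1)</scr<script>ipt>");
        // The payload is reassembled by the replace call itself.
        System.out.println(createUser(attacker));
    }
}
```

Without a taint step through `getName()` and through `replace`, an analysis would lose track of the data before it reaches the return value.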

Work that still needs to be done:

- Performance testing - I've only tried this on small to medium sized projects.
- Verification that the additional taint steps do not create false positives in other projects.
- Tidy up the QL.

This addresses points 1-3 from: https://github.com/github/codeql-java-team/issues/9

aschackmull · 2020-06-09T11:06:17Z

> This adds some general taint steps for flow out of Maps and Lists. This is required because the tainted parameters to a Spring request method may be either of these types, and we need to track flow out of them. One concern here is whether this can result in spurious flow through these types, although I believe this will not happen in practice because we generally don't track flow into them (i.e. we don't have List.add etc. as a taint step).

This is not entirely accurate. We do already have default taint steps for flow in and out of collections and maps, see https://github.com/github/codeql/blob/master/java/ql/src/semmle/code/java/dataflow/internal/ContainerFlow.qll for details about what we currently include.

aschackmull · 2020-06-09T11:34:06Z

Differences job started: https://jenkins.internal.semmle.com/job/Changes/job/Java-Differences/775/

aschackmull · 2020-06-16T07:49:52Z

Could you rebase onto latest master, please? That would make it a bit easier to get a working Difference job running.

aschackmull · 2020-06-24T09:17:11Z

Annoyingly, rebases don't provide notifications. In any case: https://jenkins.internal.semmle.com/job/Changes/job/Java-Differences/797/

aschackmull · 2020-07-02T09:44:49Z

The Differences job failed for some reason. Retrying: https://jenkins.internal.semmle.com/job/Changes/job/Java-Differences/815/

aschackmull · 2020-07-03T11:59:35Z

https://jenkins.internal.semmle.com/job/Changes/job/Java-Differences/822/

aibaars · 2020-07-03T15:59:15Z

java/ql/src/semmle/code/java/dataflow/internal/TaintTrackingUtil.qll

+  or
+  m.getDeclaringType().getSourceDeclaration().getASourceSupertype*().hasQualifiedName("java.util", "List") and
+  (
+    m.getName().regexpMatch("get|toArray|subList|spliterator|set|iterator|listIterator") or


@aschackmull Wouldn't it be better to replace these regexpMatch calls with something like:

m.hasName([ "get", "toArray", ...])
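As a rough Java analogue of this suggestion (not QL, and not code from the PR), the sketch below compares a whole-string regex alternation with plain set membership over the method names from the diff. The class and method names are invented for illustration. `String.matches` requires the whole string to match, which is the behaviour the alternation in the diff relies on.

```java
import java.util.Set;

class NameMatchSketch {
    // Method names taken from the regexpMatch call in the diff above.
    static final Set<String> LIST_READ_METHODS = Set.of(
        "get", "toArray", "subList", "spliterator", "set", "iterator", "listIterator");

    // Regex form: an alternation that must match the entire name.
    static boolean byRegex(String name) {
        return name.matches("get|toArray|subList|spliterator|set|iterator|listIterator");
    }

    // Set form: the same check expressed as membership in a fixed set.
    static boolean bySet(String name) {
        return LIST_READ_METHODS.contains(name);
    }

    public static void main(String[] args) {
        for (String name : new String[] {"get", "listIterator", "getFirst", "add"}) {
            System.out.println(name + ": regex=" + byRegex(name) + ", set=" + bySet(name));
        }
    }
}
```

The two forms agree on every input here; the set form avoids regex metacharacter pitfalls and states the intent directly.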

aschackmull · 2020-07-06T15:07:55Z

I've made a bunch of suggested changes here: lcartey#1

aschackmull · 2020-07-06T15:09:25Z

The differences job shows a 3% time increase for jdk, so that looks acceptable.

aschackmull · 2020-07-06T15:12:00Z

One of the individual commit comments (7d555a7) suggests that some of the added steps are specific to XSS. Is this the case? If so, then we should move them to the XSS query.

aschackmull · 2020-07-07T12:51:15Z

Three tests are failing because marking a parameter with @RequestParam is no longer enough for it to be considered a taint source: the parameter must now also belong to a SpringRequestMappingMethod. @lcartey could you verify that this is intentional? If so, I guess we'll just need to add @RequestMapping annotations in those three tests, or something along those lines.

lcartey · 2020-07-07T13:29:13Z

> One of the individual commit comments (7d555a7) suggests that some of the added steps are specific to XSS. Is this the case? If so, then we should move them to the XSS query.

I don't think they are specific to XSS, but they are most likely to be seen with XSS. I think they are fine as general taint steps.

lcartey · 2020-07-07T13:36:56Z

> Three tests are failing because marking a parameter with @RequestParam is no longer enough for it to be considered a taint source: the parameter must now also belong to a SpringRequestMappingMethod. @lcartey could you verify that this is intentional? If so, I guess we'll just need to add @RequestMapping annotations in those three tests, or something along those lines.

Yes, this is intentional. @RequestParam is not sufficient - there must be an @RequestMapping annotation (or similar) on the method itself, and a @Controller (or similar) on the declaring class. We have seen false positives for XSS caused by annotating dead @RequestParams as untrusted data.
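The rule described here can be sketched in plain Java using reflection. This is only an analogy for the static check, not the QL implementation: the three annotations are hypothetical local stand-ins for the Spring ones, the class and method names are invented, and the "or similar" variants (such as the verb-specific mapping annotations) are deliberately left out.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Executable;
import java.lang.reflect.Parameter;

class RequestParamSourceSketch {
    // Hypothetical stand-ins for the Spring annotations (assumption: declared
    // locally so the sketch runs without Spring on the classpath).
    @Retention(RetentionPolicy.RUNTIME) @Target(ElementType.TYPE)
    @interface Controller {}

    @Retention(RetentionPolicy.RUNTIME) @Target(ElementType.METHOD)
    @interface RequestMapping {}

    @Retention(RetentionPolicy.RUNTIME) @Target(ElementType.PARAMETER)
    @interface RequestParam {}

    // A real handler: controller class, mapped method, annotated parameter.
    @Controller
    static class LiveController {
        @RequestMapping
        String greet(@RequestParam String name) { return "Hello " + name; }
    }

    // A "dead" @RequestParam: nothing routes a request to this method.
    static class DeadHelper {
        String greet(@RequestParam String name) { return "Hello " + name; }
    }

    // All three conditions must hold for the parameter to count as a source.
    static boolean isRemoteSource(Parameter p) {
        Executable e = p.getDeclaringExecutable();
        return p.isAnnotationPresent(RequestParam.class)
            && e.isAnnotationPresent(RequestMapping.class)
            && e.getDeclaringClass().isAnnotationPresent(Controller.class);
    }

    // Looks up the single parameter of the `greet` method on the given class.
    static Parameter greetParam(Class<?> c) {
        try {
            return c.getDeclaredMethod("greet", String.class).getParameters()[0];
        } catch (NoSuchMethodException ex) {
            throw new IllegalStateException(ex);
        }
    }

    public static void main(String[] args) {
        System.out.println(isRemoteSource(greetParam(LiveController.class))); // true
        System.out.println(isRemoteSource(greetParam(DeadHelper.class)));     // false
    }
}
```

Under this rule, a test that only annotates a parameter with @RequestParam no longer yields a source, which matches the three failing tests mentioned above.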

lcartey requested a review from a team as a code owner June 9, 2020 10:08

lcartey added 21 commits June 16, 2020 09:50

Java: Improve modelling of Spring request methods

f5dc033

- Recognise @<httpverb>Mapping as well as @RequestMapping.
- Identify tainted/not tainted parameters of RequestMapping methods.

Java: Update RemoteFlowSource to use improved Spring request parameter mapping.

4300bc8

Java: Add SpringWebRequest to RemoteTaintedMethod

6de2b93

Java: Add flow out of Map and List

7c4251d

Java: Improve Spring controller modelling

bfcc06d

- Identify ModelMaps correctly
- Add extra not tainted param types (Pageable)
- Identify ModelAttributes

Java: Modelling of the Spring HTTP classes.

fd2cd60

Java: Model Spring @ResponseBody methods.

1d12340

Java: Track flow through HttpEntity and ResponseEntity

7d555a7

- Only track if the body is a String type, as that is the only type at risk of XSS.

Java: Taint tracking through String.replace(all)?

c59042f

Java: Add Spring XSS sinks

8057dff

Look for Spring request methods which return a String value which may be coerced into a text/html output.

Java: Model produces parameter to RequestMapping attribute.

f6a99cb

Java: XSS - ignore Spring sinks when content-type is safe.

e2cec58

Methods annotated with a produces field which indicates a safe content-type should not be considered XSS sinks. For example: @RequestMapping(..., produces = "application/json")

Java: Model ResponseEntity.BodyBuilder

f6b2acc

Java: Model taint flow through ResponseEntity.

0db7cea

Java: SpringController - handle non-string literal produces values.

8bd5f74

Java: Model untrusted user data types

8678d5f

Model the datatypes that may be populated on demand from request parameters.

Java: Add taint step to flow through Spring tainted user data class getters.

93c28d4

Java: Add Spring flow out of HttpEntity and HttpHeader

cd6339f

Java: Model Spring WebClients/RestTemplates.

9625e82

Java: Add Spring RestTemplate return values to untrusted data types

f2edc53

- Also improve unwrapping of lists/arrays/maps etc.

Java: Add RestTemplate as flow source.

2978af3

lcartey force-pushed the java/improve-spring-support branch from 60c536d to 2978af3 Compare June 16, 2020 08:51

Java: Split SpringWebRequestGetMethod into its own class.

6de612a

aibaars reviewed Jul 3, 2020

aschackmull added 10 commits July 6, 2020 14:18

Java: Make a few predicates private and autoformat SpringController.

a41c2d8

Java: Remove list, map, and StringReplaceMethod flow steps.

2ae15f9

Java: Clean up SpringHttp.qll

2ce0921

Java: Minor typo fix and autoformat

a80e663

Java: Misc grammar fixes.

5d8f9a7

Java: Cleanup TaintTrackingUtil.qll

e6658c5

Java: Add some qldoc and minor formatting.

5e9e7fe

Java: More qldoc and some formatting.

b06d1c7

Java: Misc grammar and formatting.

ae21de9

Java: Use SpringHttpEntity class.

f98460c

Merge pull request #1 from aschackmull/java/spring-3653

3fef5ca

Java: Review changes for github#3653

aschackmull and others added 5 commits July 8, 2020 13:06

Merge branch 'master' into java/spring-3653-2

48e4759

Java: Fix LdapInjection qltest

581d496

Java: Fix JndiInjection qltest

a4fe4f4

Java: Fix OgnlInjection qltest

b88ebd6

Merge pull request #2 from aschackmull/java/spring-3653-2

443c13d

Java: Fix qltests for github#3653

aschackmull approved these changes Jul 8, 2020

aschackmull merged commit 528f250 into github:master Jul 8, 2020

aschackmull mentioned this pull request Oct 7, 2020

[java] Merged with 3665 (https://github.com/github/codeql/pull/3665) #3674

Closed
