[SPARK-11864] [SQL] Improve performance of max/min #9846

davies · 2015-11-19T21:59:05Z

This PR has the following optimization:

The greatest/least already does the null-check, so the If and IsNull are not necessary.
In greatest/least, it should initialize the result using the first child (removing one block).
For primitive types, the generated greater expression is too complicated (a > b ? 1 : (a < b) ? -1 : 0) > 0), should be as simple as a > b

Combine these optimization, this could improve the performance of ss_max query by 30%.

This reverts commit 3a23581.

nongli · 2015-11-19T22:19:17Z

LGTM

SparkQA · 2015-11-20T00:41:09Z

Test build #46358 has finished for PR 9846 at commit 7f7e33d.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2015-11-20T01:10:28Z

Test build #46362 has finished for PR 9846 at commit 593a361.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2015-11-20T01:14:01Z

Thanks - merging this.

This PR has the following optimization: 1) The greatest/least already does the null-check, so the `If` and `IsNull` are not necessary. 2) In greatest/least, it should initialize the result using the first child (removing one block). 3) For primitive types, the generated greater expression is too complicated (`a > b ? 1 : (a < b) ? -1 : 0) > 0`), should be as simple as `a > b` Combine these optimization, this could improve the performance of `ss_max` query by 30%. Author: Davies Liu <davies@databricks.com> Closes #9846 from davies/improve_max. (cherry picked from commit ee21407) Signed-off-by: Reynold Xin <rxin@databricks.com>

Davies Liu added 4 commits November 19, 2015 13:38

improve max/min

3a23581

Revert "improve max/min"

7141119

This reverts commit 3a23581.

improve max/min

7f7e33d

tuning

593a361

asfgit closed this in ee21407 Nov 20, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-11864] [SQL] Improve performance of max/min #9846

[SPARK-11864] [SQL] Improve performance of max/min #9846

Uh oh!

davies commented Nov 19, 2015

Uh oh!

nongli commented Nov 19, 2015

Uh oh!

SparkQA commented Nov 20, 2015

Uh oh!

SparkQA commented Nov 20, 2015

Uh oh!

rxin commented Nov 20, 2015

Uh oh!

Uh oh!

[SPARK-11864] [SQL] Improve performance of max/min #9846

[SPARK-11864] [SQL] Improve performance of max/min #9846

Uh oh!

Conversation

davies commented Nov 19, 2015

Uh oh!

nongli commented Nov 19, 2015

Uh oh!

SparkQA commented Nov 20, 2015

Uh oh!

SparkQA commented Nov 20, 2015

Uh oh!

rxin commented Nov 20, 2015

Uh oh!

Uh oh!