bpo-37587: Make json.loads faster for long strings #14752

mpaolini · 2019-07-13T15:05:21Z

https://bugs.python.org/issue37587

the-knights-who-say-ni · 2019-07-13T15:05:24Z

Hello, and thanks for your contribution!

I'm a bot set up to make sure that the project can legally accept your contribution by verifying you have signed the PSF contributor agreement (CLA).

Our records indicate we have not received your CLA. For legal reasons we need you to sign this before we can look at your contribution. Please follow the steps outlined in the CPython devguide to rectify this issue.

If you have recently signed the CLA, please wait at least one business day
before our records are updated.

You can check yourself to see if the CLA has been received.

Thanks again for your contribution, we look forward to reviewing it!

mangrisano · 2019-07-13T15:18:55Z

/cc @tiran @ezio-melotti @serhiy-storchaka

serhiy-storchaka

c is only used in the condition if (!(c == '"' || c == '\\')) after the loop. It duplicates the condition in the loop. Maybe move it in the loop and change to if (next == len)?

mpaolini · 2019-07-13T20:04:13Z

@serhiy-storchaka c is also used later on in the if (c == '"') test . Anyays that MOV turned out to be a red herring, what made the difference whas the strict check removal.

Thanks to @pablogsal and @zooba for helping me!

Forces the compiler to use a register variable for a tight loop in the hot-path. It also optimizes a condition for the common case strict=true.

json.loads hot path. This partially reverts the previous commit as we found out the culprit wasn't the MOV but the strictness checks

ncoghlan · 2019-07-14T01:16:59Z

Modules/_json.c

+            /* Defer the strict error until outside this (hot) loop. */
+            /* See bpo-37587 */
+            if (c <= 0x1f && invalid < 0) {
+                invalid = next;


It seems unlikely that an ordering comparison could be faster than a check against zero, so I suspect most or all of the speed-up here is coming from performing the usually-false check before the usually-true one.

What performance numbers do you get with just the reordering of the operands to put the character check first?

My other question would be to ask what happens to the assembly output if you explicitly mark "strict" as const with the original code? (the while loop body is long enough that I doubt compilers will be scanning it to see if "strict" is ever actually modified, so the explicit hint may make a difference)

just flipping around the condition makes it 11% faster than vanilla. moving the goto out of the loop gives you another 1% (maybe just because the loop is smaller)

In that case, I'd suggest changing the revised check to be:

if (c <= 0x1f && strict) { invalid = next; break; }

And then not re-checking strict outside the loop - just check for invalid >= 0.

(The currently posted version will be much slower when it comes to rejecting bad data, as it will continue a doomed scan, potentially until the end of the file, instead of exiting as soon as it sees an invalid character)

Being slower when rejecting bad data is okay though, since it's going to result in an exception.

Our "even faster" version defers the actual check until after the loop completes, and we keep the minimum seen c throughout the loop (which uses a branchless cmov instruction on Intel-like architectures). That way we only check strict once.

And I believe you're spot on - the performance improvement from simply switching comes because the first condition being false bypasses a couple of branches, whereas before they were being included, and when strict==0 it should make no difference. But avoiding the additional branching entirely is even better.

The last edition looks incorrect to me. Shouldn't break be added after invalid = next?

I am fine with just swapping strict and c <= 0x1f if it gives a noticeable effect (10% or like). We can try more subtle changes (with effect ~ 1%) in a separate issue.

we removed the break on purpose to make the loop as small as possible, the algorithm is still correct

@ncoghlan done as you suggested, just switched around the strict check

pablogsal · 2019-07-30T00:41:58Z

@mpaolini Can you attach here some of the benchmarks we did together with @zooba and @tiran on EuroPython?

mpaolini · 2019-07-30T00:45:37Z

@pablogsal some of the benchmarks we did during the sprint are attached to the issue here https://bugs.python.org/issue37587

ncoghlan · 2019-07-30T14:17:08Z

Thanks @mpaolini!

miss-islington · 2019-07-30T14:17:58Z

Thanks @mpaolini for the PR, and @ncoghlan for merging it 🌮🎉.. I'm working now to backport this PR to: 3.8.
🐍🍒⛏🤖

bedevere-bot · 2019-07-30T14:18:20Z

GH-15022 is a backport of this pull request to the 3.8 branch.

When scanning the string, most characters are valid, so checking for invalid characters first means never needing to check the value of strict on valid strings, and only needing to check it on invalid characters when doing non-strict parsing of invalid strings. This provides a measurable reduction in per-character processing time (~11% in the pre-merge patch testing). (cherry picked from commit 8a758f5) Co-authored-by: Marco Paolini <mpaolini@users.noreply.github.com>

When scanning the string, most characters are valid, so checking for invalid characters first means never needing to check the value of strict on valid strings, and only needing to check it on invalid characters when doing non-strict parsing of invalid strings. This provides a measurable reduction in per-character processing time (~11% in the pre-merge patch testing).

the-knights-who-say-ni added the CLA not signed label Jul 13, 2019

bedevere-bot added the awaiting review label Jul 13, 2019

serhiy-storchaka reviewed Jul 13, 2019

View reviewed changes

mpaolini added 2 commits July 13, 2019 21:08

bpo-37587: Make json.loads faster for long strings

e78b873

Forces the compiler to use a register variable for a tight loop in the hot-path. It also optimizes a condition for the common case strict=true.

Mimimize the conditionals and the numer of ops inside the

fa52a83

json.loads hot path. This partially reverts the previous commit as we found out the culprit wasn't the MOV but the strictness checks

mpaolini force-pushed the bpo-37587 branch from 921d4e6 to fa52a83 Compare July 13, 2019 20:10

Cleanup indentation

70feba6

ncoghlan reviewed Jul 14, 2019

View reviewed changes

Revert some changes not relevant for performance

f4ba2f0

the-knights-who-say-ni added CLA signed and removed CLA not signed labels Jul 30, 2019

ncoghlan approved these changes Jul 30, 2019

View reviewed changes

bedevere-bot added awaiting merge and removed awaiting review labels Jul 30, 2019

ncoghlan merged commit 8a758f5 into python:master Jul 30, 2019

bedevere-bot removed the awaiting merge label Jul 30, 2019

ncoghlan added the needs backport to 3.8 label Jul 30, 2019

bedevere-bot removed the needs backport to 3.8 label Jul 30, 2019

Uh oh!

bpo-37587: Make json.loads faster for long strings #14752

bpo-37587: Make json.loads faster for long strings #14752

Uh oh!

Conversation

mpaolini commented Jul 13, 2019 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

the-knights-who-say-ni commented Jul 13, 2019

Uh oh!

mangrisano commented Jul 13, 2019

Uh oh!

serhiy-storchaka left a comment

Choose a reason for hiding this comment

Uh oh!

mpaolini commented Jul 13, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ncoghlan Jul 14, 2019

Choose a reason for hiding this comment

Uh oh!

mpaolini Jul 14, 2019

Choose a reason for hiding this comment

Uh oh!

ncoghlan Jul 14, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zooba Jul 14, 2019

Choose a reason for hiding this comment

Uh oh!

serhiy-storchaka Jul 14, 2019

Choose a reason for hiding this comment

Uh oh!

mpaolini Jul 15, 2019

Choose a reason for hiding this comment

Uh oh!

mpaolini Jul 30, 2019

Choose a reason for hiding this comment

Uh oh!

pablogsal commented Jul 30, 2019

Uh oh!

mpaolini commented Jul 30, 2019

Uh oh!

ncoghlan commented Jul 30, 2019

Uh oh!

miss-islington commented Jul 30, 2019

Uh oh!

bedevere-bot commented Jul 30, 2019

Uh oh!

Uh oh!

mpaolini commented Jul 13, 2019 •

edited by bedevere-bot

Loading

mpaolini commented Jul 13, 2019 •

edited

Loading

ncoghlan Jul 14, 2019 •

edited

Loading