Introduce IR ops for unsigned extension and comparisons. #5186
Conversation
Could you split the first commit into its own PR? I think it really does not relate to this one except for discovery. (Also, we should get that one in fast, to not have broken printing.)
I added a commit that switches to using …
On a high level: I think adding the remaining opcodes makes sense for completeness.
However, I'm concerned about test coverage: IIUC the only way we generate these opcodes is through optimization. I'm not sure that's enough to test that the tail of the pipeline handles them correctly?
What exactly do you refer to with "the tail of the pipeline"? The emitters definitely have to handle them correctly. The only exception is the JS emitter with …

We could introduce (some of) them in javalib methods, to exercise them more.
I mean the emitters, yes. What I was trying to say is that, to me, it is absolutely not obvious that our current test suite covers the relevant code in the emitters. And that is a bit scary :-/

Now that I'm writing this, I realize the comment also applies to the optimizer (and in fact it feels worse): if a given opcode is never in serialized IR, how do we test that the optimizations on that opcode are correct?
That would probably help, yes. IIUC this is quite easy to do with the "replace body infrastructure"?
I don't have a good intuition for whether this is acceptable. On one hand, a lot of the code is quite mechanical and often shared, so it feels like a bug is unlikely. On the other hand, it seems sub-par compared to how we usually test. I'll think about it some more.
Yes, that's very easy. I added a commit with just that for now, to see what it looks like.

Another possibility would be to recognize the unsigned comparison patterns in the compiler backend. That would significantly expand the coverage. (for …)
And I pushed yet another commit where the compiler backend recognizes unsigned comparisons. LMK which variant you think is best. I have a slight preference for the last one, even if it is a bit inelegant.
I agree. Consider adding an AST test to make sure we're actually hitting the expected rewrites.
Alright. Rebased, squashed, cleaned up, added an AST test, and addressed the other earlier comments. It's ready for another round.
This completes the set of `UnaryOp`s and `BinaryOp`s to directly manipulate the unsigned representation of integers.

Unlike other operations, such as `Int_unsigned_/`, the unsigned extension and comparisons have efficient (and convenient) implementations in user land. It is common for regular code to directly use the efficient implementation (e.g., `x.toLong & 0xffffffffL`) instead of the dedicated library method (`Integer.toUnsignedLong`). If we only replaced the body of the library methods with IR nodes, we would miss improvements in all the other code. Therefore, in this case, we instead recognize the user-space patterns in the optimizer, and replace them with the unsigned IR operations through folding.

Moreover, for unsigned comparisons, we also recognize the patterns in the compiler backend. The purpose here is mostly to make sure that all these opcodes end up in the serialized IR, so that we effectively test them along the entire pipeline.

When targeting JavaScript, the new IR nodes do not actually make any difference. For `int` operations, the Emitter sort of "undoes" the folding of the optimizer to implement them. That said, it could choose an alternative implementation based on `>>> 0`, which we should investigate in the future. For `Long`s, the subexpressions of the patterns are expanded into the `RuntimeLong` operations before folding gets a chance to recognize them (when they have not been transformed by the compiler backend). That's fine, because internal folding of the underlying `int` operations will do the best possible thing anyway. The size increase is only due to the additional always-reachable methods in `RuntimeLong`. Those can be removed by standard JS minifiers.

When targeting Wasm, this allows the emitter to produce the dedicated Wasm opcodes, which are more likely to be efficient. To be fair, we could have achieved the same result by recognizing the patterns in the Wasm emitter instead. The deeper reason to add those IR operations is for completeness. They were the last operations from a standard set that were missing in the IR.
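For concreteness, here is the user-land zero-extension pattern the description refers to, next to the dedicated library method (plain Scala for illustration only; the exact shapes the optimizer folds are defined in `OptimizerCore`):

```scala
object UnsignedExtensionPattern {
  // The user-land pattern mentioned above: zero-extend an Int into a Long.
  // Semantically the same as the dedicated library method Integer.toUnsignedLong(x).
  def zeroExtend(x: Int): Long =
    x.toLong & 0xffffffffL

  def main(args: Array[String]): Unit = {
    assert(zeroExtend(-1) == 0xffffffffL)
    assert(zeroExtend(-1) == java.lang.Integer.toUnsignedLong(-1))
    assert(zeroExtend(42) == 42L)
  }
}
```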
Second commit: Use `x >>> 0` instead of `x ^ 0x80000000` for unsigned comparisons.

Benchmarks show that this is slightly faster. Inspection of the source code of V8 also suggests that it does not recognize `x ^ 0x80000000` as doing anything special, but it does recognize `x >>> 0` as emitting an `Unsigned32`, and it does generate unsigned comparisons when both inputs are known to be `Unsigned32`.
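For reference, the two encodings order values identically. A small plain-Scala sanity check of that claim (the JS `x >>> 0` is mirrored here by zero-extending to a `Long`; this is an assumption of the sketch, not emitter code):

```scala
object UnsignedComparisonEncodings {
  // Encoding 1: flip the sign bit of both operands, then compare as signed.
  def viaSignFlip(x: Int, y: Int): Boolean =
    (x ^ 0x80000000) < (y ^ 0x80000000)

  // Encoding 2: zero-extend both operands (what `x >>> 0` achieves in JS)
  // and compare the resulting non-negative values.
  def viaZeroExtend(x: Int, y: Int): Boolean =
    (x.toLong & 0xffffffffL) < (y.toLong & 0xffffffffL)

  def main(args: Array[String]): Unit = {
    val samples = List(0, 1, -1, 42, Int.MinValue, Int.MaxValue)
    for (x <- samples; y <- samples)
      assert(viaSignFlip(x, y) == viaZeroExtend(x, y))
  }
}
```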
LGTM. Only one note, but for sure not actionable on this PR.
private object IntFlipSign {
  def unapply(tree: PreTransform): Option[PreTransform] = tree match {
    case PreTransBinaryOp(BinaryOp.Int_^, PreTransLit(IntLiteral(Int.MinValue)), x) =>
      Some(x)
    case _ =>
      None
  }
}
Oh, dear: Looking at this I got confused as hell: it seems we do not consistently normalize literals to the lhs / rhs.
Comparisons: literal to rhs
scala-js/linker/shared/src/main/scala/org/scalajs/linker/frontend/optimizer/OptimizerCore.scala
Lines 4517 to 4518 in 052d861
case (PreTransLit(IntLiteral(_)), _) =>
  foldBinaryOp(flippedOp, rhs, lhs)
Bit ops (example): literal to lhs
scala-js/linker/shared/src/main/scala/org/scalajs/linker/frontend/optimizer/OptimizerCore.scala
Line 4323 in 052d861
case (_, PreTransLit(IntLiteral(_))) => foldBinaryOp(Int_^, rhs, lhs)
So this pattern match is correct, but the overall system is confusing :-/
Yes, for almost everything we normalize literals on the lhs, but for comparisons we normalize to the rhs. 🤷♂️
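Presumably that lhs convention for bit ops is also why `IntFlipSign` above matches the `Int.MinValue` literal in the first operand position: by the time folding runs, `x ^ 0x80000000` has already been normalized with the literal on the lhs. A trivial value-level illustration of the two conventions (plain Scala, not optimizer code; the normalization itself is purely syntactic):

```scala
object NormalizationIllustration {
  def main(args: Array[String]): Unit = {
    val x = 42
    // Bit ops: the literal is normalized to the lhs (commutative, so the value is unchanged).
    assert((x ^ 0x80000000) == (0x80000000 ^ x))
    // Comparisons: the literal is normalized to the rhs, flipping the operator.
    assert((5 < x) == (x > 5))
  }
}
```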
This completes the set of `UnaryOp`s and `BinaryOp`s to directly manipulate the unsigned representation of integers.

Unlike other operations, such as `Int_unsigned_/`, the unsigned extension and comparisons have efficient implementations in user land. It is common for regular code to directly use the efficient implementation (e.g., `x.toLong & 0xffffffffL`) instead of the dedicated library method (`Integer.toUnsignedLong`). If we only replaced the body of the library methods with IR nodes, we would miss improvements in all the other code. Therefore, in this case, we instead recognize the efficient patterns in the optimizer, and replace them with the unsigned IR operations through folding.

When targeting JavaScript, the new IR nodes do not actually make any difference. For `int` operations, the Emitter sort of "undoes" the folding of the optimizer to implement them. That said, it could choose an alternative implementation based on `>>> 0`, which we should investigate in the future. For `Long`s, the subexpressions of the patterns are expanded into the `RuntimeLong` operations before folding gets a chance to recognize them. That's fine, because internal folding of the underlying `int` operations will do the best possible thing anyway. The size increase is only due to the additional always-reachable methods in `RuntimeLong`. Those can be removed by standard JS minifiers.

When targeting Wasm, this allows the emitter to produce the dedicated Wasm opcodes, which are more likely to be efficient. To be fair, we could have achieved the same result by recognizing the patterns in the Wasm emitter instead. The deeper reason to add those IR operations is for completeness. They were the last operations from a standard set that were missing in the IR.

Second commit: Use `x >>> 0` instead of `x ^ 0x80000000` for unsigned comparisons.

Benchmarks show that this is slightly faster. Inspection of the source code of V8 also suggests that it does not recognize `x ^ 0x80000000` as doing anything special, but it does recognize `x >>> 0` as emitting an `Unsigned32`, and it does generate unsigned comparisons when both inputs are known to be `Unsigned32`.

(this really is the last IR change I have in mind these days :p)