More DCE wrt case object equality test #2396

japgolly · 2016-05-17T01:04:59Z

I was pleasantly surprised to see that entire case objects are eliminated when unused.

DCE doesn't reduce this snippet however:

sealed abstract class Blah
case object ABC extends Blah
case object XYZ extends Blah

def blahName(b: Blah): String =
  b match {
    case ABC => "abc"
    case XYZ => "xyz"
  }

def main(): Unit =
  println(blahName(ABC))

Because XYZ is unused, apart from the following kinds of use...

XYZ.unapply in a case clause, given XYZ.unapply isn't overridden or overloaded.
_: XYZ.type in a case clause.

...I believe DCE could eliminate:

The XYZ object itself.
All case clauses referencing XYZ.

Thereby reducing the above snippet to:

sealed abstract class Blah
case object ABC extends Blah
// case object XYZ extends Blah

def blahName(b: Blah): String =
  b match {
    case ABC => "abc"
    // case XYZ => "xyz"
  }

def main(): Unit =
  println(blahName(ABC))


japgolly · 2016-05-17T01:08:56Z

Another instance of use that could be eliminated is _: XYZ.type in a case clause.

Will update head comment.

gzm0 · 2016-05-17T05:28:04Z

What happens if you mark blahName as @inline?

gzm0 · 2016-05-17T05:34:31Z

@sjrd I guess we could teach the optimizer to check for existence of instances when doing isInstanceOf checks, which should fold the if branch. The question is how much impact this will have on invalidation.

sjrd · 2016-05-17T06:20:57Z

Well here the thing the optimizer sees is if (XYZ == b). Collapsing that to false is hard.
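For context, a stable-identifier pattern such as case XYZ matches when XYZ == b, so the match in blahName behaves roughly like the if chain below. This is a hand-written sketch of that meaning, not actual compiler or optimizer output; the names DesugarSketch and blahNameDesugared are made up for illustration.

```scala
object DesugarSketch {
  sealed abstract class Blah
  case object ABC extends Blah
  case object XYZ extends Blah

  // Original form from the issue.
  def blahName(b: Blah): String =
    b match {
      case ABC => "abc"
      case XYZ => "xyz"
    }

  // Hand-written approximation: each stable-identifier pattern becomes a call
  // to Scala's == (that is, equals), not a reference check.
  def blahNameDesugared(b: Blah): String =
    if (ABC == b) "abc"
    else if (XYZ == b) "xyz"
    else throw new MatchError(b)
}
```

The XYZ == b test is what keeps XYZ reachable even though main never passes it.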

japgolly · 2016-05-18T03:39:44Z

> What happens if you mark blahName as @inline?

Just gave it a try with @inline; it doesn't eliminate XYZ.

> Well here the thing the optimizer sees is if (XYZ == b). Collapsing that to false is hard.

I don't know if this is a naive view but I imagine this optimisation could be implemented by something like:

1. Change the usage analysis of case objects so that references (usages) are recorded in two distinct buckets: bucket 1 for value-inspection usage (case clauses, equality or reference comparison, instance-of checks), and bucket 2 for everything else.
2. DCE₁: remove objects without any references in usage bucket 2.
3. DCE₂: for all references in the removed objects' usage bucket 1, remove case clauses and collapse equality tests to constant false.
4. Today's DCE runs and removes dead branches, which now include those introduced above.
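To make the bucket idea concrete, here is a minimal sketch of the classification and of the DCE₁ step. It is a toy model under my own assumptions, not the Scala.js linker's analysis: the names UsageBuckets, Usage, Inspection, OtherUse, Reference and removable are all hypothetical.

```scala
object UsageBuckets {
  // Toy model: every reference to a case object falls into one of two buckets.
  sealed trait Usage
  case object Inspection extends Usage // bucket 1: case clause, ==, eq, instance-of check
  case object OtherUse extends Usage   // bucket 2: everything else

  final case class Reference(target: String, usage: Usage)

  // DCE 1 in this toy model: an object is removable when nothing
  // references it from bucket 2.
  def removable(objects: Set[String], refs: List[Reference]): Set[String] = {
    val keptAlive = refs.collect { case Reference(t, OtherUse) => t }.toSet
    objects -- keptAlive
  }
}
```

For the snippet in the head comment, ABC has one bucket-2 reference (the argument in main) while XYZ only appears in a case clause, so only XYZ would be reported as removable.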

Would that work? Is the reason you foresee this being hard due to logic (in that above is too naive) or in implementation?

sjrd · 2016-05-18T04:27:17Z

Not really. What we want to get rid of is Obj == x (Scala's equality) for pattern matching. But == is not always reference equality: for example, Nil == Vector.empty is true.

After a round of optimization, == simplifies to === (aka eq) when that's the actual implementation. But after the optimizer we only have one round of DCE left, which is not enough to implement the rest of your "plan" (steps 2 and following). In particular what you call "DCE 2" actually needs a round of optimization, not DCE.
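A small illustration of the difference between == and eq, using the Nil and Vector.empty example. The object and value names are made up for this sketch.

```scala
object EqualityVsIdentity {
  // Scala's == delegates to equals; for Seq, that compares elements,
  // so two empty sequences of different classes are equal.
  val structurallyEqual: Boolean = Nil == Vector.empty

  // eq is reference equality: these are two distinct objects.
  val sameReference: Boolean = (Nil: AnyRef) eq (Vector.empty: AnyRef)
}
```

Here structurallyEqual is true and sameReference is false, which is why Obj == x cannot be treated as a reference check until equals is known to be the default implementation.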

gzm0 · 2023-09-16T05:47:19Z

Note to self: It seems like

Ignoring _ === LoadModule(_) trees in reachability analysis and
Folding the above to false if the module isn't instantiated

will give us the optimization we'd want. But the second step comes too late (the Emitter would be our earliest phase). We can optimize the expression, but we cannot eliminate code that becomes unused because of it, which limits its value.

(Yes, this is what sjrd has been saying all along)
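A rough sketch of the folding step on its own, using a toy tree type. These case classes only borrow names from the trees mentioned above and are not the real IR; FoldSketch, VarRef, RefEq, BoolLit and fold are hypothetical, and the sketch assumes the other operand has no side effects.

```scala
object FoldSketch {
  // Toy trees, not the real IR.
  sealed trait Tree
  final case class LoadModule(name: String) extends Tree
  final case class VarRef(name: String) extends Tree
  final case class RefEq(lhs: Tree, rhs: Tree) extends Tree // stands in for ===
  final case class BoolLit(value: Boolean) extends Tree

  // Fold `_ === LoadModule(m)` to false when module m is never instantiated.
  // Assumes the other operand is side-effect free, so it can be dropped.
  def fold(tree: Tree, instantiated: Set[String]): Tree = tree match {
    case RefEq(_, LoadModule(m)) if !instantiated(m) => BoolLit(false)
    case RefEq(LoadModule(m), _) if !instantiated(m) => BoolLit(false)
    case other                                       => other
  }
}
```

The fold itself is easy; the problem described above is that the set of instantiated modules is only final after reachability analysis, so code guarded by the folded condition has already been kept.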

gzm0 · 2025-01-19T10:24:45Z

I've had another (unsuccessful) pass at this. Sharing a negative result.

IIUC, as an approximation, the IR trees we are interested in are:

x.equals;Ljava.lang.Object;Z(y)

where x is a value of

module class type,
without an override of equals

My thought was to special case this in the Analyzer (in callMethod) to
extract the fact that we are checking module class identity.

This seems to be possible; however, it does not solve the problem:
we cannot trace back what x is in the analyzer phase, so we cannot
eliminate / flag the LoadModule call (which we assigned to x) as "readonly".

It seems like the only other option (short of adding another optimizer-like phase)
would be to investigate compiler support (compile to a linker friendlier tree).
But IIUC that would not work in terms of separate compilation guarantees we need to offer (a module class needs to be able to override equals without re-compilation of usage sites).
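To illustrate the separate compilation concern: a module class is free to override equals, and a usage site that matches against it has to keep calling equals. The example below is hypothetical (Anything and describe are invented names), but it shows a case clause whose outcome depends entirely on the override.

```scala
object EqualsOverride {
  // Hypothetical module class that overrides equals.
  case object Anything {
    override def equals(that: Any): Boolean = true
  }

  // The pattern `case Anything` is tested via Anything == x, so the
  // override decides whether the clause matches.
  def describe(x: Any): String = x match {
    case Anything => "anything"
    case _        => "other"
  }
}
```

If the usage site had been compiled down to a reference comparison, adding or removing such an override later would silently change behaviour unless describe were recompiled.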

gzm0 added the optimization label (Optimization only. Does not affect semantics or correctness.) May 17, 2016

sjrd changed the title ~~More DCE wrt case object unapply~~ More DCE wrt case object equality test Mar 23, 2017

sjrd added this to the Post-v1.0.0 milestone Oct 11, 2017

gzm0 removed this from the Post-v1.0.0 milestone Apr 8, 2020

gzm0 mentioned this issue Jan 17, 2025

Add a desugaring pass between the base linker and the optimizer. #5101

Merged
