
Macros: Size of generated code is exponential #108

Open
FlorianKirmaier opened this issue Nov 12, 2018 · 4 comments

@FlorianKirmaier
Contributor

The code generated by the macros gets pretty big: its size grows exponentially with the depth of the case-class hierarchy.

I think the reason for this behavior is that the picklers are generated separately for serializing and for deserializing.

In the following sample, the pickler for TestCaseClass1 is created 16 times.

case class TestCaseClass5(x: TestCaseClass4)
case class TestCaseClass4(x: TestCaseClass3)
case class TestCaseClass3(x: TestCaseClass2)
case class TestCaseClass2(x: TestCaseClass1)
case class TestCaseClass1(x: Int)
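
For illustration, here is a rough model of the growth (an assumption about the shape of the expansion, not measured from the actual macro output): if each level materializes the pickler of its field type twice, once in its pickle body and once in its unpickle body, the innermost pickler is duplicated 2^(depth-1) times, i.e. 16 times at depth 5.

// Rough model of the expansion growth (hypothetical helper, not part of BooPickle):
// assumes each level inlines the child pickler twice, once for pickling and
// once for unpickling.
object ExpansionGrowth extends App {
  def copiesOfInnermostPickler(depth: Int): Int =
    if (depth <= 1) 1                            // TestCaseClass1 itself
    else 2 * copiesOfInnermostPickler(depth - 1) // pickle body + unpickle body

  (1 to 5).foreach(d => println(s"depth $d -> ${copiesOfInnermostPickler(d)} copies"))
  // depth 5 -> 16 copies, matching the observation above
}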
@ochrons
Collaborator

ochrons commented Nov 12, 2018

This seems to be just one of many examples where automatic derivation of picklers is a bad practice. It's recommended to explicitly declare the implicit picklers in one place and use them, instead of relying on automatic generation at every call site.

For example, https://scalafiddle.io/sf/ZRO6far/0 "solves" this issue through explicit pickler definitions.

Or am I missing something?
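
A minimal sketch of that explicit approach for the hierarchy above, assuming the generatePickler and Pickle.intoBytes entry points from boopickle.Default (the object and val names are just illustrative):

import boopickle.Default._

// Derive each pickler exactly once in a single place. Since the picklers for
// the inner types are already in implicit scope, deriving an outer pickler
// reuses them instead of re-expanding the macro for every level.
object Picklers {
  implicit val tc1Pickler: Pickler[TestCaseClass1] = generatePickler[TestCaseClass1]
  implicit val tc2Pickler: Pickler[TestCaseClass2] = generatePickler[TestCaseClass2]
  implicit val tc3Pickler: Pickler[TestCaseClass3] = generatePickler[TestCaseClass3]
  implicit val tc4Pickler: Pickler[TestCaseClass4] = generatePickler[TestCaseClass4]
  implicit val tc5Pickler: Pickler[TestCaseClass5] = generatePickler[TestCaseClass5]
}

// Call sites then only import the predefined picklers:
// import Picklers._
// val bytes = Pickle.intoBytes(
//   TestCaseClass5(TestCaseClass4(TestCaseClass3(TestCaseClass2(TestCaseClass1(42))))))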

@FlorianKirmaier
Contributor Author

I understand that automatic derivation is currently bad practice, for numerous reasons. But the generated code still shouldn't grow exponentially.

Automatic derivation also has some advantages: it makes experimenting much easier, and it allowed me to switch from upickle to boopickle without many changes in my codebase.

I also think that fixing this issue might reduce the generated code for other use cases.

I think the problem happens in the following file:
https://github.com/suzaku-io/boopickle/blob/master/boopickle/shared/src/main/scala/boopickle/PicklerMaterializersImpl.scala

The pickle and unpickle logic each instantiate their own copies of the nested picklers in the generated code. Moving them into a def shared between both methods might fix this issue, but it would be important to check whether that affects BooPickle's performance.

@ochrons
Collaborator

ochrons commented Nov 12, 2018

Yea, there could be some room for optimization there, which would cut the number of derived picklers in half, but it wouldn't solve the root cause. It might make sense to collect the types of the case class accessors separately, assign the derived pickler for each type to an object-private val, and then use those in both the pickle and unpickle methods.

Want to experiment with it? :)
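
For concreteness, a hand-written sketch of the shape such generated code could take for TestCaseClass2 (illustrative only, not the actual output of PicklerMaterializersImpl, and it leaves out details such as the reference/identity handling the real generated code performs):

import boopickle.Default._

// Illustrative, hand-written version of what a generated pickler could look
// like: the field pickler is resolved once into an object-private val and
// shared by both methods, instead of being materialized separately inside
// pickle and unpickle.
object TestCaseClass2Pickler extends Pickler[TestCaseClass2] {
  private val xPickler: Pickler[TestCaseClass1] = implicitly[Pickler[TestCaseClass1]]

  override def pickle(obj: TestCaseClass2)(implicit state: boopickle.PickleState): Unit =
    xPickler.pickle(obj.x)

  override def unpickle(implicit state: boopickle.UnpickleState): TestCaseClass2 =
    TestCaseClass2(xPickler.unpickle)
}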

@FlorianKirmaier
Contributor Author

I don't have time for it at the moment, but maybe in the future.

slandelle added a commit to gatling/gatling that referenced this issue Mar 13, 2019
Motivation:

Automatic case class derivation is bad practice because generation would happen for every call site.
Moreover, there's a bug in BooPickle that causes generation to be exponential in case class hierarchies, see suzaku-io/boopickle#108.

Modification:

Explicitly create picklers.

Result:

Smaller gatling jars