TL;DR
Fixing serialisation issues is a big, risky effort and should mostly done by using custom serialisation rather than the default mechanism.
Intro
Java Object serialization API provides a framework for encoding objects as byte streams and reconstructing objects from their byte-stream encodings.
Problem
MGNLUI-351 - Getting issue details... STATUS
As explained in the issue above, Magnolia is currently unable to serialise and deserialise its own Admincentral UI. This may be a problem especially in clustered environments where a user session may need to be replicated across multiple Java VMs.
Cause
In Magnolia's case the root object being serialised when a servlet container shuts down is VaadinSession
which is a Serializable
class. Most Vaadin classes are serializable, including UI
from which our AdmincentralUI
inherits.
Goals
We want to detect and remove all serialisation issues in our code.
Furthermore, we want to find a way to detect serialization issues during Magnolia build so that they are caught and tackled early.
Risks of Serialisation
Before proceeding with a possible solution I came up during my investigations and trials I'd like to point out some of the not so negligible disadvantages Serialization entails.
The points are mostly taken from Effective Java, 2nd Edition by Joshua Bloch which begins his chapter on Serialization with this significant warning: implement Serializable judiciously.
- Implementing Serializable decreases the flexibility to change a class’s implementation once it has been released
- When a class implements Serializable, its byte-stream encoding (or serialized form) becomes part of its exported API. Once you distribute a class widely, you are generally required to support the serialized form forever, just as you are required to support all other parts of the exported API. If you do not make the effort to design a custom serialised form, but merely accept the default, the serialised form will forever be tied to the class’s original internal representation.
- Increases the testing burden associated with releasing a new version of a class
- When a serializable class is revised, it is important to check that it is possible to serialise an instance in the new release and deserialise it in old releases, and vice versa. [...] These tests cannot be constructed automatically because, in addition to binary compatibility, you must test for semantic compatibility. In other words, you must ensure both that the serialization-deserialization process succeeds and that it results in a faithful replica of the original object.
- When a serializable class is revised, it is important to check that it is possible to serialise an instance in the new release and deserialise it in old releases, and vice versa. [...] These tests cannot be constructed automatically because, in addition to binary compatibility, you must test for semantic compatibility. In other words, you must ensure both that the serialization-deserialization process succeeds and that it results in a faithful replica of the original object.
- It can consume excessive space
- Especially using the default serialization mechanism you might end up with a huge object graph which painstakingly and recursively mirror every field and/or entry in a Collection.
- It can consume excessive time
- The serialization logic has no knowledge of the topology of the object graph, so it must go through an expensive graph traversal.
Detecting and removing serialization issues
Suggested approach
After several attempts (which will be mentioned later on), this is the approach I would suggest
- start tomcat with this JVM parameter
-Dsun.io.serialization.extendedDebugInfo=true
- login into Magnolia
- stop and restart Magnolia
- in the logs something like the following stack trace will show up
- With this information try to fix the "offending" class.
- In the case above, an inner class of
ResettableEventBus
seems to be not serialisable. It is interesting to notice how the root object being serialised is aVaadinSession
. Thanks to theextendedDebugInfo
you can follow up the whole serialisation path in the object graph until it throws ajava.io.NotSerializableException
Downsides
Unfortunately the report provided by extendedDebugInfo
is not a full one, meaning that it stops at the first error encountered. This means your only option is to fix and start over the process outlined above until no more serialization exceptions show up.
Failed attempts
Before giving up to the laborious manual process above, I tried several options, including some fancy recursive scripts through the whole object graph using reflections and other magic. But to no avail. You can read about them in my comment to JIRA issue.
Use default serialization mechanism sparingly
In the first attempt I tried, I basically made each problematic class implement Serializable and let Java do the job. This soon turned out into having a gigantic object graph to be serialised where even Magnolia's core classes came into the picture. Consider the following example.
By simply implementing Serializable
you get rid of the NotSerializableException
for ShellImpl
, still you now need to do the same for all its fields and the fields they're made of internally and so on and so forth. AppControllerImpl
, for one, has a reference to info.magnolia.module.ModuleRegistry
which brings in core classes and soon you get a big, unmanageable mess of classes throwing NotSerializableException.
The most reasonable way to proceed so far seems to use custom serialisation by making some or all fields transient and then reconstructing the object with the special method private void readObject(ObjectInputStream stream) throws IOException, ClassNotFoundException
For instance, the following could be a custom serialization for ShellImpl
At any rate, when adopting custom serialisation one should be aware of some pitfalls as outlined by Effective Java, Item75: Consider using a custom serialised form.
Finally doing all this for Magnolia UI is a quite a big effort, considering the amount of classes involved and the cumbersome run and fix process outlined above.
Finding a way to detect serialization issues during Magnolia build
In this case I came up with a very simple Groovy script
import org.apache.commons.lang3.SerializationUtils import com.vaadin.server.VaadinSession /** * This script will attempt to serialise a VaadinSession, that is the root object which is * usually serialised by a Servlet container when shutting down. * It will throw a java.io.NotSerializableException if something goes wrong in the serialization process at any point of the object graph. * The Servlet container should be run with the following JVM option -Dsun.io.serialization.extendedDebugInfo=true * in order to have a useful debugging output in case of error. */ vaadinSession = VaadinSession.getCurrent() SerializationUtils.serialize(vaadinSession)
This works fine when run through the Magnolia Groovy Console and will basically output the same stack trace by Tomcat. I thought it could be run as an integration test, like we do for our crawler.groovy
. However this does not work, the Groovy Maven plugin being basically disconnected from the Magnolia test instance (two different threads).
Proposal
Find a way to run the script above as an integration test against a real Magnolia instance. One idea could be registering a GroovyTestServlet which will get passed the script source and executes it. Then we assert that the output does not contain java.io.NotSerializableException.
I managed to get a working test https://git.magnolia-cms.com/projects/PLATFORM/repos/ce/commits/482511eaf4453c0cfc8984bf238706d09b7d004d but this is testing a sort of mock object which only has some of the classes which actually end up in a real VaadinSession
. Getting hold of the real thing seems to be impossible outside of the Vaadin's thread running the actual AdmicentralUI.
The only way I see to check for serialization issues against an actual Magnolia instance seems to be
- startup Magnolia
- login
- open some apps so to create a sort of real scenario (if you just login without opening apps, several UI classes won't be part of the session)
- stop Magnolia
- restart Magnolia
- scan logs for java.io.NotSerializableException
Architecture meeting decisions
https://wiki.magnolia-cms.com/pages/viewpage.action?spaceKey=ARCHI&title=2016-06-23+Meeting+notes
- Disable session persistence in Tomcat MGNLCE-46 - Getting issue details... STATUS
- Investigate other containers DOCU-751 - Getting issue details... STATUS
- make vaadin-session transient in http-session
no dice: I tried to wrap {{VaadinSession}} (see code below) and make it transient but this fails with