[dev-servo] New DOM implementation

Patrick Walton Sat, 16 Feb 2013 10:55:12 -0800

I have started work a new DOM implementation in the "dom" branch. It iscurrently building except for an internal compiler error, the fix forwhich is currently being upstreamed to Rust.

This new DOM implementation is based on structs instead of enums. Thebase type is `AbstractNode`, which is an opaque pointer with manyaccessors that let you query what type of node it is and downcastappropriately. (I'm thinking of changing `AbstractNode` to `Node` andchanging the concrete `Node` to something else, though, because the word`Abstract` is littering the codebase.) When structs are allowed toinherit from other structs in Rust, then the implementation will becomea bit simpler and will use less unsafe code internally.

The advantages of the new implementation over the previous one are (a)several layers of indirection are gone; (b) the copy-on-write interfaceis dramatically simplified, eliminating the need for answers to annoyingquestions like how to collect dead handles; (c) DOM nodes now only takeup as much memory as they need to; (d) we can make borrowing of nodessound, because the new interface is amenable to the dynamic checksdescribed in Niko's "Imagine Never Hearing the Phrase 'Aliasable,Mutable' Again" blog post.

This new DOM implementation currently exposes an unsafe interface in afew ways:

(1) When creating a node (converting from the base type toAbstractNode), there is no check performed to ensure that the thing youpassed in actually is a Node, and moreover that it is the node that youclaimed it was. This can be fixed by providing safe wrappers around nodeconstructors and moving the low-level node constructors into the trustedcomputing base (hereafter TCB). These constructor primitives are verysimple operations, so the TCB should remain easy to audit for security.It can also be fixed to some degree by adding struct inheritance toRust, as I plan to propose. But note that node construction must remainpart of the TCB to some degree, because nodes are owned by theSpiderMonkey garbage collector and not the Rust garbage collector.

(2) Borrowing of nodes (i.e. downcasting from AbstractNode to a concreteNode subclass) currently does not perform the dynamic checks needed forsafety. What this means is that it is possible to cause segfaults withcertain combinations of mutations and pointer borrowing. The fix willsimply be to add these checks. Once these checks are added to the TCB,segfaults should not be possible in the safe Servo code.

(3) There is nothing currently preventing Rust code in the script taskfrom accessing (and racing on) layout data structures, and layout fromaccessing dirty nodes. I believe this can be fixed with phantom types:layout will see a type that prevents access to the dirty parts of nodes,and script will see a type that prevents direct access to layout info.

I believe there are solutions to each of these problems, and of coursefixing them is a high priority for the project. But note that, as Idescribed before, there will always be some unsafe code relating to nodememory management as part of the TCB, because SpiderMonkey's garbagecollector manages the nodes. The goals are (a) to use as little unsafecode as possible, and, most importantly, (b) to prevent the unsafenessfrom leaking out into script and layout code.

Finally, note that the copy-on-write scheme is not yet implemented;right now script will just block on layout. Fixing this is a highpriority as well.


Patrick
_______________________________________________
dev-servo mailing list
dev-servo@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-servo

[dev-servo] New DOM implementation

Reply via email to