[dev-servo] Leaf set construction is probably unnecessary

Patrick Walton Sun, 09 Feb 2014 10:25:37 -0800

(Copy and paste from GitHub issue #1650 because I figure mailing listdiscussion is potentially more fruitful than GitHub issues for designdiscussions.)

I realized this morning that we can probably avoid the need for a leafset by intertwining selector matching and flow construction, and flowconstruction and intrinsic-width-bubbling, to some degree. This issimilar to what Gecko and WebKit do, but potentially somewhat cleanerbecause they are still separate functions and the implementations arestrictly separate (no bouncing back and forth to handle specialincremental-reflow cases); the parallel driver just knows how to invokethem. This works thanks to the heterogeneous nature of the `WorkQueue`:it can accept heterogeneous tasks and can run them all in parallel.

* Once the selectors have been matched for a leaf node, we canimmediately start constructing its flows. Just call `construct_flows`once the node has been matched.* Trickier, but also likely possible: Once flows have been constructedfor a leaf node, immediately call `bubble_widths` on it. This worksbecause we always know when a flow is going to be a leaf sincee579daefc2956a2eb151588b628c51342de236d0.* Once `assign_widths` has been called on a leaf, immediately startassigning its heights via `assign_heights`.

Assuming this works out, all parallel traversals will start from theroot and go down, eliminating the need for a leaf set. We will probablystill want a "backdoor" that sequentially computes bubble-widths for tworeasons: (1) during incremental reflow, min/pref widths may have beeninvalidated without invalidating the flow; (2) it's easier to benchmarkstyle recalc against Gecko and WebKit when it's not intertwined withintrinsic width calculation.


This would have numerous benefits:

1. Leaf set construction is expensive. On Wikipedia it's 16% of selectormatching time on 4 cores. For comparison, that's difference betweengetting a 2.4x speedup for selector matching and getting a 2.9x speedupon 4 cores.2. Eliminates one or two parallel traversals, reducing overhead. (Inparticular the warmup phase will go to essentially zero.)3. Eliminates the synchronization point between selector matching andflow construction, allowing better multicore utilization.4. Eliminates the necessity of ensuring that DOM nodes in the leaf setare alive which will be a bit of a pain when we start doing incrementalreflow.

5. Better memory usage since the leaf set data structures will go away.

Patrick
_______________________________________________
dev-servo mailing list
dev-servo@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-servo

[dev-servo] Leaf set construction is probably unnecessary

Reply via email to