[dev-servo] Crazy idea: CSS selector JITting at parse time

Patrick Walton Sat, 29 Mar 2014 17:24:21 -0700

Hi everyone,

I've been discussing this idea with a few people in person over the past week, and nobody told me it was completely insane. ;) So I thought I'd send this idea around.

By now many people have heard of WebKit's CSS JIT. It's a surprisingly small amount of code. One of the issues that they cite in their blog post [1] is compilation time: the compilation has to be very fast in order to avoid nullifying the gains. With that in mind, I had an idea: what if the "AST" that the CSS parser generated was actually machine code?

The AST for the CSS grammar is extremely small: 176 lines of Rust for the whole thing [2], and the selector grammar is just a fraction of that. This is because selectors describe a very simple language, even with all the extensions that have been added over the years. So one could write a simple template-emitting compiler that takes CSS selectors and writes a small template of machine code. (This is in fact the approach that WebKit uses.) For example, I could imagine that the JIT code for the selector ".foo #a" might look like (assuming that "#a" has already been matched in the ID hash):


        ; assume that node is in ebx
    l1: mov ebx,[ebx+offsetof(parentNode)]
        test ebx,ebx
        jz nomatch
        mov edx,[ebx+offsetof(class)]
        xor ecx,ecx
    l2: cmp [edx+ecx*4],"foo"
        je match
        inc ecx
        cmp ecx,[ebx+offsetof(classcount)]
        jne l2
        jmp l1

This is just 29 bytes of code when assembled. This is likely larger than the equivalent `nsRuleNode`, but `nsRuleNode` instances are not a large amount of the heap in my cursory glances at `about:memory`.
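
For readers who would rather not read x86, here is a rough Rust rendering of what that template computes. It is only a sketch: the `Node` struct and the `has_ancestor_with_class` name are made up for illustration and are not Servo's or Gecko's actual types. As in the assembly, it assumes `#a` has already been matched through the ID hash, so only the `.foo` ancestor check remains.

```rust
// Hypothetical, simplified DOM node: only the fields the assembly touches.
struct Node<'a> {
    parent: Option<&'a Node<'a>>,
    classes: Vec<&'static str>,
}

/// Returns true if some ancestor of `node` carries the class `class`.
/// The rightmost compound selector (`#a`) is assumed to be matched already.
fn has_ancestor_with_class(node: &Node, class: &str) -> bool {
    // l1: mov ebx,[ebx+offsetof(parentNode)]
    let mut current = node.parent;
    // test ebx,ebx / jz nomatch
    while let Some(ancestor) = current {
        // l2: scan the ancestor's class list for a match
        if ancestor.classes.iter().any(|c| *c == class) {
            return true; // je match
        }
        // jmp l1: keep walking up the tree
        current = ancestor.parent;
    }
    false // fell off the root: nomatch
}
```

One small difference from the assembly: an ancestor with an empty class list is simply skipped here, because the iterator checks the length before reading any element.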

The major question is: If you don't have an AST, then how do you implement the CSSOM and style debugging? Here's the trick: Since CSS selectors are so simple compared to, say, the output of a JS JIT, it should be possible to *disassemble* the code by simply pattern-matching. Since the compiler is nothing more than a very simple template processor (and it has to be, for speed reasons) it should be possible to reverse the output and go from JIT code back to the CSS selectors. This would be done by performing some very simple analysis (not full disassembly) on the machine code. For instance, on the above selector, it might do:


        ; start with `#a` from the rule hash

    l1: mov ebx,[ebx+offsetof(parentNode)]
        test ebx,ebx
        jz nomatch
        ; found a test for parentNode, so we must be a descendant or
        ; child selector (depending on whether we see a backwards jump)

        mov edx,[ebx+offsetof(class)]
        xor ecx,ecx
    l2: cmp [edx+ecx*4],"foo"
        je match
        inc ecx
        cmp ecx,[ebx+offsetof(classcount)]
        jne l2
        ; found a test for the "foo" class

        jmp l1
        ; found a backwards jump, so we know we're a descendant selector
        ; result: `.foo #a`

So you wouldn't need a full disassembler, just a simple pattern matcher.
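
To make the round trip concrete, here is a minimal sketch of a template emitter and the matching pattern-based "disassembler". It works on abstract template ops rather than real machine-code bytes, and every name in it (`Op`, `Step`, `Combinator`, `compile`, `decompile`, `render`) is invented for this illustration. It also assumes each step tests a single class, which is far less than real selectors need.

```rust
#[derive(Debug, Clone, PartialEq)]
enum Combinator {
    Descendant, // ".foo #a"
    Child,      // ".foo > #a"
}

// One step to the left of the already-matched key selector.
#[derive(Debug, Clone, PartialEq)]
struct Step {
    combinator: Combinator,
    class: String,
}

// Stand-ins for the fixed machine-code templates.
#[derive(Debug, Clone, PartialEq)]
enum Op {
    LoadParent,        // mov ebx,[ebx+offsetof(parentNode)] / test / jz nomatch
    TestClass(String), // the class-scanning loop
    JumpBack,          // jmp l1: retry on the next ancestor (descendant)
    Fail,              // no retry: the parent itself had to match (child)
}

// "Compile": emit one fixed three-op template per step.
fn compile(steps: &[Step]) -> Vec<Op> {
    let mut ops = Vec::new();
    for step in steps {
        ops.push(Op::LoadParent);
        ops.push(Op::TestClass(step.class.clone()));
        ops.push(match step.combinator {
            Combinator::Descendant => Op::JumpBack,
            Combinator::Child => Op::Fail,
        });
    }
    ops
}

// "Disassemble": recognise the same templates; anything else is rejected.
fn decompile(ops: &[Op]) -> Option<Vec<Step>> {
    let mut steps = Vec::new();
    for chunk in ops.chunks(3) {
        let (combinator, class) = match chunk {
            [Op::LoadParent, Op::TestClass(c), Op::JumpBack] => (Combinator::Descendant, c),
            [Op::LoadParent, Op::TestClass(c), Op::Fail] => (Combinator::Child, c),
            _ => return None,
        };
        steps.push(Step { combinator, class: class.clone() });
    }
    Some(steps)
}

// Rebuild selector text; `key` is the compound found via the rule hash.
fn render(key: &str, steps: &[Step]) -> String {
    let mut out = key.to_string();
    for step in steps {
        let sep = match step.combinator {
            Combinator::Descendant => " ",
            Combinator::Child => " > ",
        };
        out = format!(".{}{}{}", step.class, sep, out);
    }
    out
}
```

Compiling a single descendant step for class `foo`, decompiling the result, and rendering it with the key `#a` gives back `.foo #a`. The property that matters is that `decompile` only has to know the handful of shapes that `compile` can emit.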

Note that one doesn't have to JIT everything: for more complex tests such as "attribute contains substring", we can just call out to functions implemented in Rust in the JIT code. The disassembler can just pattern match on those function addresses and arguments. This also increases the maintainability of this approach in the presence of new stuff that the CSS working groups add to the spec: we can implement all new features as functions at first, and only JIT them if they become widely used.
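
A sketch of how that address matching could work, assuming a single static table of helpers that both the emitter and the disassembler consult. The table, the helper functions, and the names `call_target` and `operator_for_target` are all hypothetical. Note that Rust does not promise that function addresses are unique in general, so a real implementation would want to record each address once (as the table does here) rather than re-deriving it in several places.

```rust
// Signature shared by the out-of-line attribute tests.
type AttrTest = fn(&str, &str) -> bool;

fn attr_contains(value: &str, needle: &str) -> bool {
    value.contains(needle)
}

fn attr_starts_with(value: &str, needle: &str) -> bool {
    value.starts_with(needle)
}

struct Helper {
    css_operator: &'static str, // how the test is spelled in a selector
    func: AttrTest,             // what the JIT code would call
}

// Single source of truth for helper addresses.
static HELPERS: [Helper; 2] = [
    Helper { css_operator: "*=", func: attr_contains },
    Helper { css_operator: "^=", func: attr_starts_with },
];

// Emitter side: the address to place in the generated call instruction.
fn call_target(css_operator: &str) -> Option<usize> {
    HELPERS
        .iter()
        .find(|h| h.css_operator == css_operator)
        .map(|h| h.func as usize)
}

// Disassembler side: map a call target back to the selector syntax.
fn operator_for_target(addr: usize) -> Option<&'static str> {
    HELPERS
        .iter()
        .find(|h| h.func as usize == addr)
        .map(|h| h.css_operator)
}
```

Adding a new spec feature then means adding one function and one table row; the disassembler picks it up without learning any new instruction pattern.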


Thoughts?

Patrick

[1]: https://www.webkit.org/blog/3271/webkit-css-selector-jit-compiler/

[2]: https://github.com/mozilla-servo/rust-cssparser/blob/master/ast.rs
