Re: [dev-servo] HTML parser-related datatypes

2012-10-11 Thread David Herman
On Oct 11, 2012, at 2:01 PM, Boris Zbarsky wrote: > On 10/11/12 4:09 PM, Zack Weinberg wrote: >> I'm not seeing why the JS engine has to use any particular >> representation internally just because JS's exposed semantics are >> defined in terms of UCS-2. > > Well, because it's simpler and becaus

Re: [dev-servo] HTML parser-related datatypes

2012-10-11 Thread Boris Zbarsky
On 10/11/12 4:09 PM, Zack Weinberg wrote: I'm not seeing why the JS engine has to use any particular representation internally just because JS's exposed semantics are defined in terms of UCS-2. Well, because it's simpler and because it makes charAt() fast? (This presumes that part of Servo in

Re: [dev-servo] HTML parser-related datatypes

2012-10-11 Thread Zack Weinberg
On 2012-10-10 9:03 PM, Johnny Stenback wrote: Hey Henri, On 10/10/2012 5:51 AM, Henri Sivonen wrote: I am researching/prototyping a translation of the same HTML parser we use in Gecko into Rust for use in Servo. Should the HTML parser in Servo operate on UTF-8, UTF-16 or CESU-8? What will the D