Re: [R] trouble for parsing HTML files

2012-03-23 Thread Milan Bouchet-Valat
On Friday 23 March 2012 at 08:10 +0100, Julien Velcin wrote:
> Here it is:
>
> R version 2.14.2 (2012-02-29)
> Platform: i386-apple-darwin9.8.0/i386 (32-bit)
>
> locale:
> [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
I guess the OS uses a French locale? Maybe the discrepan
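To follow up on the locale question, here is a minimal sketch that prints what R itself reports about the locale it is running under. It uses base R only. Whether the locale has anything to do with the parsing error is not established in this thread.

```r
# Minimal sketch: inspect the locale of the current R session (base R only).
full_locale  <- Sys.getlocale()            # all categories in one string
ctype        <- Sys.getlocale("LC_CTYPE")  # category that governs character encoding
utf8_session <- l10n_info()[["UTF-8"]]     # TRUE if R is running in a UTF-8 locale

cat("Full locale: ", full_locale, "\n")
cat("LC_CTYPE:    ", ctype, "\n")
cat("UTF-8 locale:", utf8_session, "\n")
```

Comparing this output between a machine where the code works and one where it fails would show whether the sessions really use the same locale.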

Re: [R] trouble for parsing HTML files

2012-03-23 Thread R. Michael Weylandt
No idea at this point; perhaps get in touch with the maintainer (email given by maintainer("XML")) and see if he has any insights. Sorry I can't get this one for you: without being able to reproduce it (the odd error message with the euro sign) myself, I'm pretty stumped. If you do get to a soluti
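The maintainer lookup mentioned here can be sketched as follows. `maintainer()` is the utils function named in the message; `lookup_maintainer()` is a hypothetical wrapper added only so that the sketch also behaves sensibly when the package is not installed.

```r
# Minimal sketch: look up a package maintainer with utils::maintainer().
# maintainer() returns NA when the package is not installed.
# lookup_maintainer() is a hypothetical helper, not part of any package.
lookup_maintainer <- function(pkg) {
  m <- suppressWarnings(utils::maintainer(pkg))
  if (length(m) != 1 || is.na(m)) {
    paste0("package '", pkg, "' is not installed or lists no maintainer")
  } else {
    m
  }
}

cat(lookup_maintainer("XML"), "\n")
```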

Re: [R] trouble for parsing HTML files

2012-03-23 Thread Julien Velcin
Thank you for your comments Michael, but:
i) The website is generating different content for you than for Milan and me [wild guess]
It'd be highly surprising. If this is the case, I don't know what I can do to fix it.
ii) Something in the OS 10.5 -> 10.6 difference [process of eliminati

Re: [R] trouble for parsing HTML files

2012-03-23 Thread R. Michael Weylandt
I just tried it on R 2.14.1 and R 2.15.0 RC (r58802), and both worked with XML 3.9-4 on both 32-bit and 64-bit R on my Mac OS X 10.6.8 with the same locale setting, so I can only guess it's one of three things:
i) The website is generating different content for you than for Milan and me [wild guess]
ii
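One way to test the first guess (different content served to different people) is for each person to save the page and compare a checksum. The sketch below is only an illustration: `page_fingerprint()` is a hypothetical helper, and the demonstration runs on local temporary files rather than on the real page.

```r
# Minimal sketch: fingerprint a saved copy of a page so that several people
# can check whether the server sent them the same bytes.
# page_fingerprint() is a hypothetical helper, not part of any package.
page_fingerprint <- function(path) {
  list(md5 = unname(tools::md5sum(path)), bytes = file.size(path))
}

# Offline demonstration: two files with identical content.
f1 <- tempfile(fileext = ".html")
f2 <- tempfile(fileext = ".html")
writeLines("<html><body>same content</body></html>", f1)
writeLines("<html><body>same content</body></html>", f2)
fp1 <- page_fingerprint(f1)
fp2 <- page_fingerprint(f2)
cat("Same content:", identical(fp1$md5, fp2$md5), "\n")

# With a real page (needs network access), assuming `url` holds its address:
#   dest <- tempfile(fileext = ".html")
#   download.file(url, dest, quiet = TRUE)
#   page_fingerprint(dest)
```

Pages with dynamic content may legitimately differ between requests, so a mismatch alone would not prove that the difference causes the error.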

Re: [R] trouble for parsing HTML files

2012-03-23 Thread Julien Velcin
Here it is:
R version 2.14.2 (2012-02-29)
Platform: i386-apple-darwin9.8.0/i386 (32-bit)
locale:
[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] XML_3.9-4

Re: [R] trouble for parsing HTML files

2012-03-22 Thread R. Michael Weylandt
Please give sessionInfo() so we can know your version of XML.
Michael
On Thu, Mar 22, 2012 at 2:04 PM, Julien Velcin wrote:
> I use mac OS 10.5.8 with this version of R:
>
> R version 2.14.1 (2011-12-22)
> Platform: i386-apple-darwin9.8.0/i386 (32-bit)
>
> I've tried the command "RSiteSearch", b
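The information requested here can be gathered with a few lines of base R. This is a minimal sketch; the check for whether XML is installed is an addition so that the snippet also runs on a machine without the package.

```r
# Minimal sketch: collect the details usually requested on R-help.
info <- sessionInfo()
print(info)

# Report the version of one package, if it is installed.
# "XML" is the package discussed in this thread.
pkg <- "XML"
if (requireNamespace(pkg, quietly = TRUE)) {
  cat(pkg, "version:", as.character(packageVersion(pkg)), "\n")
} else {
  cat(pkg, "is not installed\n")
}
```

Note that sessionInfo() lists a package under "other attached packages" only after it has been attached with library().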

Re: [R] trouble for parsing HTML files

2012-03-22 Thread Julien Velcin
I use Mac OS 10.5.8 with this version of R:
R version 2.14.1 (2011-12-22)
Platform: i386-apple-darwin9.8.0/i386 (32-bit)
I've tried the command "RSiteSearch", but with no result. BTW, as a reminder, the code I posted works for some websites.
Julien
2012/3/22, Milan Bouchet-Valat : > Le je

Re: [R] trouble for parsing HTML files

2012-03-22 Thread Julien Velcin
Sorry for the multiple messages.
--
I use Mac OS 10.5.8 with this version of R:
R version 2.14.1 (2011-12-22)
Platform: i386-apple-darwin9.8.0/i386 (32-bit)
I've tried the command "RSiteSearch", but with no result. BTW, as a reminder, the code I posted works for some websites.
Julien
_

Re: [R] trouble for parsing HTML files

2012-03-22 Thread Milan Bouchet-Valat
On Thursday 22 March 2012 at 17:20 +0100, Julien Velcin wrote:
> Hi all,
>
> Using the XML package, I'm not able to parse some html webpages. Here
> is my code and the error message:
>
> library("XML")
> url <- "http://www.huffingtonpost.com/social/GraniteSkyline?action=fans";
> doc <- htmlParse(

[R] trouble for parsing HTML files

2012-03-22 Thread Julien Velcin
Hi all,
Using the XML package, I'm not able to parse some HTML web pages. Here is my code and the error message:
library("XML")
url <- "http://www.huffingtonpost.com/social/GraniteSkyline?action=fans";
doc <- htmlParse(url)
Error: Namespace prefix ꛀ of attribute (null) is not defined
I've
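A diagnostic that may be worth trying, offered as a sketch and not as a confirmed fix: read the page as text, pass it to htmlParse() with asText = TRUE and an explicit encoding, and capture the parser's message instead of stopping. `try_parse_html()` is a hypothetical helper, the choice of UTF-8 is an assumption, and the demonstration uses a tiny inline document so it runs offline.

```r
# Minimal sketch, not a confirmed fix: parse HTML supplied as text, with an
# explicit encoding, and report the parser's message instead of stopping.
# try_parse_html() is a hypothetical helper; it needs the XML package.
try_parse_html <- function(html_text, encoding = "UTF-8") {
  if (!requireNamespace("XML", quietly = TRUE)) {
    return(NULL)  # XML is not installed, so there is nothing to try
  }
  tryCatch(
    XML::htmlParse(paste(html_text, collapse = "\n"),
                   asText = TRUE, encoding = encoding),
    error = function(e) {
      message("Parsing failed: ", conditionMessage(e))
      NULL
    }
  )
}

# Offline demonstration on a tiny inline document.
doc <- try_parse_html("<html><body><p>hello</p></body></html>")
if (!is.null(doc)) print(XML::xpathSApply(doc, "//p", XML::xmlValue))

# With the real page (needs network access), assuming `url` as defined above:
#   page_lines <- readLines(url, warn = FALSE, encoding = "UTF-8")
#   doc <- try_parse_html(page_lines)
```

If parsing the text succeeds where htmlParse(url) fails, that would point toward how the content is fetched or decoded and away from the markup itself.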