I am scraping data from a web page using XML (excellent package BTW - that's 
scraping data the easy way!).

So far, I've got the code:

tables <- readHTMLTable(theurl)
rhf <- tables$tabResHistFull
div1 <- rhf[which(rhf$V1=="Div ps"),]
div1

which is giving me the result:
       V1 V2    V3    V4    V5    V6    V7          V8    V9   V10   V11   
V12   V13   V14  V15
15 Div ps  p 32.31 35.64 40.17 42.55 45.13 46.36+17.22 51.11 55.72 70.78 71.72 
76.99 82.20 <NA>

I don't know a priori how many columns are in the table.

I want to be able to extract the numbers to the best of my ability, and check 
to see if they form a monotonic sequence.


How do I do this?

If I type:
   div2 <- div1[is.numeric(div1)]
   div2
I get
   data frame with 0 columns and 1 rows

OTOH, if I type
   div2 <- as.numeric(div1)
   div2
I get
   [1]  9  4 15 16 18 20 17 21 21 18 24 24  8  8 NA
Huh??

What do I need to do? The data in V8 in this particular instance is 
aberrational - I don't care how it gets treated; although I really want the 
number 46.36 out of it. Any sane solution will do, though.
        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to