Re: [R] Extract lines from pdf files

2019-11-20 Thread Bert Gunter
I think you are more likely to get a helpful answer if you give a minimal example of what your lines look like. I certainly don't have a clue, though maybe someone else will. Cheers, Bert On Wed, Nov 20, 2019 at 12:21 PM Thomas Subia via R-help < r-help@r-project.org> wrote: > Thanks all for th

Re: [R] Extract lines from pdf files

2019-11-20 Thread Thomas Subia via R-help
Thanks all for the help. I appreciate the feedback I've developed another method to extract my desired data from multiple pdfs in a directory. # Combine all pdfs to a combined pdf files <- list.files(pattern = "pdf$") pdf_combine(files, output = "joined.pdf") # creates a text file from joined.pd

Re: [R] Extract lines from pdf files

2019-11-20 Thread Bert Gunter
t; For a "it works" method I used >> >> start_time <- Sys.time() >> >> insert code of interest >> >> end_time <- Sys.time() >> end_time - start_time >> >> -----Original Message----- >> From: R-help On Behalf

Re: [R] Extract lines from pdf files

2019-11-20 Thread Bert Gunter
insert code of interest > > end_time <- Sys.time() > end_time - start_time > > -Original Message- > From: R-help On Behalf Of Eric Berger > Sent: Wednesday, November 20, 2019 9:58 AM > To: Jeff Newmiller > Cc: Thomas Subia ; Thomas Subia via R-help >

Re: [R] Extract lines from pdf files

2019-11-20 Thread Jeff Reichman
20, 2019 9:58 AM To: Jeff Newmiller Cc: Thomas Subia ; Thomas Subia via R-help Subject: Re: [R] Extract lines from pdf files Hi Thomas, As Jeff wrote, your HTML email is difficult to read. This is a "plain text" forum. As for "pointers", here is one suggestion. Since you

Re: [R] Extract lines from pdf files

2019-11-20 Thread Eric Berger
Hi Thomas, As Jeff wrote, your HTML email is difficult to read. This is a "plain text" forum. As for "pointers", here is one suggestion. Since you write that you can do the necessary actions with a specific file, try to write a function that carries out those actions for that same file. Except when

Re: [R] Extract lines from pdf files

2019-11-19 Thread Jeff Newmiller
Please don't spam the mailing list. Especially with HTML format messages. See the Posting Guide. PDF is designed to present data graphically. It is literally possible to place every character in the page in random order and still achieve this visual readability while practically making it nearl

[R] Extract lines from pdf files

2019-11-19 Thread Thomas Subia via R-help
Colleagues,   I can extract specific data from lines in a pdf using:   library(pdftools) pdf_text("10619.pdf") txt <- pdf_text(".pdf") write.table(txt,file="mydata.txt") con <- file('mydata.txt') open(con) serial <- read.table(con,skip=5,nrow=1) #Extract[3]flatness <- read.table(con,sk