Hello:

Apart from readPDF in the tm package, you can use the pdf to text converter 
command in linux, which is "pdftotext".  Say "file.pdf" is your file, from R 
you'd use:

system("pdftotext file.pdf -layout")

This invokes the pdftotext command from within R and creates a file called 
"file.txt" with the converted pdf, which you'd have to read into R.  The 
-layout option is so the conversion to text is as similar as possible to the 
original layout of the pdf file.

Regards,

jose loreto romero palma
        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to