Hello: Apart from readPDF in the tm package, you can use the pdf to text converter command in linux, which is "pdftotext". Say "file.pdf" is your file, from R you'd use:
system("pdftotext file.pdf -layout") This invokes the pdftotext command from within R and creates a file called "file.txt" with the converted pdf, which you'd have to read into R. The -layout option is so the conversion to text is as similar as possible to the original layout of the pdf file. Regards, jose loreto romero palma [[alternative HTML version deleted]]
______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.