On 16/01/2008, martin f krafft <[EMAIL PROTECTED]> wrote: > Even though Tesseract does a good job at reading the docs I scanned, > gscan2pdf seems to save the PDF as image, meaning that neither > search nor copy-paste work on the final product. This makes me doubt > the usefulness of having such a nice integration of OCR in > gscan2pdf, but I assume it's unintended.
The OCR output is embedded as plain text behind the image. This means that Beagle, etc., can index it. -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]