Re: proofing searchable pdf files

Doug Thu, 30 Oct 2014 18:58:09 -0700

On 10/30/2014 08:47 PM, Gary Roach wrote:

Hi all,


Problem:
     I am working on an archiving project and wish to archive documents to 
searchable pdf files but can't seem to figure out how to proof read and correct 
the text overlay. Any suggestions.

System:
     Debian Wheezy
     Intel i5-750 processor
     HP Officejet Pro 8600 wireless all in one printer/fax/scanner
     gscan2pdf software with Tesseract ocr
     300 to 600 dpi scans.

Tesseract seems to do a really great job but I have no good way of proving this 
or correcting any mistakes. Some of the documents are 100 years old and may not 
be in such great shape. I can always retype everything but would like to avoid 
this, as much as possible, for obvious reasons.

Gary R.

Not sure I understand what you're doing, but:
Are you trying to run in two layers, as it were, one of which is the original 
document, and the other is your commentary/correction/whatever?
I don't know much about pdf editing, and nothing about whether it can support 
layers, but I know somethings that can: AutoCAD LT, or DraftSight.
You could scan the document into the computer, and import it as a layer into, 
say, DraftSight. It would not be modifiable, but it could be a template.
Then in an active layer, you could type text over it or next to it or 
something.  You might have to do a bit of figuring out, but I think
maybe this might work.

--doug


--

To UNSUBSCRIBE, email to debian-user-requ...@lists.debian.orgwith a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org

Archive: https://lists.debian.org/5452ec79.70...@optonline.net

Re: proofing searchable pdf files

Reply via email to