On Wed, Nov 14, 2001 at 09:34:24AM -0800, Jeffrey W. Baker wrote: > > I've just received a grant for a project that will involve scanning and > > storing a substantial number (e.g., around 3000) of short documents. These > > documents will be analyzed as text, which means I'll have to use OCR > > software as well as a scanner with an automatic document feed.
[...] > There is an OCR package from Mentalix called Pixel!FX. It supports only > SCSI scanners, and I believe it is very expensive. Before spending lots of money, you may want to check `gocr' (apt-get install gocr) if it matches your needs. I found it suitable enough for scanning short (1-2p.) documents (and I assume it'd do the job for longer ones as well) and since it has a console interface, its usage can be easily automated (and even customized, thanks to the libgocr library). Regards, -- BALI, Andra's GPG keyID: 78560E1C [EMAIL PROTECTED] [EMAIL PROTECTED] [EMAIL PROTECTED]