Re: tesseract: ocr that works

2008-12-29 Thread Rainer Kluge
Hugo Vanwoerkom schrieb: Hi, Recently there was a post mentioning tesseract. Turns out that is an award winning opensource OCR that works! Hugo I use it with the gscan2pdf frontend and it works perfectly (at least for documents in german language) -- To UNSUBSCRIBE, email to debian-user

Re: tesseract: ocr that works

2008-12-28 Thread Anthony Campbell
On 28 Dec 2008, andmalc wrote: > On Dec 28, 5:10 am, Anthony Campbell wrote: > > On 21 Dec 2008, Hugo Vanwoerkom wrote: > > > [snip] > > > Yes, tesseract does work well. Here, xsane gives depth 24, but conversion > > to depth 8 is neither possible nor necessary. Following the docs, I did > > The

Re: tesseract: ocr that works

2008-12-28 Thread andmalc
On Dec 28, 5:10 am, Anthony Campbell wrote: > On 21 Dec 2008, Hugo Vanwoerkom wrote: > [snip] > Yes, tesseract does work well. Here, xsane gives depth 24, but conversion > to depth 8 is neither possible nor necessary. Following the docs, I did There is an option at the top of the Preferences/Fil

Re: tesseract: ocr that works

2008-12-28 Thread Anthony Campbell
On 21 Dec 2008, Hugo Vanwoerkom wrote: > Hi, > > Recently there was a post mentioning tesseract. > > Turns out that is an award winning opensource OCR that works! > > I tried it out: > > 1. apt-get install tesseract-ocr > 2. apt-get install tesseract-ocr-eng > 3. use xsane to scan a page at dpi 300

Re: tesseract: ocr that works

2008-12-27 Thread Bryan Bishop
On Sat, Dec 27, 2008 at 5:59 AM, Dotan Cohen wrote: > 2008/12/21 Hugo Vanwoerkom : >> [3] don't scan at less than 300 dpi > > And don't scan above 600 DPI! > > I forget which OCR I played with a few years ago, but 300 and 600 DPI > yielded satisfactory results. 1200 DPI made things _worse_ not bet

Re: tesseract: ocr that works

2008-12-27 Thread Dotan Cohen
2008/12/21 Hugo Vanwoerkom : > [3] don't scan at less than 300 dpi And don't scan above 600 DPI! I forget which OCR I played with a few years ago, but 300 and 600 DPI yielded satisfactory results. 1200 DPI made things _worse_ not better, possibly because of noise. This was on Fedora, so maybe it