I am confirming having this bug as well. It seems that the English unicharset was not included in the package.
I am using Ubuntu 8.04.1 and tesseract-ocr 2.01-3. The workaround is to install the package manually. Open a terminal and run: $ sudo apt-get install tesseract-ocr-eng I found that you should use a high quality image when converting to text through OCR or you are likely to run into spelling errors. Please make english part of the default package (instead of German) or make it a dependency when packaging. -- Installing tesseract-ocr should also install tesseract-ocr-eng https://bugs.launchpad.net/bugs/224264 You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs