Package: tesseract-ocr Version: 2.03-1 Severity: normal Tags: patch The INSTALL file for ocropus says -------------------------------------------------------- The 2.03 release of Tesseract has a bug. We have a patch for it, it's called tesseract-2.03-patch.diff and located in the top-level OCRopus directory. So the commands to install Tesseract 2.03 might look like this:
wget http://tesseract-ocr.googlecode.com/files/tesseract-2.03.tar.gz tar xzf tesseract-2.03.tar.gz cd tesseract-2.03 wget http://tesseract-ocr.googlecode.com/files/tesseract-2.00.eng.tar.gz tar xzf tesseract-2.00.eng.tar.gz # or other language packages patch -p1 <../ocropus-0.2/tesseract-2.03-patch.diff # check this path! ./configure # CXXFLAGS="-fPIC -O2" ./configure if you want Python later make sudo make install # installs in /usr/local The installation will finish with an error message about having no install target in java/ subdirectory. That's another bug in 2.03 - just ignore it. Alternatively, you can use SVN: svn co http://tesseract-ocr.googlecode.com/svn/trunk/ tesseract-ocr cd tesseract-ocr ./configure # CXXFLAGS="-fPIC -O2" ./configure if you want Python later make sudo make install # installs in /usr/local ----------------------------------------------- Those instructions assume you have ocropus. It's available via svn checkout http://ocropus.googlecode.com/svn/trunk/ ocropus See also ITP ocropus, debian bug 461097. -- System Information: Debian Release: lenny/sid APT prefers testing APT policy: (990, 'testing'), (990, 'stable'), (50, 'unstable') Architecture: i386 (i686) Kernel: Linux 2.6.24-1-686 (SMP w/2 CPU cores) Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8) Shell: /bin/sh linked to /bin/bash Versions of packages tesseract-ocr depends on: ii libc6 2.7-10 GNU C Library: Shared libraries ii libgcc1 1:4.3.0-5 GCC support library ii libstdc++6 4.3.0-5 The GNU Standard C++ Library v3 ii libtiff4 3.8.2-8 Tag Image File Format (TIFF) libra ii tesseract-ocr-deu [tesseract- 2.00-1 tesseract-ocr language files for G ii tesseract-ocr-eng [tesseract- 2.00-1 tesseract-ocr language files for E tesseract-ocr recommends no packages. -- no debconf information -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]