Package: gscan2pdf
Version: 0.9.31-1
Severity: normal
Tags: l10n

Recognition of a scanned text within gscan2pdf using ocropus gives worse results than running e.g. ocrodjvu afterwards. Besides the general quality problems, umlauts like äöü are not recognized at all, e.g. ü is always replaced with ii. This makes the OCR feature impractical for German-language texts.

Since a separate run of ocropus after saving a DjVu file from gscan2pdf gives good results, this seems to be an issue with how ocropus is called from within gscan2pdf.

Thanks for your work!

Michael Below

-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (900, 'testing'), (500, 'proposed-updates'), (500, 'stable'), (10, 'unstable')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.35-trunk-amd64 (SMP w/4 CPU cores)
Locale: LANG=de_DE.UTF-8, LC_CTYPE=de_DE.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages gscan2pdf depends on:
ii  graphicsmagick-imagemagick 1.3.12-1      image processing tools providing I
ii  libconfig-general-perl     2.48-1        Generic Configuration Module
ii  libforks-perl              0.33-1        forks - emulate threads with fork
ii  libgoo-canvas-perl         0.06-1        Perl interface to the GooCanvas
ii  libgtk2-ex-simple-list-per 0.50-2        simple interface to Gtk2's complex
ii  libgtk2-imageview-perl     0.05-1        Perl bindings for the GtkImageView
ii  libhtml-parser-perl        3.66-1        collection of modules that parse H
ii  liblocale-gettext-perl     1.05-6        Using libc functions for internati
ii  libpdf-api2-perl           0.73-1        module for creating or modifying P
ii  libproc-processtable-perl  0.45-1        Perl library for accessing process
ii  libreadonly-perl           1.03-2        Facility for creating read-only sc
ii  librsvg2-common            2.26.3-1      SAX-based renderer library for SVG
ii  libsane-perl               0.03-1        Perl bindings for the SANE (Scanne
ii  libset-intspan-perl        1.14-1        Perl module to manage sets of inte
ii  libtiff-tools              3.9.4-4       TIFF manipulation and conversion t
ii  perl-modules [libarchive-t 5.10.1-14     Core Perl modules
ii  perlmagick                 8:6.6.0.4-2.2 Perl interface to the ImageMagick
ii  sane-utils                 1.0.21-4      API library for scanners -- utilit

Versions of packages gscan2pdf recommends:
ii  cuneiform            0.7.0+dfsg.1-1      multi-language OCR system
ii  djvulibre-bin        3.5.23-3            Utilities for the DjVu image forma
ii  gocr                 0.48-1              A command line OCR
ii  libgtk2-ex-podviewer 0.18-1              Perl Gtk2 widget for displaying Pl
ii  sane                 1.0.14-9            scanner graphical frontends
ii  tesseract-ocr        2.04-2+b1           Command line OCR tool
ii  unpaper              0.3-1               post-processing tool for scanned p
ii  xdg-utils            1.0.2+cvs20100307-2 desktop integration utilities from

gscan2pdf suggests no packages.

-- no debconf information

