Package: gscan2pdf
Version: 0.9.31-1
Severity: normal
Tags: l10n

Recognition of a scanned text within gscan2pdf using ocropus
results in a worse quality than running e.g. ocrodjvu afterwards.
Besides the general quality problems, umlauts like äöü are not
recognized at all, e.g. ü is always replaced with ii. This makes
the OCR feature impractical for german language texts.

Since a separate run of ocropus after saving a djvu from gscan2pdf
gives good results, this seems to be an issue of how ocropus is
called from within gscan2pdf.

Thanks for your work!

Michael Below

-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (900, 'testing'), (500, 'proposed-updates'), (500, 'stable'), 
(10, 'unstable')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.35-trunk-amd64 (SMP w/4 CPU cores)
Locale: LANG=de_DE.UTF-8, LC_CTYPE=de_DE.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages gscan2pdf depends on:
ii  graphicsmagick-imagemagick 1.3.12-1      image processing tools providing I
ii  libconfig-general-perl     2.48-1        Generic Configuration Module
ii  libforks-perl              0.33-1        forks - emulate threads with fork
ii  libgoo-canvas-perl         0.06-1        Perl interface to the GooCanvas
ii  libgtk2-ex-simple-list-per 0.50-2        simple interface to Gtk2's complex
ii  libgtk2-imageview-perl     0.05-1        Perl bindings for the GtkImageView
ii  libhtml-parser-perl        3.66-1        collection of modules that parse H
ii  liblocale-gettext-perl     1.05-6        Using libc functions for internati
ii  libpdf-api2-perl           0.73-1        module for creating or modifying P
ii  libproc-processtable-perl  0.45-1        Perl library for accessing process
ii  libreadonly-perl           1.03-2        Facility for creating read-only sc
ii  librsvg2-common            2.26.3-1      SAX-based renderer library for SVG
ii  libsane-perl               0.03-1        Perl bindings for the SANE (Scanne
ii  libset-intspan-perl        1.14-1        Perl module to manage sets of inte
ii  libtiff-tools              3.9.4-4       TIFF manipulation and conversion t
ii  perl-modules [libarchive-t 5.10.1-14     Core Perl modules
ii  perlmagick                 8:6.6.0.4-2.2 Perl interface to the ImageMagick 
ii  sane-utils                 1.0.21-4      API library for scanners -- utilit

Versions of packages gscan2pdf recommends:
ii  cuneiform            0.7.0+dfsg.1-1      multi-language OCR system
ii  djvulibre-bin        3.5.23-3            Utilities for the DjVu image forma
ii  gocr                 0.48-1              A command line OCR
ii  libgtk2-ex-podviewer 0.18-1              Perl Gtk2 widget for displaying Pl
ii  sane                 1.0.14-9            scanner graphical frontends
ii  tesseract-ocr        2.04-2+b1           Command line OCR tool
ii  unpaper              0.3-1               post-processing tool for scanned p
ii  xdg-utils            1.0.2+cvs20100307-2 desktop integration utilities from

gscan2pdf suggests no packages.

-- no debconf information



--
To UNSUBSCRIBE, email to [email protected]
with a subject of "unsubscribe". Trouble? Contact [email protected]

Reply via email to