Package: ocropus Version: 0.3.1-2 Severity: normal The doctype declaration in the html file produced by ocroscript 0.3.1-2 is missing two quotation marks. This causes a problem with ElementTree.py parsing, and also fails a html validation test.
The original html output with this doctype declaration failed validation at http://validator.w3.org/check <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> Adding two quotation marks produced a file that passed. This doctype element is one of the recommended list of doctype declarations at http://www.w3.org/QA/2002/04/valid-dtd-list.html. <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> -- System Information: Debian Release: squeeze/sid APT prefers testing APT policy: (500, 'testing') Architecture: i386 (i686) Kernel: Linux 2.6.30-2-486 Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8) Shell: /bin/sh linked to /bin/dash Versions of packages ocropus depends on: ii libc6 2.10.2-2 GNU C Library: Shared libraries ii libgcc1 1:4.4.2-9 GCC support library ii libiulib0 0.3-1+b1 C++ library of image understanding ii libjpeg62 6b-15 The Independent JPEG Group's JPEG ii liblua5.1-0 5.1.4-5 Simple, extensible, embeddable pro ii libpng12-0 1.2.42-1 PNG library - runtime ii libstdc++6 4.4.2-9 The GNU Standard C++ Library v3 ii libtiff4 3.9.2-2 Tag Image File Format (TIFF) libra ii ocropus-data 0.3.1-2 document analysis and OCR system - ii zlib1g 1:1.2.3.4.dfsg-3 compression library - runtime Versions of packages ocropus recommends: ii tesseract-ocr 2.04-2 Command line OCR tool ocropus suggests no packages. -- no debconf information -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org