Bug#572522: ocrodjvu: new problem with cuneiform engine

2010-03-05 Thread Janusz S. Bień
On Fri, 5 Mar 2010 Jakub Wilk wrote:
[...]
> That said, if you insist on ignoring exceptions, you can easily achieve
> that with a simple shell script like:
>
> cp in.djvu out.djvu
> djvused -e remove-txt out.djvu
> for p in $(seq 1 $(djvused -e n out.djvu))
> do
> ocrodjvu -p $p --in-plac
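The quoted script is cut off in this listing, so what follows is only a rough sketch of the same idea (process one page at a time and carry on when a page fails), written in Python rather than shell. The helper names `ocr_pages_tolerant` and `ocrodjvu_command` are hypothetical, and the `--in-place` flag is an assumption about how the truncated command line continues.

```python
import subprocess


def ocr_pages_tolerant(page_count, command_for_page, run=subprocess.call):
    """Run one OCR command per page and return the page numbers that failed."""
    failed = []
    for page in range(1, page_count + 1):
        # A non-zero exit status is recorded instead of aborting the loop,
        # so one bad page does not stop the remaining pages.
        status = run(command_for_page(page))
        if status != 0:
            failed.append(page)
    return failed


def ocrodjvu_command(page, path="out.djvu"):
    # Assumption: -p selects the page and --in-place rewrites the file,
    # which is how the truncated quote appears to continue.
    return ["ocrodjvu", "-p", str(page), "--in-place", path]
```

The page count would come from `djvused -e n out.djvu`, as in the quoted script; a call such as `ocr_pages_tolerant(n, ocrodjvu_command)` then returns the list of pages whose OCR run exited with an error.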

Bug#572522: ocrodjvu: new problem with cuneiform engine

2010-03-05 Thread Jakub Wilk
* Janusz S. Bień, 2010-03-05, 06:30:
[...]
ocrodjvu indeed crashes, but on the garbage-in-garbage-out principle. If you run ocrodjvu with the --debug option, you'll see that resulting hOCR files don't contain anything legible. In fact, hOCR for page 2 contains also some control characters, which
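The message is cut off before it says what the control characters cause. As a hedged illustration only, here is one way to locate such characters in an hOCR file's text, assuming the concern is the C0 control characters that XML 1.0 does not permit (everything below U+0020 except tab, line feed and carriage return). The function name `find_control_characters` is hypothetical and is not part of ocrodjvu.

```python
import re

# C0 control characters that XML 1.0 forbids: all code points below U+0020
# except tab (U+0009), line feed (U+000A) and carriage return (U+000D).
_FORBIDDEN = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f]")


def find_control_characters(text):
    """Return (offset, code point) pairs for each forbidden control character."""
    return [(m.start(), ord(m.group())) for m in _FORBIDDEN.finditer(text)]
```

Running this over the text of the hOCR files kept by `--debug` would show whether, and where, such characters occur.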

Bug#572522: ocrodjvu: new problem with cuneiform engine

2010-03-04 Thread Janusz S. Bień
On Thu, 4 Mar 2010 Jakub Wilk wrote:
[...]
> ocrodjvu indeed crashes, but on the garbage-in-garbage-out principle. If
> you run ocrodjvu with the --debug option, you'll see that resulting hOCR
> files don't contain anything legible. In fact, hOCR for page 2 contains
> also some control char

Bug#572522: ocrodjvu: new problem with cuneiform engine

2010-03-04 Thread Jakub Wilk
retitle 572522 ocrodjvu: crashes with ValueError on malformed hOCR
severity 572522 minor
thanks

ocrodjvu --render all --engine cuneiform --language pol --clear-text -o out.djvu in.djvu
Processing 'in.djvu':
- Page #1
- Page #2
Exception in thread Thread-2:
Traceback (most recent call last):
Fi

Bug#572522: ocrodjvu: new problem with cuneiform engine

2010-03-04 Thread jsbien
Package: ocrodjvu
Version: 0.4.2-1
Severity: normal

On Mon, 01 Mar 2010 jsb...@mimuw.edu.pl wrote:
> The input file is temporarily available at
> http://fleksem.klf.uw.edu.pl/~jsbien/tmp/in.djvu.

Now I get: --