Package: libtesseract-dev
Version: 4.0.0-2
Severity: normal

Hi!

Thanks for packaging tesseract!

I was looking for the training scripts and found them in
libtesseract-dev, which doesn't quite make sense to me: they are not
library headers or include files.  (I am referring to the three
scripts in /usr/share/tesseract-ocr, namely language-specific.sh,
tesstrain.sh and tesstrain_utils.sh.)

At the same time, the training binaries are in tesseract-ocr, such as
classifier_tester, lstmtraining and so on.  Would it not make more
sense to have *only* /usr/bin/tesseract in tesseract-ocr, and all of
the other binaries, along with the shell scripts noted above, in a
separate package called something like tesseract-training?  Few people
will be interested in these training scripts, and those who are will
probably want the tesstrain.sh script as well.

Note, though, that the shell scripts tesstrain_utils.sh and
language-specific.sh) are called from tesstrain.sh using $(dirname
$0); that could be fixed by putting tesstrain.sh as /usr/bin/tesstrain
and changing the references to the subscripts to explicitly refer to
/usr/share/tesseract-ocr/.

Best wishes,

   Julian

Reply via email to