Package: libtesseract-dev
Version: 4.0.0-2
Severity: normal

Hi!
Thanks for packaging tesseract!

I was looking for the training scripts and found them in libtesseract-dev, which doesn't quite make sense to me: they are not library headers or include files. (I am referring to the three scripts in /usr/share/tesseract-ocr, namely language-specific.sh, tesstrain.sh and tesstrain_utils.sh.) At the same time, the training binaries, such as classifier_tester, lstmtraining and so on, are in tesseract-ocr.

Would it not make more sense to have *only* /usr/bin/tesseract in tesseract-ocr, and all of the other binaries, along with the shell scripts noted above, in a separate package called something like tesseract-training? Few people will be interested in the training tools, and those who are will probably want the tesstrain.sh script as well.

Note, though, that the shell scripts tesstrain_utils.sh and language-specific.sh are called from tesstrain.sh using $(dirname $0), so moving tesstrain.sh on its own would break those references. That could be fixed by installing tesstrain.sh as /usr/bin/tesstrain and changing the references to the subscripts to refer explicitly to /usr/share/tesseract-ocr/.

Best wishes,

Julian
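
P.S. To make the last suggestion concrete, here is a rough sketch of how the relocated script might look up its subscripts. It is only an illustration, not a patch against the actual tesstrain.sh: the helper_path function and the TESSTRAIN_DATA_DIR override variable are names I made up, and the only path taken from the package is /usr/share/tesseract-ocr.

```shell
#!/bin/bash
# Sketch only: resolve the subscripts from a fixed data directory
# instead of from $(dirname $0).
# helper_path and TESSTRAIN_DATA_DIR are hypothetical names,
# not part of tesseract or of the Debian package.

helper_path() {
    # Default to the packaged location; allow an override for testing.
    local data_dir="${TESSTRAIN_DATA_DIR:-/usr/share/tesseract-ocr}"
    printf '%s/%s\n' "${data_dir}" "$1"
}

# In /usr/bin/tesstrain the subscripts would then be loaded like this
# (left commented out so the sketch runs without the package installed):
#   source "$(helper_path tesstrain_utils.sh)"
#   source "$(helper_path language-specific.sh)"
```

With this approach the script no longer depends on where it is installed, and the override variable would still allow running it against an unpacked source tree.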