Unless I'm mistaken, other locales didn't solve the problem. Here's how I tested them:
$ locale -a C C.UTF-8 en_US en_US.iso88591 en_US.utf8 POSIX $ for encoding in $(pdftotext -listenc | sed 1d) ; do echo Trying encoding $encoding ; for loc in $(locale -a) ; do LC_ALL="$loc" pdftotext -enc $encoding -layout -nopgbrk /tmp/pone.0009339.pdf - ; done ; done | egrep "Atractylodes japonica" | less -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org