I tested whether a different encoding would fix it in version 0.16.7-3. None did.
Here's how I tested all the available encodings:

  $ for encoding in $(pdftotext -listenc | sed 1d) ; do pdftotext -enc $encoding -layout -nopgbrk /tmp/pone.0009339.pdf - ; done | egrep "Atractylodes japonica" | less

Thanks,
Kingsley