I'm having similar issue. I can confirm that it is not related to Cuneiform. I'm using ocropus (ocroscript recognize) (which uses Tesseract) and I have check the resulting .html (hocr) which seems valid and pixel perfect. However, hocr2pdf misalign the text with their related bounding boxes. I've tried ocroscript recognize with and without the --charboxes options and the result is always wrong (the text has an offset on the Y axis).
This is with exactimage 0.8.1-3build1 on Ubuntu Natty. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/623438 Title: Font size not correct in merged sandvich PDF To manage notifications about this bug go to: https://bugs.launchpad.net/cuneiform-linux/+bug/623438/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs