I'm having similar issue. I can confirm that it is not related to Cuneiform. 
I'm using ocropus (ocroscript recognize) (which uses Tesseract)  and I have 
check the resulting .html (hocr) which seems valid and pixel perfect.
 
However, hocr2pdf misalign the text with their related bounding boxes. I've 
tried ocroscript recognize with and without the --charboxes options and the 
result is always wrong (the text has an offset on the Y axis).

This is with exactimage 0.8.1-3build1 on Ubuntu Natty.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/623438

Title:
  Font size not correct in merged sandvich PDF

To manage notifications about this bug go to:
https://bugs.launchpad.net/cuneiform-linux/+bug/623438/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to