On Fri, Oct 2, 2009 at 11:23 PM, julien <[email protected]> wrote:

> New rev ready to be pulled from. I have tested the hocr output and it works 
> fine.
> Now ocr_line follows the standard according to the hocr ref from 2007 
> mentioned earlier.
> (E.g. the char bboxes are in ocr_cinfo, and the text line is in pure text as 
> text content for the ocr_line tag).

Looks nice, thanks. However, your editor seems to have mangled the
Russian comments somehow. I get tons of lines like these:

-    // ?????????? ??? ????????
+    // ���������� ��� ��������

In case it does not get through properly: the first line has plain
question marks, while the second one has Unicode replacement
characters (U+FFFD). Could you look into fixing this?

_______________________________________________
Mailing list: https://launchpad.net/~cuneiform
Post to     : [email protected]
Unsubscribe : https://launchpad.net/~cuneiform
More help   : https://help.launchpad.net/ListHelp
