Bug#470200: Unicode ligatures to ASCII

2008-04-26 Thread Kartik Mistry
On Sunday 27 Apr 2008 12:20:30 am William J Poser wrote: > U+201C, U+201D,and U+2212 are already handled by the -e option. > I don't understand why there would be a problem with them. > > Version 4.7, now available on my web site, adds U+FB00-U+FB04 and U+FB06 > to the -x option. It also adds a -B

Bug#470200: Unicode ligatures to ASCII

2008-04-26 Thread William J Poser
U+201C, U+201D,and U+2212 are already handled by the -e option. I don't understand why there would be a problem with them. Version 4.7, now available on my web site, adds U+FB00-U+FB04 and U+FB06 to the -x option. It also adds a -B option as a shorthand for cdefx and a -P option that passes throu

Bug#470200: unicode ligatures to ASCII

2008-03-09 Thread jidanni
Package: uni2ascii Version: 4.4-1 Severity: minor I would like to discuss today the Unicodes ¯ ’“”− ff fi fl ffi ... that is 00AF 2019 201C 201D 2212 FB00 FB01 FB02 FB03 ... You see, I noticed them when I used pdftotext on http://www.cs.ucr.edu/~anirban/Anir-networking07.pdf and then tired to read the