Re: UTF-8 and latin accents

2009-10-08 Thread Yonik Seeley
On Thu, Oct 8, 2009 at 12:48 PM, Claudio Martella wrote:
> I'm trying to index documents with Latin accents (Italian documents). I
> extract the text from .doc documents with Tika directly into .xml files.
> If I open up the XML document with my Dashcode (I run Mac OS X) I can
> see the characters

UTF-8 and latin accents

2009-10-08 Thread Claudio Martella
Hello list, I'm trying to index documents with Latin accents (Italian documents). I extract the text from .doc documents with Tika directly into .xml files. If I open up the XML document with my Dashcode (I run Mac OS X), I can see the characters correctly. My XML document is an XML document with t