Posting utf8-example.xml is the first thing I tried when I ran into this problem, and like the other files I had been working with, query results return garbage characters inside of unicode.
On 5/25/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
On 5/25/07, Ethan Gruber <[EMAIL PROTECTED]> wrote: > Yes, it's definitely encoded in UTF-8. I'm going to attempt either today or > Tuesday to post the files to a solr index that is online (as opposed to > localhost as was my case a few days ago) using post.sh through SSH and let > you know how it turns out. That should definitely indicate whether or not > the problem is with my files themselves or the post.jar file. Why don't you try a file that we know is encoded in UTF-8, the solr/example/exampledocs/utf8-example.xml Try it first without modifying it (an editor can change the encoding a file is stored in). -Yonik