You might also try using CDATA blocks to wrap your Unicode text. It is usually much easier to view the text while debugging these problems.
On Thu, Mar 11, 2010 at 12:13 AM, Eric Pugh <ep...@opensourceconnections.com> wrote: > So I am using Sunspot to post over, which means an extra layer of > indirection between mean and my XML! I will look tomorrow. > > > On Mar 10, 2010, at 7:21 PM, Chris Hostetter wrote: > >> >> : Any time a character like that was index Solr through a unknown entity >> error. >> : But if converted to À or À then everything works great. >> : >> : I tried out using Tomcat versus Jetty and got the same results. Before >> I edit >> >> Uh, you mean like the characters in exampledocs/utf8-example.xml ? >> >> it contains literale utf8 characters, and it works fine. >> >> Based on your "À" comment I assume you are posting XML ... are you >> sure you are using the utf8 charset? >> >> -Hoss >> > > ----------------------------------------------------- > Eric Pugh | Principal | OpenSource Connections, LLC | 434.466.1467 | > http://www.opensourceconnections.com > Co-Author: Solr 1.4 Enterprise Search Server available from > http://www.packtpub.com/solr-1-4-enterprise-search-server > Free/Busy: http://tinyurl.com/eric-cal > > > > > > > > > -- Lance Norskog goks...@gmail.com