Hello everyone,

I have a question that I imagine has been asked many times before, so I 
apologize for the repeat.

I have a basic text field with the following text:
        the word ”stemming” in quotes

Uploading the data yields no errors; however, once it is indexed, the text looks 
like this:

the word �stemming� in quotes


Searching for the word stemming, without quotes or otherwise, does not return 
any hits.

Just some basic facts:

- I included the solr.CollationKeyFilterFactory filter on the fieldType.
- Updating the index is done via a "solr xml" document. I've confirmed that the 
document contains the right quote marks immediately prior to uploading.
- Updating the index is done via solrj, essentially:
        DirectXmlRequest up = new DirectXmlRequest( "/update", xml );
        solrServer.request( up );
        solrServer.commit();
- In Solr admin, the characters surrounding the word stemming look like 
garbage (as shown above).
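
A minimal standalone sketch of one possible explanation, assuming (not confirmed 
here) that the quote marks are encoded in one charset and decoded in another 
somewhere between the XML string and the index. The class name EncodingCheck, 
the helper roundTrip, and the choice of windows-1252 as the mismatching charset 
are all hypothetical, chosen only for illustration; nothing here calls Solr or 
solrj.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EncodingCheck {
    // Encode text with one charset and decode the resulting bytes with another,
    // mimicking a sender and a receiver that disagree about the encoding.
    public static String roundTrip(String text, Charset encodeWith, Charset decodeWith) {
        byte[] bytes = text.getBytes(encodeWith);
        return new String(bytes, decodeWith);
    }

    public static void main(String[] args) {
        // The field value from the post, with U+201D right double quotation marks.
        String text = "the word \u201Dstemming\u201D in quotes";

        // Assumption: windows-1252 stands in for whatever non-UTF-8 charset
        // might be involved; the real one (if any) is unknown.
        Charset cp1252 = Charset.forName("windows-1252");

        // Same charset on both sides: the text survives unchanged.
        String intact = roundTrip(text, StandardCharsets.UTF_8, StandardCharsets.UTF_8);
        System.out.println(intact.equals(text));

        // Mismatched charsets: each quote becomes a byte that is invalid as
        // UTF-8, so the decoder substitutes U+FFFD (the replacement character).
        String garbled = roundTrip(text, cp1252, StandardCharsets.UTF_8);
        System.out.println(garbled.indexOf('\uFFFD') >= 0);
    }
}
```

Both lines print true. The second case shows the same kind of replacement 
character that appears around the word stemming in the indexed text, which is 
why a charset mismatch seems worth ruling out.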


Thanks in advance for any details you can provide!
-Kristian
