Hello everyone,
I have a question that I imagine has been asked many times before, so I
apologize for the repeat.
I have a basic text field with the following text:
the word ”stemming” in quotes
Uploading the data yields no errors, however when it is indexed, the text looks
like this:
the word �stemming� in quotes
Searching for the word stemming, without quotes or otherwise, does not return
any hits.
Just some basic facts:
- I included the solr.CollationKeyFilterFactory filter on the fieldType.
- Updating the index is done via a "solr xml" document. I've confirmed that the
document contains the right quote marks immediately prior to uploading.
- Updating the index is done via solrj, essentially:
DirectXmlRequest up = new DirectXmlRequest( "/update", xml );
solrServer.request( up );
solrServer.commit();
- In solr admin, the characters look like garbage, surrounding the word
stemming (as shown above)
Thanks in advance for any details you can provide!
-Kristian