On 7/26/2013 2:03 PM, Gustav wrote:
> Thanks for the answer Shawn,
> 
> The problem here is that in my client's application, the query beign encoded
> in iso-8859-1 its a *must*. So, this is kind of a trouble here.
> I just dont get how this encoding could work on queries in version 3.5, but
> it doesnt in 4.3.

Version 3.5 didn't force UTF-8, which led to a TON of problems with
misconfigured containers, notably tomcat.  SOLR-4265 (first available in
4.1.0) fixed this problem.

https://issues.apache.org/jira/browse/SOLR-4265

In your case, because you are actually aware of your character set
encoding, you use a Tomcat setting that would be incorrect for most
people, but it's correct for you.

It might be advisable for us to make the charset configurable in Solr.
I have started a developer discussion on it.  I do not know how
difficult it will be to implement, how long it might take, or whether it
will even be done.

Is there any way you can convert to UTF-8 before Solr sees the data?

Thanks,
Shawn

Reply via email to