On 7/26/2013 7:05 AM, Gustav wrote:
> Hey guys, I have a Solr 4.3 instance running on my server, but I'm having some
> trouble with URL query string encoding.
> I'm currently encoding my query characters, so when a search is made for "Café",
> it is actually encoded as "caf%E9", and "cão" is encoded as "c%E3o".
> My URL encoding in Tomcat is "iso-8859-1", but when I send a query like that to
> Solr (?q="caf%E9"), it returns the error {msg=URLDecoder: Invalid character
> encoding detected after position 2 of query string / form data (while
> parsing as UTF-8),code=400}. It works perfectly in my Solr 3.5 instance (with
> the same configuration), but in Solr 4.3 it just won't work.
> 
> Is there any kind of setting that I could change in Solr 4.3's config
> so that it interprets this kind of encoding?

As I said in #solr (before I noticed this message), newer Solr versions
force the container to UTF-8, no matter what the container config says.

You (or your tools) will need to do the conversion to UTF-8 before Solr
sees the data.  This goes for both indexing and queries.
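
As a rough illustration of that conversion, here is a minimal Python
sketch. It assumes the client currently produces ISO-8859-1
percent-encoded values like the "caf%E9" above; the helper name
to_utf8_query is made up for this example and is not part of Solr or
any client library. It decodes the value as ISO-8859-1 and re-encodes
it as percent-encoded UTF-8:

```python
from urllib.parse import quote, unquote


def to_utf8_query(latin1_encoded: str) -> str:
    """Re-encode a percent-encoded ISO-8859-1 value as percent-encoded UTF-8.

    Hypothetical helper for illustration only.
    """
    # Decode the %XX escapes as ISO-8859-1 bytes to recover the real text.
    text = unquote(latin1_encoded, encoding="iso-8859-1")
    # Percent-encode the text again, this time as UTF-8 bytes.
    return quote(text, safe="", encoding="utf-8")


print(quote("café", encoding="iso-8859-1"))  # caf%E9 (what is being sent now)
print(to_utf8_query("caf%E9"))               # caf%C3%A9
print(to_utf8_query("c%E3o"))                # c%C3%A3o
```

If the original text is available, it is simpler to encode it as UTF-8
in the first place (quote(text, encoding="utf-8")) than to convert
afterwards.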

Thanks,
Shawn
