On 7/26/2013 7:05 AM, Gustav wrote: > Hey guys, i have a Solr 4.3 instance running in my server, but Im having some > troubles with encoding URL querystring. > Im currently encoding my query characters, so, when its searched for "Café", > its actually encoded to "caf%E9" and "cão" is encoded to "c%E3o". > My URLencoding in tomcat is "iso-8859-1", but when i do a query like that to > solr(?q="caf%E9") It returns the error {msg=URLDecoder: Invalid character > encoding detected after position 2 of query string / form data (while > parsing as UTF-8),code=400}. It works perfectly in my Solr 3.5 instance(with > the same configurations), but in Solr 4.3 it just wont go. > > Is there any kind of configuration that i could change in Solr 4.3's config > that interprets this kind of encoding?
As I said in #solr (before I noticed this message), newer Solr versions force the container to UTF-8, no matter what the container config says. You (or your tools) will need to do the conversion to UTF-8 before Solr sees the data. This goes for both indexing and queries. Thanks, Shawn