In firefox, character encoding is set to UTF-8 Yes, I'm sending the query directly to solr using apache httpclient and I set the http request header content type to : Content-Type="text/html; charset=UTF-8"
Any suggestions Thanks in advance -----Original Message----- From: Grant Ingersoll [mailto:[EMAIL PROTECTED] Sent: Wednesday, October 10, 2007 2:43 PM To: solr-user@lucene.apache.org Subject: Re: problems with arabic search Can you give more detail about what you have done? What character encoding do you have your browser set to? In Firefox, do View -> Character Encoding to see what it is set to when you are on the input page? Internet Explorer and other browsers have other options. Are you sending the query directly to Solr or is it going through some other servlet? If you are doing this, and _IF_ I recall correctly, I believe you need to tell your servlet the input is UTF-8 before doing anything else with the request. See http://kickjava.com/src/filters/ SetCharacterEncodingFilter.java.htm for a Servlet Filter that does this (it's even Apache licensed!) You will need to hook it up in your web.xml. On Oct 10, 2007, at 2:59 AM, Heba Farouk wrote: > I'm developing a java application using solr, this application is > working with English search > > Yes, I have tried querying solr directly for Arabic and it's working > > Any suggestions ?? > > -----Original Message----- > From: Chris Hostetter [mailto:[EMAIL PROTECTED] > Sent: Wednesday, October 10, 2007 5:50 AM > To: solr-user@lucene.apache.org > Subject: Re: problems with arabic search > > > FYI: you don't need to resend your question just because you didn't > get > a > reply within a day, either people haven't had a chance to reply, or > they > > don't know the answer. > > : XML Parsing Error: mismatched tag. Expected: </HR>. > : > : Location: > http://localhost:8080/solrServlet/searchServlet?query=%D9%85%D8%AD% > D9%85 > %D8%AF&cmdSearch=Search%21 > > this doesn't look like a query error .. and that doesn't look like a > solr > URL, this looks something you have in front of Solr. > > : </head><body><h1>HTTP Status 400 - Query parsing error: Cannot parse > : '????': '*' or '?' not allowed as first character in > > that looks like a Solr error. i'm guessing that your app isn't > dealing > with the UTF8 correctly, something is substituting "?" characters in > place > of any character it doesn't understand - and Solr thinks you are > trying > to > do a wildcard query. > > have you tried querying solr directly (in your browser or using curl) > for > your arabic word? > > > -Hoss > -------------------------- Grant Ingersoll http://lucene.grantingersoll.com Lucene Boot Camp Training: ApacheCon Atlanta, Nov. 12, 2007. Sign up now! http:// www.apachecon.com