In firefox, character encoding is set to UTF-8
Yes, I'm sending the query directly to solr using apache httpclient and
I set the http request header content type to : Content-Type="text/html;
charset=UTF-8"

Any suggestions

Thanks in advance

-----Original Message-----
From: Grant Ingersoll [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, October 10, 2007 2:43 PM
To: solr-user@lucene.apache.org
Subject: Re: problems with arabic search

Can you give more detail about what you have done?  What character  
encoding do you have your browser set to?  In Firefox, do View ->  
Character Encoding to see what it is set to when you are on the input  
page?  Internet Explorer and other browsers have other options.  Are  
you sending the query directly to Solr or is it going through some  
other servlet?  If you are doing this, and _IF_ I recall correctly, I  
believe you need to tell your servlet the input is UTF-8 before doing  
anything else with the request.

See http://kickjava.com/src/filters/ 
SetCharacterEncodingFilter.java.htm for a Servlet Filter that does  
this (it's even Apache licensed!)  You will need to hook it up in  
your web.xml.

On Oct 10, 2007, at 2:59 AM, Heba Farouk wrote:

> I'm developing a java application using solr, this application is
> working with English search
>
> Yes, I have tried querying solr directly for Arabic and it's working
>
> Any suggestions ??
>
> -----Original Message-----
> From: Chris Hostetter [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, October 10, 2007 5:50 AM
> To: solr-user@lucene.apache.org
> Subject: Re: problems with arabic search
>
>
> FYI: you don't need to resend your question just because you didn't  
> get
> a
> reply within a day, either people haven't had a chance to reply, or  
> they
>
> don't know the answer.
>
> : XML Parsing Error: mismatched tag. Expected: </HR>.
> :
> : Location:
> http://localhost:8080/solrServlet/searchServlet?query=%D9%85%D8%AD% 
> D9%85
> %D8%AF&cmdSearch=Search%21
>
> this doesn't look like a query error .. and that doesn't look like a
> solr
> URL, this looks something you have in front of Solr.
>
> : </head><body><h1>HTTP Status 400 - Query parsing error: Cannot parse
> : '????': '*' or '?' not allowed as first character in
>
> that looks like a Solr error.  i'm guessing that your app isn't  
> dealing
> with the UTF8 correctly, something is substituting "?" characters in
> place
> of any character it doesn't understand - and Solr thinks you are  
> trying
> to
> do a wildcard query.
>
> have you tried querying solr directly (in your browser or using curl)
> for
> your arabic word?
>
>
> -Hoss
>

--------------------------
Grant Ingersoll
http://lucene.grantingersoll.com

Lucene Boot Camp Training:
ApacheCon Atlanta, Nov. 12, 2007.  Sign up now!  http:// 
www.apachecon.com

Reply via email to