Re: Solr under tomcat - UTF-8 issue

2010-01-23 Thread Sven Maurmann
Hi, I did not read the original mail, but for the UTF-8 issue with Tomcat you might consult the url http://wiki.apache.org/solr/SolrTomcat The relevant piece of information is under "URI Charset Config": *** quote *** Edit Tomcat's conf/server.xml and add the following attribute

Re: Solr under tomcat - UTF-8 issue

2010-01-22 Thread Frank Wesemann
Glock, Thomas schrieb: My flex client httpservice by default only sets the content-type request header to "application/x-www-form-urlencoded" what it needed to do for tomcat is set the content-type request header to content-type = "application/x-www-form-urlencoded; charset=UTF-8"; As s

RE: Solr under tomcat - UTF-8 issue

2009-10-26 Thread Glock, Thomas
26, 2009 2:12 AM To: solr-user@lucene.apache.org Subject: RE: Solr under tomcat - UTF-8 issue I was originally using POST for the same reason, however I discovered that Tomcat could easily be configured to accept any length URI. All it requires is specifying the maxHttpHeaderSize attribute in yo

RE: Solr under tomcat - UTF-8 issue

2009-10-25 Thread markwaddle
awesome. I > got the book two days ago too - fantastic! > > Thanks again, > Tom > -- View this message in context: http://www.nabble.com/Solr-under-tomcat---UTF-8-issue-tp26040052p26054942.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: Solr under tomcat - UTF-8 issue

2009-10-24 Thread Glock, Thomas
or so ago and I think its awesome. I got the book two days ago too - fantastic! Thanks again, Tom -Original Message- From: Walter Underwood [mailto:wun...@wunderwood.org] Sent: Saturday, October 24, 2009 1:31 PM To: solr-user@lucene.apache.org Subject: Re: Solr under tomcat - UTF-8 iss

Re: Solr under tomcat - UTF-8 issue

2009-10-24 Thread Walter Underwood
eeley Sent: Saturday, October 24, 2009 12:43 PM To: solr-user@lucene.apache.org Subject: Re: Solr under tomcat - UTF-8 issue Try using example/exampledocs/test_utf8.sh to narrow down if the charset problems you're hitting are due to servlet container configuration. -Yonik http://w

RE: Solr under tomcat - UTF-8 issue

2009-10-24 Thread Glock, Thomas
Of Yonik Seeley Sent: Saturday, October 24, 2009 12:43 PM To: solr-user@lucene.apache.org Subject: Re: Solr under tomcat - UTF-8 issue Try using example/exampledocs/test_utf8.sh to narrow down if the charset problems you're hitting are due to servlet container configuration. -Yonik http:

Re: Solr under tomcat - UTF-8 issue

2009-10-24 Thread Yonik Seeley
---Original Message- > From: Zsolt Czinkos [mailto:czin...@gmail.com] > Sent: Saturday, October 24, 2009 11:36 AM > To: solr-user@lucene.apache.org > Subject: Re: Solr under tomcat - UTF-8 issue > > Hello > > Have you set URIEncoding attribute to UTF-8 in tomcat

RE: Solr under tomcat - UTF-8 issue

2009-10-24 Thread Glock, Thomas
ng it. Thanks for your help. -Original Message- From: Zsolt Czinkos [mailto:czin...@gmail.com] Sent: Saturday, October 24, 2009 11:36 AM To: solr-user@lucene.apache.org Subject: Re: Solr under tomcat - UTF-8 issue Hello Have you set URIEncoding attribute to UTF-8 in tomcat's serv

Re: Solr under tomcat - UTF-8 issue

2009-10-24 Thread Zsolt Czinkos
Hello Have you set URIEncoding attribute to UTF-8 in tomcat's server.xml (on connector element)? Like: Hope this helps. Best regards czinkos 2009/10/24 Glock, Thomas : > > Hoping someone can help - > > Problem: >        Querying for non-english phrases such as Добавить do not return any >

Solr under tomcat - UTF-8 issue

2009-10-24 Thread Glock, Thomas
Hoping someone can help - Problem: Querying for non-english phrases such as Добавить do not return any results under Tomcat but do work when using the Jetty example. Both tomcat and jetty are being queried by the same custom (flash) client and both reference the same solr/da

Re: utf 8 issue

2009-02-18 Thread revathy arun
Hi Eril, $post_string is xml data i dont see any content for those files when i give *:* .what would that mean? On 2/19/09, Erik Hatcher wrote: > > > On Feb 18, 2009, at 1:53 PM, revathy arun wrote: > >> I am using php curl to post data to solr >> >> container tomcat >> i have uriencoding s

Re: utf 8 issue

2009-02-18 Thread Erik Hatcher
On Feb 18, 2009, at 1:53 PM, revathy arun wrote: I am using php curl to post data to solr container tomcat i have uriencoding set to utf8 in tomcats server.xml file this is how its indexed $header[] = "Content-Type: text/xml; charset=utf-8"; curl_setopt($ch, CURLOPT_URL,$url); curl_set

Re: utf 8 issue

2009-02-18 Thread revathy arun
I am using php curl to post data to solr container tomcat i have uriencoding set to utf8 in tomcats server.xml file this is how its indexed $header[] = "Content-Type: text/xml; charset=utf-8"; curl_setopt($ch, CURLOPT_URL,$url); curl_setopt( $ch, CURLOPT_HTTPHEADER, $header ); curl_se

Re: utf 8 issue

2009-02-18 Thread Erik Hatcher
On Feb 18, 2009, at 7:34 AM, revathy arun wrote: I am trying to index various langauge documents (foroyo,chinese,japanese) .These have been converted from pdf to text using xpdf I am using the standard anlyzer for content analysis ,but i am not able to search anything from some of the files

Re: utf 8 issue

2009-02-18 Thread Gert Brinkmann
revathy arun wrote: > Is there any way to check the encoding of a text/pdf document or convert > them to utf -8 encoding? If you are using pdftotext you could set the enc parameter: pdftotext -enc UTF-8 filename How can you convert PDFs to text via xpdf programmatically? Greetings, Gert

utf 8 issue

2009-02-18 Thread revathy arun
Hi , I am trying to index various langauge documents (foroyo,chinese,japanese) .These have been converted from pdf to text using xpdf I am using the standard anlyzer for content analysis ,but i am not able to search anything from some of the files. My guess is that these documents are not in utf-