Re: Does solr supports indexing of files other than UTF-8

2011-01-28 Thread Yonik Seeley
On Thu, Jan 27, 2011 at 3:51 AM, prasad deshpande wrote: > The size of docs can be huge, like suppose there are 800MB pdf file to index > it I need to translate it in UTF-8 and then send this file to index. PDF is binary AFAIK... you shouldn't need to do any charset translation before sending it

Re: Does solr supports indexing of files other than UTF-8

2011-01-28 Thread Dennis Gearon
.org Sent: Fri, January 28, 2011 12:41:29 AM Subject: Re: Does solr supports indexing of files other than UTF-8 Thanks paul. However I want to support local encoding files to be indexed. How would I achieve it? On Thu, Jan 27, 2011 at 2:46 PM, Paul Libbrecht wrote: > At least in java utf-8 transco

Re: Does solr supports indexing of files other than UTF-8

2011-01-28 Thread prasad deshpande
Thanks paul. However I want to support local encoding files to be indexed. How would I achieve it? On Thu, Jan 27, 2011 at 2:46 PM, Paul Libbrecht wrote: > At least in java utf-8 transcoding is done on a stream basis. No issue > there. > > paul > > > Le 27 janv. 2011 à 09:51, prasad deshpande a

Re: Does solr supports indexing of files other than UTF-8

2011-01-27 Thread Paul Libbrecht
At least in java utf-8 transcoding is done on a stream basis. No issue there. paul Le 27 janv. 2011 à 09:51, prasad deshpande a écrit : > The size of docs can be huge, like suppose there are 800MB pdf file to index > it I need to translate it in UTF-8 and then send this file to index. Now > sup

Re: Does solr supports indexing of files other than UTF-8

2011-01-27 Thread prasad deshpande
The size of docs can be huge, like suppose there are 800MB pdf file to index it I need to translate it in UTF-8 and then send this file to index. Now suppose there can be any number of clients who can upload file. at that time it will affect performance. and already our product support localization

Re: Does solr supports indexing of files other than UTF-8

2011-01-27 Thread Paul Libbrecht
Why is converting documents to utf-8 not feasible? Nowadays any platform offers such services. Can you give a detailed failure description (maybe with the URL to a sample document you post)? paul Le 27 janv. 2011 à 07:31, prasad deshpande a écrit : > I am able to successfully index/search non-