Re: Solr HTTP client authentication

2014-11-17 Thread Bai Shen
enizer.ca > > > -Original Message- > From: Anurag Sharma [mailto:anura...@gmail.com] > Sent: November-17-14 11:21 AM > To: solr-user@lucene.apache.org > Subject: Re: Solr HTTP client authentication > > I think Solr encourage SSL than authentication > > On Mon, No

Solr HTTP client authentication

2014-11-17 Thread Bai Shen
I am using SolrJ to connect to my Solr server. However, I need to authenticate against the server and cannot find out how to do so using SolrJ. Is this possible, or do I need to drop SolrJ? I can manually create an HttpClient and set up authentication, but then I can't use SolrJ. Thanks.
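
For reference, a minimal sketch of what HTTP Basic authentication looks like on the wire, independent of SolrJ. It assumes the server is protected with Basic auth (the message does not say which scheme is in use); the URL, core name, and credentials below are hypothetical. The request is only built, not sent.

```python
import base64
import urllib.request
from urllib.parse import urlencode


def basic_auth_header(user: str, password: str) -> str:
    """Return the value of an HTTP Basic Authorization header."""
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    return f"Basic {token}"


def build_select_request(base_url: str, user: str, password: str) -> urllib.request.Request:
    """Build (but do not send) an authenticated request to Solr's /select handler."""
    params = urlencode({"q": "*:*", "wt": "json"})
    request = urllib.request.Request(f"{base_url}/select?{params}")
    request.add_header("Authorization", basic_auth_header(user, password))
    return request


# Hypothetical URL and credentials, for illustration only.
req = build_select_request("http://localhost:8983/solr/collection1", "solr_user", "secret")
```

Passing `req` to `urllib.request.urlopen` would send it. Any HTTP client, including one handed to a higher-level library, ends up attaching the same header.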

Re: Is indexing large documents still an issue?

2013-05-06 Thread Bai Shen
s. > Did you solve this in some other way? or you just didn't have to? > > > Bai Shen wrote > > The only issue I ran into was returning the content field. Once I > > modified > > my query to avoid that, I got good performance. > > > > Admittedly, I only

Re: Is indexing large documents still an issue?

2013-05-02 Thread Bai Shen
The only issue I ran into was returning the content field. Once I modified my query to avoid that, I got good performance. Admittedly, I only have about 15-20k documents in my index at the moment, but most of them are in the multi-MB range, with a current maximum of 250MB. On Thu, May 2, 2013 at 7:05 AM, adfel

Re: Only return snippets, not content

2013-05-01 Thread Bai Shen
t still > down (uh oh), but it's a setting in solrconfig.xml. > > > Michael Della Bitta > > > Appinions > 18 East 41st Street, 2nd Floor > New York, NY 10017-6271 > > www.appinions.com > > Where Influ

Re: Only return snippets, not content

2013-05-01 Thread Bai Shen
Fixed it. I just had to add "&fl=" listing all of the fields I wanted returned, leaving out content. On Wed, May 1, 2013 at 7:38 AM, Bai Shen wrote: > I have a lot of large files that I've indexed into solr. Is there a way > to have solr return the snippets instead
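
A minimal sketch of the kind of query URL this fix produces, assuming highlighting is used to generate the snippets. The `fl` parameter limits the stored fields returned, while `hl` and `hl.fl` request snippets from the content field. The base URL and the field names `id`, `title`, and `url` are hypothetical; only `content` comes from the message.

```python
from urllib.parse import urlencode


def snippet_query_url(base_url, query, fields, highlight_field):
    """Build a Solr /select URL that returns only `fields` plus highlight snippets."""
    params = {
        "q": query,
        "fl": ",".join(fields),   # stored fields to return; the large field is left out
        "hl": "true",             # turn highlighting on
        "hl.fl": highlight_field, # field the snippets are taken from
        "wt": "json",
    }
    return f"{base_url}/select?{urlencode(params)}"


# Hypothetical core URL and field names, for illustration only.
url = snippet_query_url(
    "http://localhost:8983/solr/collection1",
    "solr",
    ["id", "title", "url"],
    "content",
)
```

The snippets come back in the separate highlighting section of the response, so the content field does not need to appear in `fl`.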

Only return snippets, not content

2013-05-01 Thread Bai Shen
I have a lot of large files that I've indexed into Solr. Is there a way to have Solr return the snippets instead of the content? I'm only displaying the snippets to my users, so transferring the content as well just wastes bandwidth. Thanks.

Re: Removing old documents

2012-05-02 Thread Bai Shen
ents > in one run based on segment data. > > On Tuesday 01 May 2012 16:31:47 Bai Shen wrote: > > I'm running Nutch, so it's updating the documents, but I'm wanting to > > remove ones that are no longer available. So in that case, there's no > >

Re: Removing old documents

2012-05-01 Thread Bai Shen
> Not sure if there is an automatic way but we do it via a delete query and > where possible we update doc under same id to avoid deletes. > > > > > > On 01/05/2012 13:43, "Bai Shen" wrote: > > >What is the best method to remove old documents? Things that no gen
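
A minimal sketch of the delete-query approach mentioned in the reply, assuming documents carry a timestamp field that can be used to find stale entries. The field name `tstamp` and the 30-day cutoff are hypothetical. The function only builds the XML body; it would be POSTed to Solr's `/update` handler and followed by a commit.

```python
from xml.sax.saxutils import escape


def delete_by_query_body(query: str) -> str:
    """Return the XML update message that deletes every document matching `query`."""
    # Escape &, <, and > so the query cannot break the XML envelope.
    return f"<delete><query>{escape(query)}</query></delete>"


# Hypothetical field name and cutoff: drop documents not refreshed in 30 days.
body = delete_by_query_body("tstamp:[* TO NOW-30DAYS]")
```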

Re: Language Identification

2012-04-23 Thread Bai Shen
r Training - www.solrtraining.com > > On 20. apr. 2012, at 21:49, Bai Shen wrote: > > > I'm working on using Shuyo's work to improve the language identification > of > > our search. Apparently, it's been moved from Nutch to Solr. Is there a > > rea

Language Identification

2012-04-20 Thread Bai Shen
I'm working on using Shuyo's work to improve the language identification of our search. Apparently, it's been moved from Nutch to Solr. Is there a reason for this? http://code.google.com/p/language-detection/issues/detail?id=34 I would prefer to have the processing done in Nutch as that has the

Solr memory usage

2011-12-30 Thread Bai Shen
I have Solr running on a single machine with 8GB of RAM. Right now I have about 1.5 million documents indexed, which has produced a 30GB index. When I look in top, the Tomcat process that is hosting Solr says that it's using 38GB of VIRT, 6.6G RES, and 2GB SHR. The machine is showing a complete