Re: How to improve Solr search performance

2008-04-09 Thread 李银松
2008/4/9, Chris Hostetter <[EMAIL PROTECTED]>: > > > : most of the time seems to be used for the writer getting and writing the > docs > : can those docs be prefetched? > > as mentioned, the documentCache can help you out in the common case, but > 1-4 seconds for just the XMLWriting seems pretty high ...

Can I find the which field matched?

2008-04-09 Thread Umar Shah
Hi, if I throw a query at the Solr index, is there a mechanism by which I can find out which fields matched the query (and the score of each match)? Example: for fields A, B and C, if query q has term1 term2 term3, field A matches term1 term2, and field C matches term3, can I get component scores of the whol

Re: Snipets Solr/nutch

2008-04-09 Thread khirb7
Thank you for your response. I have another problem with snippets. Here is the problem: I transform the HTML code into text, then I index all of the generated text into one field called myText. Many pages have a common header with common information (example: a web site about President Bush) and t

Re: Return the result only field A or field B is non-zero?

2008-04-09 Thread Walter Underwood
This would be trivial if you also stored boolean fields for aiszero and biszero. That would also be fast, I expect. wunder On 4/8/08 11:53 PM, "Vinci" <[EMAIL PROTECTED]> wrote: > > Hi all, > > I want to limit the search result by 2 numerical field, A and B, where Solr > return the result onl
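A minimal sketch of the suggestion above, assuming the flags are indexed as boolean fields named aiszero and biszero (the names come from the reply; the field type, the true/false values, and the example query "ipod" are assumptions). It builds request parameters that pass the restriction through Solr's `fq` filter-query parameter, so the user's own query in `q` stays unchanged:

```python
from urllib.parse import urlencode


def nonzero_filter_params(user_query, a_flag="aiszero", b_flag="biszero"):
    """Build Solr request parameters that keep only documents where
    field A or field B is non-zero, using precomputed boolean flags."""
    # A document passes when at least one flag says "not zero".
    fq = f"{a_flag}:false OR {b_flag}:false"
    return {"q": user_query, "fq": fq}


# Hypothetical user query; only the fq part implements the restriction.
params = nonzero_filter_params("ipod")
print(urlencode(params))
```

Because the restriction lives in `fq` rather than in `q`, it does not need to be merged into every user query by hand, and it does not affect relevance scoring.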

Re: Distributed Search

2008-04-09 Thread Yonik Seeley
On Wed, Apr 9, 2008 at 2:00 AM, oleg_gnatovskiy <[EMAIL PROTECTED]> wrote: > We are using the Chain Collapse patch as well. Will that not work over a > distributed index? Since there is no explicit distributed support for it, it would only collapse per-shard. -Yonik
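To illustrate what "collapse per-shard" implies, here is a toy model in plain Python. It is not the Chain Collapse patch's algorithm: the `collapse` helper, the `site` collapse key, and the sample documents are all hypothetical. It only shows that collapsing inside each shard and then merging can leave more than one document per key, whereas collapsing over the combined result set cannot.

```python
def collapse(docs, key):
    """Keep the highest-scoring document for each value of `key`."""
    best = {}
    for doc in docs:
        k = doc[key]
        if k not in best or doc["score"] > best[k]["score"]:
            best[k] = doc
    # Return survivors ordered by descending score.
    return sorted(best.values(), key=lambda d: -d["score"])


# Two hypothetical shards; "a.com" has documents on both.
shard1 = [{"id": 1, "site": "a.com", "score": 0.9},
          {"id": 2, "site": "a.com", "score": 0.5}]
shard2 = [{"id": 3, "site": "a.com", "score": 0.7},
          {"id": 4, "site": "b.com", "score": 0.6}]

# Per-shard collapse followed by a plain merge of the shard results.
per_shard = sorted(collapse(shard1, "site") + collapse(shard2, "site"),
                   key=lambda d: -d["score"])

# Collapse applied once over the combined result set.
combined = collapse(shard1 + shard2, "site")

print([d["id"] for d in per_shard])
print([d["id"] for d in combined])
```

In this toy data the per-shard result still contains two "a.com" documents (ids 1 and 3), while the combined collapse keeps only id 1 for that site.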

Re: How to improve Solr search performance

2008-04-09 Thread Chris Hostetter
: 1) how are you timing this (ie: what exactly are you measuring) : And I got the time from dispatchfilter received the request to : responsewriter write the response : It is much larger than QTime. can you be more specific about what you mean when you say "And I got the time from dispatchfilte

Re: Nightly build compile error?

2008-04-09 Thread Chris Hostetter
: : Hello everyone. I downloaded the latest nightly build from : http://people.apache.org/builds/lucene/solr/nightly/. When I tried to : compile it, I got the following errors: : : [javac] Compiling 189 source files to : /home/csweb/apache-solr-nightly/build/core : [javac] : /home/csweb/apac

Re: Nightly build compile error?

2008-04-09 Thread oleg_gnatovskiy
hossman wrote: > > : > : Hello everyone. I downloaded the latest nightly build from > : http://people.apache.org/builds/lucene/solr/nightly/. When I tried to > : compile it, I got the following errors: > : > : [javac] Compiling 189 source files to > : /home/csweb/apache-solr-nightly/build/cor

Re: Distributed Search

2008-04-09 Thread oleg_gnatovskiy
Do you have any suggestions as to how we would be able to implement chain collapse over the entire distributed index? Our collection is 27 GB, 15 million documents. Do you think there is a way to optimize Solr performance enough to not have to segment such a large collection? Yonik Seeley wrote:

Re: Distributed Search

2008-04-09 Thread Yonik Seeley
On Wed, Apr 9, 2008 at 1:57 PM, oleg_gnatovskiy <[EMAIL PROTECTED]> wrote: > Do you have any suggestions as to how we would be able to implement chain > collapse over the entire distributed index? Our collection is 27 GB, 15 > million documents. Do you think there is a way to optimize Solr perfo

Re: Distributed Search

2008-04-09 Thread oleg_gnatovskiy
Yonik Seeley wrote: > > On Wed, Apr 9, 2008 at 1:57 PM, oleg_gnatovskiy > <[EMAIL PROTECTED]> wrote: >> Do you have any suggestions as to how we would be able to implement >> chain >> collapse over the entire distributed index? Our collection is 27 GB, 15 >> million documents. Do you think t

Re: Payloads in Solr

2008-04-09 Thread pgwillia
I started this thread back in November. Recall that I'm indexing XML and storing the XPath as a payload in each token. I am not encoding or mapping the XPath but storing the text directly as String.getBytes(). We're not using this to query in any way, just to add context to our search results.

Re: indexing slow, IO-bound?

2008-04-09 Thread Norberto Meijome
On Mon, 7 Apr 2008 16:37:48 -0400 "Yonik Seeley" <[EMAIL PROTECTED]> wrote: > On Mon, Apr 7, 2008 at 4:30 PM, Mike Klaas <[EMAIL PROTECTED]> wrote: > > 'top', 'vmstat' tell exactly what's going on in terms of io and cpu on > > unix. Perhaps someone has gotten these to work under windows with cyg

Re: Return the result only field A or field B is non-zero?

2008-04-09 Thread Vinci
Hi, thank you, Underwood. That still does not give me the solution... would I do the boolean operation for every query (query AND (isAZero OR isBZero)) if I have the boolean fields? ***Adding boolean fields would require a large update to the document structure, which may not be preferred... can Solr generate this field for

Re: How to improve Solr search performance

2008-04-09 Thread Eason . Lee
> > > can you be more specific about what you mean when you say "And I got the > time from dispatchfilter..." What *exactly* are you looking at (ie: is > this a time you are seeing in a log file? if so, which log file? ... is this > timing code you added to the dispatch filter yourself? what *exactl

Human Powered Search Module

2008-04-09 Thread Sushan Rungta
Hello everybody, I am a newbie to Lucene and I am from India, currently working on a search module for our classified website, clickindia.com. I have implemented the basic functionality of Solr/Lucene and am pretty happy with the results. Search in India has its own share of n