Re: Searching multiple indices (solr newbie)

2007-01-04 Thread Chris Hostetter
: The issue is best described with an example: : search for canon - matches multiple categories, which will have very : different schemas : http://cnet.search.com/search?chkpt=astg.cnet.fd.search.cnet&q=canon&tag=srch I just so happen to have a bit of insight into how that page works, and while i

Re: Custom scorer...

2007-01-04 Thread Mike Klaas
On 1/4/07, escher2k <[EMAIL PROTECTED]> wrote: Yonik, I have my own Similarity and now need to write my own Scorer for multi-field scoring. Is extending DisMaxQueryScorer the way to go? Hmm, have you checked out DisjunctionSumQuery from Lucene? I think it might be close to what you are loo

Re: Custom scorer...

2007-01-04 Thread escher2k
Yonik, I have my own Similarity and now need to write my own Scorer for multi-field scoring. Is extending DisMaxQueryScorer the way to go? Thanks. Yonik Seeley wrote: > > On 1/3/07, escher2k <[EMAIL PROTECTED]> wrote: >>I only saw options to specify the default operator (AND|OR) and to

Re: How to tell the highlighter not to escape?

2007-01-04 Thread Chris Hostetter
: it sure seems to me that if SOLR is returning XML, it might as well return : XML with real markup through and through instead of exploiting : pseudo-markup. if there is concern about introducing validation errors, then : perhaps you could use namespaces in the XML and put the highlighting markup

Re: Handling disparate data sources in Solr

2007-01-04 Thread Alan Burlison
Original problem statement: -- I'm considering using Solr to replace an existing bare-metal Lucene deployment - the current Lucene setup is embedded inside an existing monolithic webapp, and I want to factor out the search functionality into a separate webapp so it can be reused more e

Re: result grouping?

2007-01-04 Thread Yonik Seeley
On 1/4/07, Luis Neves <[EMAIL PROTECTED]> wrote: Yonik Seeley wrote: > Off the top of my head, one could use a priority queue that can change > its size dynamically. One could increment a group count for each hit > (like faceted search with the FieldCache) and if the group count > exceeds "n",
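
Editor's sketch, not from the thread: the snippet above is cut off before it says what happens once a group's count exceeds "n", so the Python below only illustrates one possible reading, in which each group keeps at most its n best hits. The function name `top_hits_per_group`, the `(doc_id, group, score)` tuple layout, and the use of one small heap per group (rather than a single resizable priority queue) are all assumptions made for illustration. It is not Solr or Lucene code.

```python
import heapq
from collections import defaultdict


def top_hits_per_group(hits, n, k):
    """Return the k best hits overall, keeping at most n hits per group.

    hits: iterable of (doc_id, group, score) tuples in any order
          (hypothetical layout, chosen for this sketch).
    """
    # One bounded min-heap per group; the root is that group's weakest kept hit.
    per_group = defaultdict(list)
    for doc_id, group, score in hits:
        heap = per_group[group]
        if len(heap) < n:
            heapq.heappush(heap, (score, doc_id))
        elif score > heap[0][0]:
            # Group is already at its cap: evict its weakest hit.
            heapq.heapreplace(heap, (score, doc_id))

    # Merge the surviving hits from every group and take the overall top k.
    survivors = [
        (score, doc_id, group)
        for group, heap in per_group.items()
        for score, doc_id in heap
    ]
    best = heapq.nlargest(k, survivors, key=lambda item: item[0])
    return [(doc_id, group, score) for score, doc_id, group in best]
```

With n = 2 and k = 3, three strong hits from one site followed by weaker hits from two other sites would yield the first site's two best hits plus the best hit from the next site.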

Re: result grouping?

2007-01-04 Thread Luis Neves
Yonik Seeley wrote: On 1/3/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: thanks. Yes, the presentation layer could group results, but that is not practical if i want to show the first 20 results out of 200,000 matches. Nutch groups the results by site. Any idea how they do it? Good question.

Re: Searching multiple indices (solr newbie)

2007-01-04 Thread Mekin Maheshwari
Thanks Chris. http://wiki.apache.org/solr/FederatedSearch That's useful & I might be getting close to that size soon. The issue is best described with an example: search for canon - matches multiple categories, which will have very different schemas http://cnet.search.com/search?chkpt

Re: How to tell the highlighter not to escape?

2007-01-04 Thread Edward Garrett
just to add a note on this, the whole idea of inserting "pseudo-markup" into XML text elements seems to be pretty much in disrepute, and certainly caused many complaints about RSS 1.0, see e.g. http://www.biglist.com/lists/xsl-list/archives/200505/msg00316.html in xsl, you **can** use disable-ou

Re: Handling disparate data sources in Solr

2007-01-04 Thread Alan Burlison
Chris Hostetter wrote: For your purposes, if you've got a system that works and does the Document conversion for you, then you are probably right: Solr may not be a useful addition to your architecture. Solr doesn't really attempt to solve the problem of parsing different kinds of data streams