Re: Using Solr with Hadoop ....

2008-11-29 Thread Erik Hatcher
On Nov 28, 2008, at 8:38 PM, Yonik Seeley wrote: Or, it would be relatively trivial to write a Lucene program to merge the indexes. FYI, such a tool exists in Lucene's API already: Erik

Re: Spellcheck for phrase queries

2008-11-29 Thread Grant Ingersoll
Hi Kalyan, Currently the spell checker does not support phrase based suggestions. See http://lucene.markmail.org/message/wdr7wsenhtuecatb?q=spellcheck+list:org%2Eapache%2Elucene%2Esolr-user and a variety of other links. Now, I think there are a couple of things you could do. 1. Try n-gram

Re: Using Solr with Hadoop ....

2008-11-29 Thread Jon Baer
HadoopEntityProcessor for the DIH? Ive wondered about this as they make HadoopCluster LiveCDs and EC2 have images but best way to make use of them is always a challenge. - Jon On Nov 29, 2008, at 3:34 AM, Erik Hatcher wrote: On Nov 28, 2008, at 8:38 PM, Yonik Seeley wrote: Or, it would b

Re: Regex Transformer Error

2008-11-29 Thread Ahmed Hammad
OK, I contributed it at: https://issues.apache.org/jira/browse/SOLR-887 I changed it to use Solr class org.apache.solr.analysis.HTMLStripReader Thank you all. Ahmed On Tue, Nov 18, 2008 at 5:49 AM, Noble Paul നോബിള്‍ नोब्ळ् < [EMAIL PROTECTED]> wrote: > On Tue, Nov 18, 2008 at 2:49 AM, Ahmed

Re: Using Solr with Hadoop ....

2008-11-29 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Sat, Nov 29, 2008 at 7:26 PM, Jon Baer <[EMAIL PROTECTED]> wrote: > HadoopEntityProcessor for the DIH? Reading data from Hadoop with DIH could be really cool There are a few very useful ones which are required badly. Most useful one would be a TikaEntityProcessor. But I do not see it solving th

Re: range queries on string field with millions of values

2008-11-29 Thread Chris Hostetter
: The results are correct. But the response time sucks. : : Reading the docs about caches, I thought I could populate the query result : cache with an autowarming query and the response time would be okay. But that : hasn't worked. (See excerpts from my solrConfig file below.) : : A repeated

Returning function values from query

2008-11-29 Thread tonypayne
I have a boost query that combines several functions to affect the score. For logging, I would like to be able to see the individual values of the functions as well as the base relevancy score for each document in the result set. So far, I've been able to work around the issue because all of my fu

boosting certain terms within one field?

2008-11-29 Thread Peter Wolanin
I've recently started working on the Drupal integration module for SOLR, and we are looking for suggestions for how to address this question: how do we boost the importance of a subset of terms within a field. For example, we are using the standard request handler for queries, and the default fie

Re: range queries on string field with millions of values

2008-11-29 Thread Naomi Dushay
Hi Hoss, Thanks for this. The terms component approach, if i understand it correctly, will be problematic. I need to present not only the next X call numbers in sequence, but other fields in those documents (e.g. title, author). I assume the Terms Component approach will only give me the