Re: Realtime directory change...

2006-12-22 Thread Thorsten Scherler
On Thu, 2006-12-21 at 12:23 -0800, escher2k wrote: > Hi, > We currently use Lucene to do index user data every couple of hours - the > index is completely rebuilt, > the old index is archived and the new one copied over to the directory. > Example - > > /bin/cp ${LOG_FILE} ${CRON_ROOT}/index/hel

Re: Help with spellchecker integration

2006-12-22 Thread Thorsten Scherler
On Thu, 2006-12-21 at 21:27 -0800, Otis Gospodnetic wrote: > Hi, > I'm trying to integrate the Lucene-based spellchecker > (http://wiki.apache.org/jakarta-lucene/SpellChecker + contrib/spellchecker > under Lucene) with Solr (http://issues.apache.org/jira/browse/SOLR-81) in > order to provide a

Re: Happy Solstice

2006-12-22 Thread Kevin Lewandowski
Yes! There's no shortage of puns when using solr. We're always talking about "creating a solr system" or "one of the solr systems is down" :) On 12/21/06, Erik Hatcher <[EMAIL PROTECTED]> wrote: It's all about "sol(a)r", ya know? More day light, please!

Re: Multiple indexes...

2006-12-22 Thread Bill Au
While it may be easier from a management perspective, there are also disadvantages with runnning multiple indices within a single appserver port: - they are sharing the memory of a single JVM so the size of the cache has to be smaller then running a singler instance per appserver instance - a si

Re: Happy Solstice

2006-12-22 Thread Greg Ludington
I'm still surprised we have solr plugins instead of "Solr Panels." On 12/22/06, Kevin Lewandowski <[EMAIL PROTECTED]> wrote: Yes! There's no shortage of puns when using solr. We're always talking about "creating a solr system" or "one of the solr systems is down" :) On 12/21/06, Erik Hatcher <[

Re: Help with spellchecker integration

2006-12-22 Thread Otis Gospodnetic
Hi Thorsten, Some comments to your comments, inlined and prefixed with "OG". - Original Message From: Thorsten Scherler <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org Sent: Friday, December 22, 2006 5:53:19 AM Subject: Re: Help with spellchecker integration On Thu, 2006-12-21 at 21

Re: Help with spellchecker integration

2006-12-22 Thread Mike Klaas
On 12/22/06, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: OG: Yes, adding those separate fieldtype definitions was my attempt at getting separate sets of n-grams of different sizes: uni-bram, bi-gram... But how do I get "3start", "4start", "2end", and "4end"? It looks like I'd have to do this:

Solr 1.1 released

2006-12-22 Thread Yonik Seeley
Solr 1.1 is now available for download! This is the first official release since Solr entered the Incubator. The release is available at http://people.apache.org/dist/incubator/solr/1.1/ and the detailed changelog is at http://people.apache.org/dist/incubator/solr/1.1/CHANGES.txt Thanks to ev

Re: Solr 1.1 released

2006-12-22 Thread Mike Klaas
On 12/22/06, Yonik Seeley <[EMAIL PROTECTED]> wrote: Solr 1.1 is now available for download! This is the first official release since Solr entered the Incubator. The release is available at http://people.apache.org/dist/incubator/solr/1.1/ and the detailed changelog is at http://people.apac

Re: Help with spellchecker integration

2006-12-22 Thread Otis Gospodnetic
Hi Mike, Thanks, that (what you said in the end) is precisely what I ended up doing. I'll post a new patch to SOLR-81 shortly. Otis - Original Message From: Mike Klaas <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org Sent: Friday, December 22, 2006 5:23:42 PM Subject: Re: Help with

Handling disparate data sources in Solr

2006-12-22 Thread Alan Burlison
Hi, I'm considering using Solr to replace an existing bare-metal Lucene deployment - the current Lucene setup is embedded inside an existing monolithic webapp, and I want to factor out the search functionality into a separate webapp so it can be reused more easily. At present the content of

Re: SolrDistribution is tested by FreeBSD?

2006-12-22 Thread Yonik Seeley
On 12/21/06, James liu <[EMAIL PROTECTED]> wrote: i use fb 6.1 when i use # sh rsyncd-enable it show me: cd: can't cd to rsyncd-enable/.. Try executing it directly, or passing it's full path. The current ${0%/*} trick to get the exe path only works if there is more than one path element. $ c

Re: Handling disparate data sources in Solr

2006-12-22 Thread Otis Gospodnetic
Alan, omitNorms let's you not use field norms for certain field when calculating document matching score. This can save you some RAM. See http://issues.apache.org/jira/browse/LUCENE-448 . For position increment gap, have a look at http://lucene.apache.org/java/docs/api/org/apache/lucene/analy

Re: Handling disparate data sources in Solr

2006-12-22 Thread Mike Klaas
On 12/22/06, Alan Burlison <[EMAIL PROTECTED]> wrote: At present the content of the Lucene index comes from many different sources (web pages, documents, blog posts etc) and can be different formats (plaintext, HTML, PDF etc). All the various content types are rendered to plaintext before being