Help with word frequency / tag clouds

2008-08-16 Thread Gene Campbell
Hello Solrites, I'm somewhat new to Solr and Lucene. I would like to build a tag cloud based on a filtered set of words from documents. I have a master list of approved tags. So, what I need from each document is the list of words and frequencies such that the words appear in the master list (
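
The per-document filtering described above can be sketched outside Solr as follows. This is only an illustration under stated assumptions: it uses a simple regex tokenizer rather than a Solr/Lucene analyzer, and the function name `tag_frequencies` and the sample text are hypothetical, not taken from the thread.

```python
import re
from collections import Counter

def tag_frequencies(text, approved_tags):
    """Count how often each approved tag occurs in one document's text.

    Hypothetical helper: tokenization is a lowercase alphanumeric regex,
    which is an assumption and not how a Solr analyzer would tokenize.
    """
    approved = {tag.lower() for tag in approved_tags}
    words = re.findall(r"[a-z0-9]+", text.lower())
    # Keep only words that appear in the master list of approved tags.
    return Counter(word for word in words if word in approved)

doc = "Solr and Lucene: Solr builds on Lucene, and Lucene does the indexing."
master_list = ["solr", "lucene", "nutch"]
frequencies = tag_frequencies(doc, master_list)
print(frequencies.most_common())
```

Summing the per-document counters over a result set would give the weights for a tag cloud; tags that never occur (here "nutch") are simply absent from the counter.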

Re: "Auto commit error" and java.io.FileNotFoundException

2008-08-16 Thread Walter Underwood
I hate to blame the JDK, but we tried 1.6 for our production webapp and it was crashing too often. Unless you need 1.6, you might try 1.5. --wunder On 8/16/08 1:54 PM, "Chris Harris" <[EMAIL PROTECTED]> wrote: > On Sat, Aug 16, 2008 at 4:33 AM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: >> What v

Re: "Auto commit error" and java.io.FileNotFoundException

2008-08-16 Thread Otis Gospodnetic
How are you adding documents? One at a time? Multiple at a time? From a single thread or multiple threads? Have you tried building the latest and greatest Lucene from trunk and using that with Solr on the Linux box? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Or

Re: "Auto commit error" and java.io.FileNotFoundException

2008-08-16 Thread Otis Gospodnetic
I'd ignore Otis' message from 2005. I haven't followed the thread carefully, but it looks like a bug deep in the guts of Lucene. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Chris Harris <[EMAIL PROTECTED]> > To: solr-user@lucene.apach

Re: "Auto commit error" and java.io.FileNotFoundException

2008-08-16 Thread Chris Harris
On Sat, Aug 16, 2008 at 4:33 AM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > Can you try Lucene's CheckIndex tool on it and report what it says? > > On Aug 15, 2008, at 1:35 PM, Chris Harris wrote: > >> I have an index (different from the ones mentioned yesterday) that was >> working fine with 3M

Using Shingles to Increase Phrase Search Performance

2008-08-16 Thread Chris Harris
Mike Klaas suggested last month that I might be able to improve phrase search performance by indexing word bigrams, aka bigram shingles. I've been playing with this, and the initial results are very promising. (I may post some performance data later.) I wanted to describe my technique, which I'm no
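
For readers unfamiliar with the term, a word bigram shingle is a token formed from each pair of adjacent words. The sketch below shows the idea only; it is not the technique from the message (which is cut off above), the function name `bigram_shingles` is hypothetical, and the whitespace tokenization and underscore separator are arbitrary assumptions.

```python
def bigram_shingles(text, sep="_"):
    """Return the word bigrams of `text`, each joined into a single token.

    Hypothetical helper: splits on whitespace and lowercases, which is an
    assumption and not a Solr/Lucene analysis chain.
    """
    tokens = text.lower().split()
    # Pair each token with its successor: (t0, t1), (t1, t2), ...
    return [sep.join(pair) for pair in zip(tokens, tokens[1:])]

print(bigram_shingles("improve phrase search performance"))
```

With such tokens in the index, a two-word phrase can be matched as a single term lookup rather than as a positional intersection of two terms, which is the usual motivation for trying shingles on phrase queries.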

Re: "Auto commit error" and java.io.FileNotFoundException

2008-08-16 Thread Chris Harris
On Sat, Aug 16, 2008 at 4:33 AM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > What version of Java do you have on Linux? The Java version on *Linux* (where I'm seeing the trouble): java version "1.6.0" OpenJDK Runtime Environment (build 1.6.0-b09) OpenJDK 64-Bit Server VM (build 1.6.0

Localisation, faceting

2008-08-16 Thread Pierre Auslaender
Hello, I have a couple of questions: 1/ Is it possible to localise query operator names without writing code? For instance, I'd like to issue queries with French operator names, e.g. ET (instead of AND), OU (instead of OR), etc. 2/ Is it possible for Solr to generate, in the XML response, th

Re: Solr Cache

2008-08-16 Thread Yonik Seeley
On Sat, Aug 16, 2008 at 12:04 AM, Tim Christensen <[EMAIL PROTECTED]> wrote: > We have two servers, with the same index load balanced. The indexes are > updated at the same time every day. Occasionally, a search on one server > will return different results from the other server, even though the da

Re: "Auto commit error" and java.io.FileNotFoundException

2008-08-16 Thread Grant Ingersoll
What version of Java do you have on Linux? Also, is this easily reproducible? How many threads are you adding documents with? What is your Auto Commit setting? Can you try Lucene's CheckIndex tool on it and report what it says? On Aug 15, 2008, at 1:35 PM, Chris Harris wrote: I have an i