RE: Embedded about 50% faster for indexing

2007-08-27 Thread Sundling, Paul
27, 2007 5:50 PM To: solr-user@lucene.apache.org Subject: Re: Embedded about 50% faster for indexing On 27-Aug-07, at 12:44 PM, Sundling, Paul wrote: > Whether embedded solr should give me a performance boost or not, it > did. > :) I'm not surprised, since it skips XML parsing. Alt

RE: Embedded about 50% faster for indexing

2007-08-27 Thread Sundling, Paul
xes often. So generally this is moot, but it's still interesting. :) Paul Sundling -Original Message----- From: Sundling, Paul Sent: Monday, August 27, 2007 12:44 PM To: solr-user@lucene.apache.org Subject: RE: Embedded about 50% faster for indexing Whether embedded solr should give me a

RE: Embedded about 50% faster for indexing

2007-08-27 Thread Sundling, Paul
Whether embedded solr should give me a performance boost or not, it did. :) I'm not surprised, since it skips XML parsing. Although you never know where cycles are used for sure until you profile. I tried doing more records per post (200) and it was actually slightly slower and seemed to require

RE: Embedded about 50% faster for indexing

2007-08-24 Thread Sundling, Paul
ginal Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Yonik Seeley Sent: Friday, August 24, 2007 2:07 PM To: solr-user@lucene.apache.org Subject: Re: Embedded about 50% faster for indexing On 8/24/07, Sundling, Paul <[EMAIL PROTECTED]> wrote: > Created two indexer i

RE: clear index

2007-08-24 Thread Sundling, Paul
If that happens, then using that specific query should be added to the FAQ for how to clear an index. Paul Sundling -Original Message- From: Chris Hostetter [mailto:[EMAIL PROTECTED] Sent: Tuesday, August 21, 2007 6:41 PM To: solr-user@lucene.apache.org Subject: RE: clear index : I'm

Embedded about 50% faster for indexing

2007-08-24 Thread Sundling, Paul
Created two indexer implementations to test HTTP Post versus Embedded and the performance was 54.6% faster on embedded. Thought others might find that interesting that are using Java. Paul Sundling

RE: clear index

2007-08-21 Thread Sundling, Paul
> optimize) should do the trick: > > *:* > > cheers, > Piete > > > On 21/08/07, Sundling, Paul <[EMAIL PROTECTED]> wrote: > > > > what is the best approach to clearing an index? > > > > The use case is that I'm doing some performa

clear index

2007-08-20 Thread Sundling, Paul
what is the best approach to clearing an index? The use case is that I'm doing some performance testing with various index sizes. In between indexing (embedded and soon HTTP/XML) I need to clear the index so I have a fresh start. What's the best approach, close the index and delete the files?

RE: Too many open files

2007-08-10 Thread Sundling, Paul
Good link, I added it to the wiki page where optimization is covered, http://wiki.apache.org/solr/SolrPerformanceFactors . Paul Sundling -Original Message- From: Stu Hood [mailto:[EMAIL PROTECTED] Sent: Thursday, August 09, 2007 9:23 AM To: solr-user@lucene.apache.org Subject: RE: Too m

EmbeddedSolr and optimize

2007-08-09 Thread Sundling, Paul
http://wiki.apache.org/solr/EmbeddedSolr Following the example on connecting to the Index directly without using HTTP, I tried to optimize by passing the true flag to the CommitUpdateCommand. When optimizing an index with Lucene directly it doubles the size of the index temporarily and then del

Solr and Chines/Japenese

2007-07-26 Thread Sundling, Paul
Are there any known Solr sites that are in Chinese or Japenese? I need to include links to such sites for a comparison I'm doing on enterprise search engines. I realize that if I stay UTF-8 it should work and I can use the CJK analyzer. Paul Sundling

EdgeNGramTokenizer errors in eclipse

2007-07-24 Thread Sundling, Paul
I checked out the latest solr source code from subversion and put it in an eclipse project. I used all the jars for the project (had to add junit). I get errors in eclipse about two constants not being defined in one of the library jars: (based on imports org.apache.lucene.analysis.ngram.EdgeNG