Re: Limit Porter stemmer to plural stemming only?

2008-06-30 Thread climbingrose
I modified the original English Stemmer written in Snowball language and regenerate the Java implementation using Snowball compiler. It's been working for me so far. I certainly can share the modified Snowball English Stemmer if anyone wants to use it. Cheers, Cuong On Tue, Jul 1, 2008 at 4:12 A

Re: Search query optimization

2008-06-30 Thread Chris Hostetter
: If I know that condition C will eliminate more results than either A or B, : does specifying the query as: "C AND A AND B" make it any faster (than the : original "A AND B AND C")? Nope. Lucene takes care of that for you. -Hoss

Re: Search query optimization

2008-06-30 Thread wojtekpia
If I know that condition C will eliminate more results than either A or B, does specifying the query as: "C AND A AND B" make it any faster (than the original "A AND B AND C")? -- View this message in context: http://www.nabble.com/Search-query-optimization-tp17544667p18205504.html Sent from the

Re: Efficient date-based results sorting

2008-06-30 Thread Chris Hostetter
: Subject: Efficient date-based results sorting Sorting on anything but score is done pretty much the exact same way regardless of data type. The one thing you can do to make any sorting on any field more efficient is to try and reduce the cardinality of the field -- ie: reduce the number of

Re: Limit Porter stemmer to plural stemming only?

2008-06-30 Thread Mike Klaas
If you find a solution that works well, I encourage you to contribute it back to Solr. Plural-only stemming is probably a common need (I've definitely wanted to use it before). cheers, -Mike On 30-Jun-08, at 2:25 AM, climbingrose wrote: Ok, it looks like step 1a in Porter algo does what I

Re: Solr Master Slave Architecture over NFS

2008-06-30 Thread Grant Ingersoll
I think it comes w/ some caveats, but is now workable (although it may not give great performance), assuming you're using 2.3 (2.2) or later. I would definitely do a search in the Lucene archives about NFS, especially paying attention to Mike McCandless' comments. On Jun 30, 2008, at

RE: UnicodeNormalizationFilterFactory

2008-06-30 Thread Steven A Rowe
Hi Robert, Could you create a JIRA issue and attach your code to it? That makes it easier for people to evaluate it (rather than just binary distribution). This sounds general enough to me that it would be a useful addition to Lucene itself. Solr's factory could just be sugar on top then. Th

Re: Solr Master Slave Architecture over NFS

2008-06-30 Thread Bill Au
Isn't using Lucene over NFS *not* recommended? Bill On Mon, Jun 30, 2008 at 4:27 AM, Nico Heid <[EMAIL PROTECTED]> wrote: > Hey, I'm looking for some feedback on the following setup. > Due to the architects decision I will be working with NFS not Solr's own > distribution scripts. > > A few Solr

Re: Benchmarking tools?

2008-06-30 Thread Yugang Hu
Me too. Thanks. Jacob Singh wrote: nice stuff. Please send me the test case, I'd love to see it. Thanks, Jacob Nico Heid wrote: Hi, I basically followed this: http://wiki.apache.org/jakarta-jmeter/JMeterFAQ#head-1680863678257fbcb85bd97351860eb0049f19ae I basically put all my queries in a

Re: Minimum JDK for SolrJ?

2008-06-30 Thread Noble Paul നോബിള്‍ नोब्ळ्
SolrJ needs a minimum java 5 --Noble On Mon, Jun 30, 2008 at 8:00 PM, Todd Breiholz <[EMAIL PROTECTED]> wrote: > What is the minimum JDK that can be used for developing clients that use > SolrJ? I am stuck on JDK 1.4.2 at the moment and am wondering if SolrJ is an > option for me. > > Thanks! > >

Minimum JDK for SolrJ?

2008-06-30 Thread Todd Breiholz
What is the minimum JDK that can be used for developing clients that use SolrJ? I am stuck on JDK 1.4.2 at the moment and am wondering if SolrJ is an option for me. Thanks! Todd

Re: Benchmarking tools?

2008-06-30 Thread Jacob Singh
nice stuff. Please send me the test case, I'd love to see it. Thanks, Jacob Nico Heid wrote: > Hi, > I basically followed this: > http://wiki.apache.org/jakarta-jmeter/JMeterFAQ#head-1680863678257fbcb85bd97351860eb0049f19ae > > > I basically put all my queries in a flat text file. you could eith

Re: analyzer index vs query vs {missing}

2008-06-30 Thread Norberto Meijome
On Mon, 30 Jun 2008 05:52:33 -0400 Erik Hatcher <[EMAIL PROTECTED]> wrote: > Yes, that's exactly what it means. > > Erik great, thanks for the clarification. B _ {Beto|Norberto|Numard} Meijome "A dream you dream together is reality." John Lennon I speak for mys

Re: Benchmarking tools?

2008-06-30 Thread Nico Heid
Hi, I basically followed this: http://wiki.apache.org/jakarta-jmeter/JMeterFAQ#head-1680863678257fbcb85bd97351860eb0049f19ae I basically put all my queries in a flat text file. you could either use two parameters or put them in one file. The good point of this is, that each test uses the same qu

1.3 maven artifact

2008-06-30 Thread Stefan Oestreicher
Hi, I just wanted to ask if solr 1.3 is already available as maven artifact? If it is not could you give me an estimate on when it will be? TIA, Stefan Oestreicher -- Dr. Maté GmbH Stefan Oestreicher / Entwicklung [EMAIL PROTECTED] http://www.netdoktor.at Tel Buero: + 43 1 405 55 75 24 Fax Bu

Re: analyzer index vs query vs {missing}

2008-06-30 Thread Erik Hatcher
Yes, that's exactly what it means. Erik On Jun 30, 2008, at 3:01 AM, Norberto Meijome wrote: hi there, when defining a field type, i understand the meaning of 'analyzer type="index"' , or type="query". What does it mean when the type is missing? does it apply at both index and que

Re: Limit Porter stemmer to plural stemming only?

2008-06-30 Thread climbingrose
Ok, it looks like step 1a in Porter algo does what I need. On Mon, Jun 30, 2008 at 6:39 PM, climbingrose <[EMAIL PROTECTED]> wrote: > Hi all, > Porter stemmer in general is really good. However, there are some cases > where it doesn't work. For example, "accountant" matches "Accountant" as > well

Re: Benchmarking tools?

2008-06-30 Thread Jacob Singh
Hi Nico, Thanks for the info. Do you have you scripts available for this? Also, is it configurable to give variable numbers of facets and facet based searches? I have a feeling this will be the limiting factor, and much slower than keyword searches but I could be (and usually am) wrong. Best,

Limit Porter stemmer to plural stemming only?

2008-06-30 Thread climbingrose
Hi all, Porter stemmer in general is really good. However, there are some cases where it doesn't work. For example, "accountant" matches "Accountant" as well as "Account Manager" which isn't desirable. Is it possible to use this analyser for plural words only? For example: +Accountant -> accountant

RE: Benchmarking tools?

2008-06-30 Thread Nico Heid
Hi, I did some trivial Tests with Jmeter. I set up Jmeter to increase the number of threads steadily. For requests I either usa a random word or combination of words in a wordlist or some sample date from the test system. (this is described in the JMeter manual) In my case the System works fine as

Solr Master Slave Architecture over NFS

2008-06-30 Thread Nico Heid
Hey, I'm looking for some feedback on the following setup. Due to the architects decision I will be working with NFS not Solr's own distribution scripts. A few Solr indexing machines use Multicore to divide the 300.000 Users to 1000 shards. For several reasons we have to go with per user shardin

analyzer index vs query vs {missing}

2008-06-30 Thread Norberto Meijome
hi there, when defining a field type, i understand the meaning of 'analyzer type="index"' , or type="query". What does it mean when the type is missing? does it apply at both index and query ? This can be found in the example's schema.xml :