Re: INTERNET ARCHIVE goes SOLR!

2007-02-01 Thread Tracey Jaquith
oh, tee hee if our eternal admiration and gratitude isn't obvious...  8-) i concur, the amount one *can* customize simply from the XML configuration and schema is fantastically impressive!  almost all of the configuration setup is quick to do in our experience, too --tracey Walter Unde

Re: INTERNET ARCHIVE goes SOLR!

2007-02-01 Thread Walter Underwood
On 1/27/07 1:12 PM, "Tracey Jaquith" <[EMAIL PROTECTED]> wrote: > > * To be fair, Michael StAck (our greatest help for prior SE "life support") > has smartly pointed out that by making a smarter schema and strategy, > I could reduce the number of fields searched from 677 to 5, with the > sam

Re: INTERNET ARCHIVE goes SOLR!

2007-02-01 Thread Tracey Jaquith
Message From: Tracey Jaquith <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org Sent: Sunday, January 28, 2007 5:12:44 AM Subject: INTERNET ARCHIVE goes SOLR! Internet Archive on Monday afternoon switched over to SOLR! We converted from a badly deteriorating "home grown" s

Re: INTERNET ARCHIVE goes SOLR!

2007-01-30 Thread Walter Underwood
On 1/29/07 7:46 PM, "Ryan McKinley" <[EMAIL PROTECTED]> wrote: >> >> Your argument is a good one, and I buy it. However, I've never had a >> case where a user typing "multiple words" where the expectation was >> for OR, it is always AND. > > But there are many cases where the expectation is to t

Re: INTERNET ARCHIVE goes SOLR!

2007-01-29 Thread Erik Hatcher
On Jan 29, 2007, at 11:01 PM, Chris Hostetter wrote: if there are cases where DisMax isn't the right choice for raw user input ... i'm not aware of them, but i'd love to hear about them :) Ok, ok, ok... I'm a self-admitted dismax avoider thus far. I'll remedy that by building in dismax ca

Re: INTERNET ARCHIVE goes SOLR!

2007-01-29 Thread Erik Hatcher
On Jan 29, 2007, at 10:46 PM, Ryan McKinley wrote: Your argument is a good one, and I buy it. However, I've never had a case where a user typing "multiple words" where the expectation was for OR, it is always AND. But there are many cases where the expectation is to get the best results

Re: INTERNET ARCHIVE goes SOLR!

2007-01-29 Thread Yonik Seeley
On 1/29/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: there is no prefix operator for "OR" so if the default is "AND" there is no way at request time to indicate that some clauses should be optional without reverting to the ugly and misleading binary operator syntax ... Perhaps that's somethi

Re: INTERNET ARCHIVE goes SOLR!

2007-01-29 Thread Chris Hostetter
: > case where a user typing "multiple words" where the expectation was : > for OR, it is always AND. if the input you are passing in comes straight from a user -- and that user doesn't understand the Lucene query syntax -- i'd argue that StandardRequestHandler is the wrong choice, and you should

Re: INTERNET ARCHIVE goes SOLR!

2007-01-29 Thread Ryan McKinley
Your argument is a good one, and I buy it. However, I've never had a case where a user typing "multiple words" where the expectation was for OR, it is always AND. But there are many cases where the expectation is to get the best results possible. With AND you get zero results even when the

Re: INTERNET ARCHIVE goes SOLR!

2007-01-29 Thread Erik Hatcher
On Jan 29, 2007, at 6:15 PM, Chris Hostetter wrote: : > We override defaultOperator of "OR" to "AND". : : We really ought to make AND the default anyway. No, no, no, no, No.. :) Your argument is a good one, and I buy it. However, I've never had a case where a user typing

Re: INTERNET ARCHIVE goes SOLR!

2007-01-29 Thread Chris Hostetter
congrats on the successful roll-out Tracey, : We don't use DisMax and (as of now) do not use faceting. : And finally, the hardest part to convert to Solr. : I had to write a PHP front-end custom converter to take our query strings, : parse the clauses and lucene syntax into pieces, and

Re: INTERNET ARCHIVE goes SOLR!

2007-01-29 Thread Chris Hostetter
: > We override defaultOperator of "OR" to "AND". : : We really ought to make AND the default anyway. No, no, no, no, No.. there is no prefix operator for "OR" so if the default is "AND" there is no way at request time to indicate that some clauses should be optional without rev
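
A rough illustration of the point above about prefix operators, as a toy model only: this is not Solr's or Lucene's query parser, and the `matches` function and the sample terms are made up for the sketch. It assumes the usual Lucene prefix syntax, where "+" marks a clause as required and "-" as prohibited, and shows that with a default of OR a request can still make a clause required, while with a default of AND there is no prefix left to make a clause optional.

```python
def matches(query, doc_terms, default_operator="OR"):
    """Toy boolean matcher (hypothetical helper, not a Solr/Lucene API).

    Clauses are whitespace-separated terms. "+term" is required,
    "-term" is prohibited, and an unprefixed term takes its meaning
    from default_operator: optional under "OR", required under "AND".
    """
    required, optional, prohibited = [], [], []
    for clause in query.split():
        if clause.startswith("+"):
            required.append(clause[1:])
        elif clause.startswith("-"):
            prohibited.append(clause[1:])
        elif default_operator == "AND":
            # Under AND there is no prefix that could move a term
            # back into the optional list.
            required.append(clause)
        else:
            optional.append(clause)

    terms = set(doc_terms)
    if any(t in terms for t in prohibited):
        return False
    if not all(t in terms for t in required):
        return False
    # With no required clauses, at least one optional clause must hit.
    if not required and optional:
        return any(t in terms for t in optional)
    return True


doc = ["grateful", "dead", "live"]
print(matches("grateful concert", doc, "OR"))   # one optional term hits
print(matches("grateful concert", doc, "AND"))  # "concert" is now required
print(matches("+grateful concert", doc, "OR"))  # "+" upgrades a clause to required
```

Under the OR default, "+grateful concert" keeps "concert" optional while still requiring "grateful"; under the AND default the same request has no way to say that "concert" is merely nice to have.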

Re: INTERNET ARCHIVE goes SOLR!

2007-01-28 Thread Erik Hatcher
Tracey, Thanks for sharing! This is a fantastic case study. Your experiences and numbers will certainly be a major sales pitch for bringing Solr into more organizations. On Jan 27, 2007, at 4:12 PM, Tracey Jaquith wrote: We've done essentially no optimizing outside of focusing on a "s

Re: INTERNET ARCHIVE goes SOLR!

2007-01-27 Thread Otis Gospodnetic
:12:44 AM Subject: INTERNET ARCHIVE goes SOLR! Internet Archive on Monday afternoon switched over to SOLR! We converted from a badly deteriorating "home grown" server that was made up of java + jetty ( + rsync for replication) + an older version of lucene. I make some compar

INTERNET ARCHIVE goes SOLR!

2007-01-27 Thread Tracey Jaquith
Internet Archive on Monday afternoon switched over to SOLR! We converted from a badly deteriorating "home grown" server that was made up of java + jetty ( + rsync for replication) + an older version of lucene. I make some comparisons of SOLR vs. "prior" using "[]" notes below. I parsed 2 d