Newspad using solr

2007-10-06 Thread Jed Reynolds
PRWeb's Newspad.com search has been using a replicated Solr setup since June 11, 2007. In that time, and I'm just checking the admin page on the query server...3,000,000 requests since June across 350,000 documents. This hardly taxes the server; its load is about 0.20 with 20 rather sleepy apa

Re: Multiple Values -Structured?

2007-09-03 Thread Jed Reynolds
Bharani wrote: Hi, I have got two sets of documents: 1) Primary Document 2) Occurrences of primary document. Since there is no such thing as "join", I can either a) post the primary document with occurrences as a multi-valued field or b) post the primary document for every occurrence, i.e. class

Re: minimum occurances of term in document

2007-08-30 Thread Jed Reynolds
Mike Klaas wrote: On 30-Aug-07, at 4:01 PM, Chris Hostetter wrote: You could accomplish the goal without any coding by using phrase queries: "calico calico calico"~1 will match only documents that have at least three occurrences of calico. If this is performant enough, you are done. O
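
The phrase-query trick quoted above can be sketched as a small helper that builds the query string. This is only an illustration: the function name `min_occurrences_query` is hypothetical and not part of Solr or Lucene, the default slop of 1 simply mirrors the quoted example, and the code produces query text without contacting any server.

```python
def min_occurrences_query(term: str, count: int, slop: int = 1) -> str:
    """Build a sloppy phrase query that repeats `term` `count` times.

    Hypothetical helper: it mirrors the quoted example
    "calico calico calico"~1 and does no analysis or escaping
    beyond rejecting obviously unsafe input.
    """
    if count < 1:
        raise ValueError("count must be at least 1")
    if not term or '"' in term or any(ch.isspace() for ch in term):
        raise ValueError("term must be a single token without quotes")
    # Repeat the term so the phrase can only match when it occurs `count` times.
    phrase = " ".join([term] * count)
    return f'"{phrase}"~{slop}'


if __name__ == "__main__":
    print(min_occurrences_query("calico", 3))  # "calico calico calico"~1
```

Whether this performs well enough on a given index is a separate question, as the quoted message itself notes.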

Re: minimum occurances of term in document

2007-08-30 Thread Jed Reynolds
Mike Klaas wrote: On 30-Aug-07, at 1:22 PM, Jed Reynolds wrote: Jed Reynolds wrote: Apologies if this is in the Lucene FAQ, but I was looking thru the Lucene syntax and I just didn't see it. Is there a way to search for documents that have a certain number of occurrences of a term i

Re: minimum occurances of term in document

2007-08-30 Thread Jed Reynolds
Jed Reynolds wrote: Apologies if this is in the Lucene FAQ, but I was looking thru the Lucene syntax and I just didn't see it. Is there a way to search for documents that have a certain number of occurrences of a term in the document? Like, I want to find all documents that have the

minimum occurances of term in document

2007-08-30 Thread Jed Reynolds
Apologies if this is in the Lucene FAQ, but I was looking thru the Lucene syntax and I just didn't see it. Is there a way to search for documents that have a certain number of occurrences of a term in the document? Like, I want to find all documents that have the term Calico mentioned three

Re: Replication script file issues..

2007-07-19 Thread Jed Reynolds
Matthew Runo wrote: It seems that as soon as I get a commit, snapshooter goes wild. I have 1107 running instances of snapshooter right now.. I suspect you've got pathing and/or permissions issues. First try running snapshooter -v, and it will be louder. I've often had to dig in deeper, tho.

success! Newspad lives anew!

2007-07-18 Thread Jed Reynolds
I'd like to thank everyone that created and helped bring us Solr. Newspad is working awesomely. http://www.newspad.com/ And sorting in 1.2.0 is going to be such a bonus! Thanks! Jed

Re: Restrict Servlet Access

2007-03-14 Thread Jed Reynolds
Gunther, Andrew wrote: What are people doing to restrict UpdateServlet access on production installs of Solr? Are people removing that option and rotating in a new index, or restricting access from the Jetty side? I'm putting Solr on my DMZ without direct WAN access. If I had to put it on

Re: Federated Search

2007-03-10 Thread Jed Reynolds
Venkatesh Seetharam wrote: The hash idea sounds really interesting and if I had a fixed number of indexes it would be perfect. I'm in fact looking around for a reverse-hash algorithm wherein, given a docId, I should be able to find which partition contains the document so I can save cycles
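
A minimal sketch of the hash idea under discussion, assuming a fixed number of partitions. The function name `partition_for` and the choice of MD5 are illustrative assumptions, not anything taken from the thread or from Solr. Because the hash is deterministic, the same function that routes a document at index time also answers "which partition holds this docId?" at lookup time, so no separate reverse hash is needed in that case.

```python
import hashlib


def partition_for(doc_id: str, num_partitions: int) -> int:
    """Map a document id to a partition index in [0, num_partitions).

    Hypothetical helper: uses MD5 only as a stable, well-distributed
    hash (not for security), so the result is the same on every run
    and on every machine.
    """
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")
    digest = hashlib.md5(doc_id.encode("utf-8")).digest()
    # Use the first 8 bytes as an unsigned integer, then reduce modulo
    # the partition count.
    return int.from_bytes(digest[:8], "big") % num_partitions


if __name__ == "__main__":
    # Indexing and lookup call the same function with the same count.
    print(partition_for("doc-42", 4))
```

The caveat raised in the message still applies: if the number of partitions changes, most ids map to a different partition, so this simple modulo scheme only fits a fixed partition count.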

Re: merely a suggestion: schema.xml validator or better schema validation logging

2007-03-03 Thread Jed Reynolds
Chris Hostetter wrote: : I almost didn't notice the exception fly by because there's so much : log output, and I can see why I might not have noticed. Yay for you should be able to configure it to put WARNING and SEVERE messages in a separate log file even. Certainly! I learned to r

Re: merely a suggestion: schema.xml validator or better schema validation logging

2007-03-03 Thread Jed Reynolds
Yonik Seeley wrote: On 3/2/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: How do you all feel about returning an error when you add a document with unknown fields? +1 dynamicField definitions can be used if desired (including "*" to match every undefined field). If dynamicField definitions ar

Re: JVM random crashes

2007-03-03 Thread Jed Reynolds
Yonik Seeley wrote: On 3/3/07, Dimitar Ouzounov <[EMAIL PROTECTED]> wrote: But what hardware problem could it be? Tomorrow I'll make sure that the memory is fine, but nothing else comes to my mind. Memory, motherboard, etc. Try http://www.memtest86.com/ to test this. It may be OS-related - p

Re: merely a suggestion: schema.xml validator or better schema validation logging

2007-03-03 Thread Jed Reynolds
Bertrand Delacretaz wrote: On 3/3/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: ...The rationale with the solrconfig stuff is that a broken config should behave as best it can. This is great if you are running a real site with people actively using it - it is a pain in the ass if you are getti

Re: merely a suggestion: schema.xml validator or better schema validation logging

2007-03-02 Thread Jed Reynolds
Ryan McKinley wrote: On 3/2/07, Yonik Seeley <[EMAIL PROTECTED]> wrote: On 3/2/07, Ryan McKinley <[EMAIL PROTECTED]> wrote: > The rationale with the solrconfig stuff is that a broken config should > behave as best it can. I don't think that's what I was actually going for in this instance (the

Re: merely a suggestion: schema.xml validator or better schema validation logging

2007-03-02 Thread Jed Reynolds
Ryan McKinley wrote: I almost didn't notice the exception fly by because there's so much log output, and I can see why I might not have noticed. Yay for scrollback! (Hrm, I might not have wanted to watch logging for 4 instances of solr all at once. Might explain why so much logging.) This h

Re: merely a suggestion: schema.xml validator or better schema validation logging

2007-03-02 Thread Jed Reynolds
Yonik Seeley wrote: If the actual schema was null, then that was probably some problem parsing the schema. If that's the case, hopefully you saw an exception in the logs on startup? Using apache-solr-1.1.0-incubating. Actually not at first, but now I do. But I've gone back and re-created

merely a suggestion: schema.xml validator or better schema validation logging

2007-03-01 Thread Jed Reynolds
First time user. Not interested in flamewar, just making a suggestion. I just got Solr working with my own schema and it was only a little more mysterious than I expected, having previously dealt with Nutch. Solr is exactly what I wanted in terms of (theoretical) ease of configurability. How