Re: One big XML file vs. many HTTP requests

2006-05-12 Thread Erik Hatcher
On May 12, 2006, at 1:02 PM, Michael Levy wrote: One nice feature of INQUERY is that you can create one large SGML file, containing lots of records, each bracketed with and DOC> tags. Submitting that big SGML document for indexing goes very fast. I believe that Solr indexes one document at

Re: Leveraging filter chache in queries

2006-05-12 Thread Erik Hatcher
On May 12, 2006, at 9:06 AM, Fabio Confalonieri wrote: I see our needs have already surfaced in the mailing list, it's the refine search problem You have sometime called faceted browsing and which is the base of CNet browsing architecture: we have ads with different categories which have d

Re: One big XML file vs. many HTTP requests

2006-05-12 Thread Yonik Seeley
On 5/12/06, Michael Levy <[EMAIL PROTECTED]> wrote: How efficient is making a separate HTTP request per-document, when there are millions of documents? If you use persistent connections and add make multiple requests in parallel, there won't be much difference than multiple docs per request. -

One big XML file vs. many HTTP requests

2006-05-12 Thread Michael Levy
Greetings, I'm evaluating using Solr under Tomcat to replace a number of text searching projects that currently use UMASS's INQUERY, an older search engine. One nice feature of INQUERY is that you can create one large SGML file, containing lots of records, each bracketed with and tags. S

Re: Leveraging filter chache in queries

2006-05-12 Thread Yonik Seeley
On 5/12/06, Fabio Confalonieri <[EMAIL PROTECTED]> wrote: I tried a query like "+field:value^0" which returns a great number of Hits (on a total test of 100.000 documents), but I see only the query cache growing and the filter cache always empty. Is this normal ? I've tried to check all the cache

Leveraging filter chache in queries

2006-05-12 Thread Fabio Confalonieri
Hello, I've just fond Lucene and Solr and I'm thinking of using them in our current project, essentially an ads portal (something very similar to www.oodle.com). I see our needs have already surfaced in the mailing list, it's the refine search problem You have sometime called faceted browsing an