Re: One big XML file vs. many HTTP requests

2006-05-22 Thread Chris Hostetter
: ...but you can't simply : FIELDNAME:* : or : * That's because the Lucene query parser doesn't support 100% wildcard queries. : What is the best way to delete all records, for example if you want to clear : out the entire index and reindex everything? if you really want to make sure *EVERYTHI

Re: One big XML file vs. many HTTP requests

2006-05-22 Thread Michael Levy
It seems you can do something like FIELDNAME:a* and FIELDNAME:b* ...but you can't simply FIELDNAME:* or * The demo post.sh returns Error parsing Lucene query and the demo Solr Admin page shows XML Parsing Error: syntax error Location: http://wiki.ushmm.org:8080/solr/select/?stylesheet=&q=*&versio

Re: One big XML file vs. many HTTP requests

2006-05-21 Thread Chris Hostetter
: But deleting multiple documents with just one POST is not possible, : right? Is there a special reason for that or is it because nobody asked delete by query will remoe multiple documents with a sigle command .. but if you mean dleete by id .. you may be right about it not having the same "loop

Re: One big XML file vs. many HTTP requests

2006-05-21 Thread Marcus Stratmann
Erik Hatcher wrote: I believe that Solr indexes one document at a time; each document requires a separate HTTP POST. Actually adding multiple documents per POST is possible But deleting multiple documents with just one POST is not possible, right? Is there a special reason for that or is it be

Re: One big XML file vs. many HTTP requests

2006-05-12 Thread Erik Hatcher
On May 12, 2006, at 1:02 PM, Michael Levy wrote: One nice feature of INQUERY is that you can create one large SGML file, containing lots of records, each bracketed with and DOC> tags. Submitting that big SGML document for indexing goes very fast. I believe that Solr indexes one document at

Re: One big XML file vs. many HTTP requests

2006-05-12 Thread Yonik Seeley
On 5/12/06, Michael Levy <[EMAIL PROTECTED]> wrote: How efficient is making a separate HTTP request per-document, when there are millions of documents? If you use persistent connections and add make multiple requests in parallel, there won't be much difference than multiple docs per request. -