You can also do a streaming XML upload for XML-based indexing. This can feed, say, 100k records in a single XML file from a separate machine.
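As a rough sketch of what that could look like from the feeding machine, here is a small Python example. It builds a Solr `<add>` message one `<doc>` at a time and posts it with chunked transfer encoding, so the whole file never has to sit in memory. The function names `solr_add_xml` and `post_stream`, the field names, and the update URL are all assumptions for illustration; adjust them to your own schema and install.

```python
import urllib.request
from xml.sax.saxutils import escape, quoteattr


def solr_add_xml(records):
    """Yield a Solr <add> message as byte chunks, one <doc> per record.

    `records` is any iterable of dicts mapping field name -> value.
    (Hypothetical helper, not part of Solr.)
    """
    yield b"<add>"
    for record in records:
        fields = "".join(
            "<field name=%s>%s</field>" % (quoteattr(name), escape(str(value)))
            for name, value in record.items()
        )
        yield ("<doc>%s</doc>" % fields).encode("utf-8")
    yield b"</add>"


def post_stream(update_url, chunks):
    """POST an iterable of byte chunks to the update URL.

    Because `chunks` is an iterable rather than a bytes object,
    urllib sends it with Transfer-Encoding: chunked.
    """
    request = urllib.request.Request(
        update_url,
        data=chunks,
        headers={"Content-Type": "text/xml; charset=utf-8"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.status


# Example usage (assumed URL; needs a running Solr instance):
#   records = ({"id": str(i), "title": "record %d" % i} for i in range(100000))
#   post_stream("http://localhost:8983/solr/update", solr_add_xml(records))
#   post_stream("http://localhost:8983/solr/update", [b"<commit/>"])
```

The documents only become searchable after a commit, which is why the usage comment sends a separate `<commit/>` message at the end.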
None of these options handles the case where an input record has an error vs. the schema. DIH gives up on an error. Streaming XML gives up on an error.

Lance

-----Original Message-----
From: Steven Anderson [mailto:[EMAIL PROTECTED]
Sent: Thursday, November 06, 2008 5:57 AM
To: solr-user@lucene.apache.org
Subject: RE: Large Data Set Suggestions

> In that case you may put the file in a mounted NFS directory or you
> can serve it out with an apache server.

That's one option, although someone else on the list mentioned that performance was 10x slower in their NFS experience.

Another option is to serve up the files via Apache and pull them via DIH HTTP.

Thankfully, there are lots of options, but we need to determine which one will perform best.

Thanks,
A. Steven Anderson
410-418-9908 VSTI
443-790-4269 cell