You can also stream an XML upload to the XML-based update handler. This can
feed, say, 100k records in one XML file from a separate machine.
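
For illustration, here is a minimal Python sketch of that kind of upload. It
assumes a Solr instance whose XML update handler is reachable at
http://localhost:8983/solr/update (adjust host, port, and path for your
install); the helper names build_add_xml and post_xml are made up for this
example and are not part of Solr.

```python
import urllib.request
from xml.sax.saxutils import escape, quoteattr

# Assumed endpoint; change host, port, and path to match your install.
SOLR_UPDATE_URL = "http://localhost:8983/solr/update"


def build_add_xml(records):
    """Render an iterable of dicts as one Solr <add> message."""
    parts = ["<add>"]
    for record in records:
        parts.append("<doc>")
        for name, value in record.items():
            # quoteattr adds the surrounding quotes; escape handles &, <, >.
            parts.append(
                "<field name=%s>%s</field>" % (quoteattr(name), escape(str(value)))
            )
        parts.append("</doc>")
    parts.append("</add>")
    return "".join(parts)


def post_xml(xml_text, url=SOLR_UPDATE_URL):
    """POST an XML message to the update handler and return the HTTP status."""
    request = urllib.request.Request(
        url,
        data=xml_text.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    with urllib.request.urlopen(request) as response:
        return response.status
```

A run from the feeding machine would call post_xml(build_add_xml(records))
and then post_xml("<commit/>") so the new documents become searchable.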

None of these options handles the case where an input record does not match
the schema. DIH gives up on an error, and so does streaming XML.
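
One possible workaround, sketched below in Python, is to screen records on
the feeding side before they reach Solr. This is only an assumption about
how you might handle it, not something DIH or the XML handler does for you.
The function name partition_records and the field sets are hypothetical,
and the check covers field names only, not field types or dynamic fields.

```python
def partition_records(records, allowed_fields, required_fields):
    """Split records into those that look indexable and those that do not.

    Returns (good, bad), where bad holds (record, missing, unknown) tuples.
    """
    good, bad = [], []
    for record in records:
        # Required fields that the record lacks.
        missing = [f for f in required_fields if f not in record]
        # Fields in the record that the schema does not declare.
        unknown = [f for f in record if f not in allowed_fields]
        if missing or unknown:
            bad.append((record, missing, unknown))
        else:
            good.append(record)
    return good, bad
```

Only the good list would then be sent for indexing, and the bad list could
be logged for repair instead of aborting the whole batch.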

Lance

-----Original Message-----
From: Steven Anderson [mailto:[EMAIL PROTECTED] 
Sent: Thursday, November 06, 2008 5:57 AM
To: solr-user@lucene.apache.org
Subject: RE: Large Data Set Suggestions

> In that case you may put the file in a mounted NFS directory or you 
> can serve it out with an apache server.

That's one option, although someone else on the list mentioned that NFS
was 10x slower in their experience.

Another option is to serve up the files via Apache and pull them via DIH
HTTP.

Thankfully, there are lots of options, but we need to determine which one
will perform best.

Thanks,

A. Steven Anderson
410-418-9908 VSTI
443-790-4269 cell
 

