Muneeb Ali wrote: > > Hi Blargy, > > Nice to hear that I am not alone ;) > > Well we have been using Hadoop for other data-intensive services, those > that can be done in parallel. We have multiple nodes, which are used by > Hadoop for all our MapReduce jobs. I personally don't have much experience > with its use and hence wouldn't be able to help you much with that. > > Our indexing takes 6+ hours to index 15 million documents (using > solrj.streamUpdateSolrServer). I wanted to explore hadoop for this task, > as it can be done in parallel. > > I have just started investigating into this, will keep this post updated > if found anything helpful. > > -Neeb >
Would you mind explaining how your full indexing strategy is implemented using the StreamingUpdateSolrServer? I am currently only familar with using the DataImportHandler. Thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/anyone-use-hadoop-solr-tp485333p915227.html Sent from the Solr - User mailing list archive at Nabble.com.