OK, _where_ is that written? The HdfsDirectoryFactory code? Someone's blog somewhere? Your notes?
Ali has one part of the answer, using HDFS will redundantly store your index, which is good. Furthermore, the MapReduceIndexerTool (see the contribs) _will_ use HDFS to do the classic M/R indexing process for batch, and has a --go-live feature that allows you to merge the results into a running SolrCloud. It was written with the assumption that the Solr index was on HDFS. FWIW, Erick On Mon, Jul 7, 2014 at 6:06 AM, Ali Nazemian <alinazem...@gmail.com> wrote: > I think this will not improve the performance of indexing but probably it > would be a solution for using HDFS HA with replication factor. But I am not > sure about that. > > > On Mon, Jul 7, 2014 at 12:53 PM, search engn dev <sachinyadav0...@gmail.com> > wrote: > >> Currently i am exploring hadoop with solr, Somewhere it is written as "This >> does not use Hadoop Map-Reduce to process Solr data, rather it only uses >> the >> HDFS filesystem for index and transaction log file storage. " , >> >> then what is the advantage of using using hadoop over local file system? >> will use of hdfs increase overall performance of searching? >> >> any detailed pointers regarding this will surely help me to understand >> this. >> >> >> >> -- >> View this message in context: >> http://lucene.472066.n3.nabble.com/Need-of-hadoop-tp4145846.html >> Sent from the Solr - User mailing list archive at Nabble.com. >> > > > > -- > A.Nazemian