Hello Rui, If your data to be indexed is in HDFS, using MapReduce to parallelize indexing is still a good idea.
Otis -- Search Analytics - http://sematext.com/search-analytics/index.html Performance Monitoring - http://sematext.com/spm/index.html On Fri, Oct 12, 2012 at 2:35 PM, Rui Vaz <rui....@gmail.com> wrote: > Hello, > > Solr Cloud and Hadoop are new to me. And I am figuring out an > architecture to do a > distributed indexing/searching system in a cluster. Integrating them is an > option. > > I would like to know if Hadoop + Solr is still a good option to build the a > big index in a cluster, > using HDFS and MapReduce, or if the new functionalities in Solr Cloud make > Hadoop unnecessary. > > I know I provided few insight about the number of shards, or if I have more > network throughput > or memory constraints. I want to launch the discussion and see diferent > points of view. > > Thank you very much, > -- > Rui Vaz