Hi All, I have TB of data that need to be indexed. I am trying to use hadoop to index those TB. I am still newbie. I thought that the Map function will read data from hard disks and the reduce function will index them. The problem I am facing is how to read those data from hard disks which are not HDFS.
I understand that the data to be indexed must be on HDFS, don't they? or I am missing something here. I can't convert the nodes on which the data resides to HDFS. Can anyone please help. I would also appreciate if you can provide a good tutorial for solr indexing using hadoop. I googled alot but I did not find a sufficient one. Thanks -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-indexer-and-Hadoop-tp4072951.html Sent from the Solr - User mailing list archive at Nabble.com.