Hi,
I am trying to setup a SolrCloud cluster on top of Hadoop cluster using Hortonworks Data Platform. I understood how to configure Solr to enable it to store data in HDFS (process given below). However, I could not understand how to enable Solr to setup the cluster using Zookeeper already available with HDP. As per my understanding, if I make only the below HDFS related change, Solr index data will be stored in HDFS. However, only machine from which the Solr application is run, will act as a Solr server. Can anyone please let me know how to configure Solr to use an external Zookeeper ensemble on HDP so that the complete Hadoop cluster can be used as a SolrCloud cluster? <directoryFactory name="DirectoryFactory" class="solr.HdfsDirectoryFactory"> <str name="solr.hdfs.home">hdfs://Hadoop_namenode:8020/user/solr</str> <bool name="solr.hdfs.blockcache.enabled">true</bool> <int name="solr.hdfs.blockcache.slab.count">1</int> <bool name="solr.hdfs.blockcache.direct.memory.allocation">true</bool> <int name="solr.hdfs.blockcache.blocksperbank">16384</int> <bool name="solr.hdfs.blockcache.read.enabled">true</bool> <bool name="solr.hdfs.blockcache.write.enabled">true</bool> <bool name="solr.hdfs.nrtcachingdirectory.enable">true</bool> <int name="solr.hdfs.nrtcachingdirectory.maxmergesizemb">16</int> <int name="solr.hdfs.nrtcachingdirectory.maxcachedmb">192</int> </directoryFactory> Also, please let me know if there are other activities that need to be performed to make SolrCloud working on Hadoop apart from this HDFS and Zookeeper changes. Thanks & Regards Vijay -- The contents of this e-mail are confidential and for the exclusive use of the intended recipient. If you receive this e-mail in error please delete it from your system immediately and notify us either by e-mail or telephone. You should not copy, forward or otherwise disclose the content of the e-mail. The views expressed in this communication may not necessarily be the view held by WHISHWORKS.