Re: Integrating solr with Hadoop

2014-07-02 Thread gurunath
Thanks Erick, I will look into the MapReduce option. It would be helpful if anyone could share links on setting up Hadoop with Solr.

Re: Integrating solr with Hadoop

2014-07-01 Thread Erick Erickson
Should be fine. Things to watch:
1> solrconfig.xml has to have the HdfsDirectoryFactory enabled.
2> You probably want to configure ZooKeeper stand-alone. Running embedded ZK is possible but awkward, since you can't really bounce Solr nodes running embedded ZK at wil
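
For point 1> above, a minimal sketch of what enabling HdfsDirectoryFactory in solrconfig.xml can look like. The NameNode host, port, and paths are placeholders assumed for illustration, not values from this thread, and the exact settings should be checked against the Solr version in use:

```xml
<!-- Store the index through the Hadoop FileSystem API instead of local disk. -->
<directoryFactory name="DirectoryFactory" class="solr.HdfsDirectoryFactory">
  <!-- Placeholder URI: replace with the real filesystem root for Solr data. -->
  <str name="solr.hdfs.home">hdfs://namenode.example.com:8020/solr</str>
  <!-- Placeholder path: directory holding the Hadoop client configuration. -->
  <str name="solr.hdfs.confdir">/etc/hadoop/conf</str>
</directoryFactory>

<indexConfig>
  <!-- Index locking also has to go through HDFS rather than the native lock. -->
  <lockType>hdfs</lockType>
</indexConfig>
```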

Re: Integrating solr with Hadoop

2014-06-30 Thread gurunath
Thanks everybody. I was confused. If I am not wrong, I have to run Solr with Tomcat or Jetty, and I can use the Hadoop file system to store the index files, where Solr by default uses the local file system (NTFS, etc.). So my question is: can I use the configuration below? 1. Solr 4.7 + Tomcat 7 + Apache zookee

Re: Integrating solr with Hadoop

2014-06-30 Thread Jay Vyas
Minor clarification: index storage uses the Hadoop FileSystem API, not HDFS specifically, so the connection is not actually to HDFS ... Solr can distribute indices for failover / reliability / scaling to any HCFS-compliant filesystem.
> On Jun 30, 2014, at 11:55 AM, Erick Erickson wro

Re: Integrating solr with Hadoop

2014-06-30 Thread Shawn Heisey
On 6/30/2014 3:19 AM, gurunath wrote:
> I want to set up Solr in production. Initially the data set I am using is
> small; the size of the data will grow gradually. I have heard about using
> "*Big Data Work for Hadoop and Solr*". Is this a better option for large
> data, or better to go ahead w

Re: Integrating solr with Hadoop

2014-06-30 Thread Erick Erickson
Whoa! You're confusing a couple of things, I think. The only real connection Solr <-> Hadoop _may_ be that Solr can have its indexes stored on HDFS. Well, you can also create map/reduce jobs that will index the data via M/R and merge the results into a live index in Solr (assuming it's storing its indexes