How to index data with Hadoop and Solr?

2014-07-06 Thread toothlou_t...@163.com
Hello: I want to use Hadoop and Solr to index data. Can someone tell me how to do it? toothlou_t...@163.com

Re: Solr and SolrCloud replication, and load balancing questions.

2014-07-06 Thread Himanshu Mehrotra
OK, great. Thanks for helping out. Thanks, Himanshu On Sun, Jul 6, 2014 at 9:35 PM, Erick Erickson wrote: > [C] I've rarely seen situations where the document cache has > a very high hit rate. For that to happen, the queries would > need to be returning the exact same documents, which isn'

Re: Solr and SolrCloud replication, and load balancing questions.

2014-07-06 Thread Erick Erickson
[C] I've rarely seen situations where the document cache has a very high hit rate. For that to happen, the queries would need to be returning the exact same documents, which isn't usually the case. I wouldn't increase this very far. The recommendation is that it be (total simultaneous queries you

Re: Solr and SolrCloud replication, and load balancing questions.

2014-07-06 Thread Himanshu Mehrotra
Erick, first up, thanks for thoroughly answering my questions. [A] I had read the blog mentioned, and yet failed to 'get it'. Now I understand the flow. [B] The automatic, heuristic-based approach, as you said, will be difficult to get right; that is why I thought 'beefiness' index configuration s

Re: error during heavy indexing

2014-07-06 Thread navdeep710
Thank you, Shawn, for replying. It's the client that is doing the indexing (queries). I have explicitly set the connection timeout and the socket timeout to 0. I am indexing through MapReduce jobs, so indexing is happening in a multi-threaded environment.