Re: Solr server requirements for 100+ million documents

2014-02-11 Thread Shawn Heisey
On 2/11/2014 3:28 PM, Susheel Kumar wrote: Thanks, Otis for quick reply. So for ZK do you recommend separate servers and if so how many for initial Solr cloud cluster setup. In a minimal 3-server setup, all servers would run zookeeper and two of them would also run Solr.With this setup, you c

Re: Solr server requirements for 100+ million documents

2014-02-11 Thread Jason Hellman
Whether you use the same machines as Solr or separate machines is a matter suited to taste. If you are the CTO, then you should make this decision. If not, inform management that risk conditions are greater when you share function and control on a single piece of hardware. A single failure of

Re: Solr server requirements for 100+ million documents

2014-02-11 Thread svante karlsson
ZK needs a quorum to keep functional so 3 servers handles one failure. 5 handles 2 node failures. If you Solr with 1 replica per shard then stick to 3 ZK. If you use 2 replicas use 5 ZK >

RE: Solr server requirements for 100+ million documents

2014-02-11 Thread Susheel Kumar
Subject: Re: Solr server requirements for 100+ million documents Hi Susheel, No, we wouldn't want to go with just 1 ZK. :) Otis -- Performance Monitoring * Log Analytics * Search Analytics Solr & Elasticsearch Support * http://sematext.com/ On Tue, Feb 11, 2014 at 5:18 PM, Sush

Re: Solr server requirements for 100+ million documents

2014-02-11 Thread Otis Gospodnetic
> To: solr-user@lucene.apache.org > Subject: Re: Solr server requirements for 100+ million documents > > Hi Susheel, > > Like Erick said, it's impossible to give precise recommendations, but > making a few assumptions and combining them with experience (+ a licke

RE: Solr server requirements for 100+ million documents

2014-02-11 Thread Susheel Kumar
Subject: Re: Solr server requirements for 100+ million documents Hi Susheel, Like Erick said, it's impossible to give precise recommendations, but making a few assumptions and combining them with experience (+ a licked finger in the air): * 3 servers * 32 GB * 2+ CPU cores * Linux Assuming doc

Re: Solr server requirements for 100+ million documents

2014-01-28 Thread Jorge Luis Betancourt Gonzalez
day, January 28, 2014 8:01 PM > To: solr-user@lucene.apache.org > Subject: Re: Solr server requirements for 100+ million documents > > Lucene and Solr work best if the full index can be cached in OS memory. > Sure, Lucene/Solr does work properly once the index no longer fits, but >

RE: Solr server requirements for 100+ million documents

2014-01-28 Thread Susheel Kumar
Thanks, Jack. That helps. -Original Message- From: Jack Krupansky [mailto:j...@basetechnology.com] Sent: Tuesday, January 28, 2014 8:01 PM To: solr-user@lucene.apache.org Subject: Re: Solr server requirements for 100+ million documents Lucene and Solr work best if the full index can be

Re: Solr server requirements for 100+ million documents

2014-01-28 Thread Jack Krupansky
ince they are on different machines.) -- Jack Krupansky -Original Message- From: Susheel Kumar Sent: Sunday, January 26, 2014 10:54 AM To: solr-user@lucene.apache.org Subject: RE: Solr server requirements for 100+ million documents Thank you Erick for your valuable inputs. Yes, we have t

Re: Solr server requirements for 100+ million documents

2014-01-26 Thread simon
rt SolrClod using SolrJ is better? > > > > > > Thanks > > Susheel > > -Original Message- > > From: Erick Erickson [mailto:erickerick...@gmail.com] > > Sent: Sunday, January 26, 2014 8:37 AM > > To: solr-user@lucene.apache.org > &g

Re: Solr server requirements for 100+ million documents

2014-01-26 Thread Erick Erickson
-Original Message- > From: Erick Erickson [mailto:erickerick...@gmail.com] > Sent: Sunday, January 26, 2014 8:37 AM > To: solr-user@lucene.apache.org > Subject: Re: Solr server requirements for 100+ million documents > > Dumping the raw data would probably be a good idea. I

RE: Solr server requirements for 100+ million documents

2014-01-26 Thread Susheel Kumar
mail.com] > Sent: Saturday, January 25, 2014 12:08 AM > To: solr-user@lucene.apache.org > Subject: Re: Solr server requirements for 100+ million documents > > can you post the complete solrconfig.xml file and schema.xml files to review > all of your settings that would impact your indexin

Re: Solr server requirements for 100+ million documents

2014-01-26 Thread Erick Erickson
ay, January 25, 2014 12:08 AM > To: solr-user@lucene.apache.org > Subject: Re: Solr server requirements for 100+ million documents > > can you post the complete solrconfig.xml file and schema.xml files to review > all of your settings that would impact your indexing performance. > >

RE: Solr server requirements for 100+ million documents

2014-01-25 Thread Susheel Kumar
mail.com] Sent: Saturday, January 25, 2014 12:08 AM To: solr-user@lucene.apache.org Subject: Re: Solr server requirements for 100+ million documents can you post the complete solrconfig.xml file and schema.xml files to review all of your settings that would impact your indexing performance.

Re: Solr server requirements for 100+ million documents

2014-01-25 Thread svante karlsson
>> > >> > >> >> required="false" /> > >> > >> > >> id > >> > >> > >> > >> > >> > >> > >> > >> 2014/1/25 Kranti Parisa > >> > >>> can you pos

Re: Solr server requirements for 100+ million documents

2014-01-25 Thread Erick Erickson
ing libcurl. >>> > 11 million took 763 seconds so the speed drops somewhat with increasing >>> > dbsize. >>> > >>> > We write 1000 docs (just an arbitrary number) in each request from two >>> > threads. If you will be using solrcloud you will

Re: Solr server requirements for 100+ million documents

2014-01-25 Thread svante karlsson
SSD >> > and 32GB and the solr runs on ubuntu 13.10 inside a esxi virtual >> machine. >> > >> > /svante >> > >> > >> > >> > >> > 2014/1/24 Susheel Kumar >> > >> > > Thanks, Erick for

Re: Solr server requirements for 100+ million documents

2014-01-25 Thread svante karlsson
kbs. > > > > -Original Message- > > From: saka.csi...@gmail.com [mailto:saka.csi...@gmail.com] On Behalf Of > > svante karlsson > > Sent: Friday, January 24, 2014 5:05 PM > > To: solr-user@lucene.apache.org > > Subject: Re: Solr server requirements f

Re: Solr server requirements for 100+ million documents

2014-01-24 Thread Kranti Parisa
e.apache.org > Subject: Re: Solr server requirements for 100+ million documents > > I just indexed 100 million db docs (records) with 22 fields (4 > multivalued) in 9524 sec using libcurl. > 11 million took 763 seconds so the speed drops somewhat with increasing > dbsize. > > We

RE: Solr server requirements for 100+ million documents

2014-01-24 Thread Susheel Kumar
J or should we try to automate using Curl? > > > -Original Message- > From: Erick Erickson [mailto:erickerick...@gmail.com] > Sent: Friday, January 24, 2014 2:59 PM > To: solr-user@lucene.apache.org > Subject: Re: Solr server requirements for 100+ million documents

Re: Solr server requirements for 100+ million documents

2014-01-24 Thread Otis Gospodnetic
Hi Susheel, Like Erick said, it's impossible to give precise recommendations, but making a few assumptions and combining them with experience (+ a licked finger in the air): * 3 servers * 32 GB * 2+ CPU cores * Linux Assuming docs are not bigger than a few KB, that they are not being reindexed ov

Re: Solr server requirements for 100+ million documents

2014-01-24 Thread svante karlsson
o automate using Curl? > > > -Original Message- > From: Erick Erickson [mailto:erickerick...@gmail.com] > Sent: Friday, January 24, 2014 2:59 PM > To: solr-user@lucene.apache.org > Subject: Re: Solr server requirements for 100+ million documents > > Can't be do

RE: Solr server requirements for 100+ million documents

2014-01-24 Thread Susheel Kumar
. Do you recommend to use SolrJ or should we try to automate using Curl? -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Friday, January 24, 2014 2:59 PM To: solr-user@lucene.apache.org Subject: Re: Solr server requirements for 100+ million documents Can'

Re: Solr server requirements for 100+ million documents

2014-01-24 Thread Erick Erickson
Can't be done with the information you provided, and can only be guessed at even with more comprehensive information. Here's why: http://searchhub.org/2012/07/23/sizing-hardware-in-the-abstract-why-we-dont-have-a-definitive-answer/ Also, at a guess, your indexing speed is so slow due to data acq