RE: Using SolrCloud with RDBMS or without

2014-05-26 Thread Susheel Kumar
ntly had a use case where we indexed around 10m docs from database in less than ½ hr. Thanks, Susheel -Original Message- From: Ali Nazemian [mailto:alinazem...@gmail.com] Sent: Monday, May 26, 2014 2:42 PM To: solr-user@lucene.apache.org Subject: Re: Using SolrCloud with RDBMS

Re: Using SolrCloud with RDBMS or without

2014-05-26 Thread Shawn Heisey
On 5/26/2014 1:48 PM, Ali Nazemian wrote: > Dear Shawn, > Hi and thank you for you reply. > Could you please tell me about the performance and scalability of the > mentioned solutions? Suppose I have a SolrCloud with 4 different machine. > Would it scale linearly if I add another 4 machines to that

Re: Using SolrCloud with RDBMS or without

2014-05-26 Thread Ali Nazemian
Dear Shawn, Hi and thank you for you reply. Could you please tell me about the performance and scalability of the mentioned solutions? Suppose I have a SolrCloud with 4 different machine. Would it scale linearly if I add another 4 machines to that? I mean when the documents number increases from 10

Re: Using SolrCloud with RDBMS or without

2014-05-26 Thread Ali Nazemian
Dear Erick, Thank you for you reply. Some parts of documents come from Nutch crawler and the other parts come from processing those documents. I really need it to be as fast as possible and 10 hours for indexing is not acceptable for my application. Regards. On Mon, May 26, 2014 at 9:25 PM, Erick

Re: Using SolrCloud with RDBMS or without

2014-05-26 Thread Erick Erickson
What you haven't told us is where the data comes from. But until you put some numbers to it, it's hard to decide. I tend to prefer storing the data somewhere else, filesystem, whatever and indexing to Solr when data changes. Even if that means re-indexing the entire corpus. I don't like going to m

Re: Using SolrCloud with RDBMS or without

2014-05-26 Thread Shawn Heisey
On 5/26/2014 7:50 AM, Ali Nazemian wrote: > I was wondering which scenario (or the combination) would be better for my > application. From the aspect of performance, scalability and high > availability. Here is my application: > > Suppose I am going to have more than 10m documents and it grows eve

Re: Using SolrCloud with RDBMS or without

2014-05-26 Thread Ali Nazemian
The fact that I ignore Cassandra is because of it seems Cassandra is perfect when you have too much write operation. In my case it is true that I have some update operation but for sure read operations are much more than write ones. By the way there are probably more scenarios for my application. M

Re: Using SolrCloud with RDBMS or without

2014-05-26 Thread Jack Krupansky
You could also consider DataStax Enterprise, which integrates Apache Cassandra as the primary database and Solr for indexing and query. See: http://www.datastax.com/what-we-offer/products-services/datastax-enterprise -- Jack Krupansky -Original Message- From: Ali Nazemian Sent: Monda