Re: How to Fast Bulk Inserting documents

2015-08-19 Thread Troy Edwards
Thank you for taking the time to do the test. I have been doing similar tests using the post Tool (SimplePostTool) with the real data and was able to get to about 10K documents/second. I am considering using multiple files (one per client) ftp'd into a solr node and then use a scheduled job to us

Re: How to Fast Bulk Inserting documents

2015-08-19 Thread Troy Edwards
Are you suggesting that requests come into a service layer that identifies which client is on which solrcloud and passes the request to that cloud? Thank you On Wed, Aug 19, 2015 at 1:13 PM, Toke Eskildsen wrote: > Troy Edwards wrote: > > My average document size is 400 bytes > > Number of doc

Re: How to Fast Bulk Inserting documents

2015-08-19 Thread Toke Eskildsen
Toke Eskildsen wrote > Use more than one cloud. Make them fully independent. > As I suggested when you asked 4 days ago. That would > also make it easy to scale: Just measure how much a > single setup can take and do the math. The goal is 250K documents/second. I tried modifying the books.csv-ex

Re: How to Fast Bulk Inserting documents

2015-08-19 Thread Susheel Kumar
For Indexing 3.5 billion documents, you will not only run into bottleneck with Solr but also at different places (data acquisition, solr document object creation, submitting in bulk/batches to Solr). This will require parallelizing the above operations at each of the above steps which can get you

Re: How to Fast Bulk Inserting documents

2015-08-19 Thread Erick Erickson
Ir you're sitting on HDFS anyway, you could use MapReduceIndexerTool. I'm not sure that'll hit your rate, it spends some time copying things around. If you're not on HDFS, though, it's not an option. Best, Erick On Wed, Aug 19, 2015 at 11:36 AM, Upayavira wrote: > > > On Wed, Aug 19, 2015, at 07

Re: How to Fast Bulk Inserting documents

2015-08-19 Thread Upayavira
On Wed, Aug 19, 2015, at 07:13 PM, Toke Eskildsen wrote: > Troy Edwards wrote: > > My average document size is 400 bytes > > Number of documents that need to be inserted 25/second > > (for a total of about 3.6 Billion documents) > > > Any ideas/suggestions on how that can be done? (use a cl

Re: How to Fast Bulk Inserting documents

2015-08-19 Thread Toke Eskildsen
Troy Edwards wrote: > My average document size is 400 bytes > Number of documents that need to be inserted 25/second > (for a total of about 3.6 Billion documents) > Any ideas/suggestions on how that can be done? (use a client > or uploadcsv or stream or data import handler) Use more than on

Re: How to Fast Bulk Inserting documents

2015-08-19 Thread Shawn Heisey
On 8/19/2015 11:09 AM, Troy Edwards wrote: > I have a requirement where I have to bulk insert a lot of documents in > SolrCloud. > > My average document size is 400 bytes > Number of documents that need to be inserted 25/second (for a total of > about 3.6 Billion documents) > > Any ideas/sugges

Re: How to Fast Bulk Inserting documents

2015-08-19 Thread Vineeth Dasaraju
I have been using the solrj client and get speeds of 1000 objects per second. The size of my object is around 4 kb. On Aug 19, 2015 12:09 PM, "Troy Edwards" wrote: > I have a requirement where I have to bulk insert a lot of documents in > SolrCloud. > > My average document size is 400 bytes > Num