need suggestions for storing TBs of structured data in SolrCloud

2014-03-05 Thread Chia-Chun Shih
Hi, I am planning a system for searching terabytes of structured data in SolrCloud. I need suggestions for handling such a large amount of data in SolrCloud (e.g., number of shards per collection, number of nodes). Here are some specs of the system: 1. Raw data is 35,000 CSV files per day. Each

SolrCloud can't correctly create collection after ZooKeeper ensemble recovery

2014-02-20 Thread Chia-Chun Shih
Hi all, This is my test procedure: 1. start a ZooKeeper ensemble and a SolrCloud node 2. stop the ZooKeeper ensemble 3. start the ZooKeeper ensemble 4. fail to create a collection (with 1 shard and 1 replica) because of a timeout 5. restart the SolrCloud node 6. fail to create a collection with the same na
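
For anyone trying to reproduce steps 4 and 6, here is a minimal sketch of the collection-creation call, assuming it goes through the Solr Collections API (action=CREATE with numShards and replicationFactor). The base URL, the collection name, and the helper names build_create_url and create_collection are hypothetical; the original post does not say how the collection was created or what timeout was in effect.

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def build_create_url(base_url, name, num_shards=1, replication_factor=1):
    """Build a Collections API CREATE URL (1 shard, 1 replica by default)."""
    params = {
        "action": "CREATE",
        "name": name,
        "numShards": num_shards,
        "replicationFactor": replication_factor,
        "wt": "json",
    }
    return base_url.rstrip("/") + "/admin/collections?" + urlencode(params)


def create_collection(base_url, name, timeout_s=30.0):
    """Send the CREATE request and return the raw response body.

    timeout_s is a client-side socket timeout chosen for illustration;
    it is not the server-side timeout mentioned in the post.
    """
    with urlopen(build_create_url(base_url, name), timeout=timeout_s) as resp:
        return resp.read().decode("utf-8")


if __name__ == "__main__":
    # Hypothetical node address and collection name; adjust to your setup.
    print(build_create_url("http://localhost:8983/solr", "test_collection"))
```

Running the same request before step 2 and again after step 3 would show whether the failure depends on the ZooKeeper restart or on the collection name already being registered.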