Re: core,Collection,Shard,Replication

2016-01-06 Thread Erick Erickson
bq: But when indexing a document in one shard, it gets reflected in every shard of that collection. This is a misunderstanding (and I'm being a bit pedantic here). Each shard contains a portion of the entire corpus. Say you have 1M docs and 2 shards. Each shard will have very close to 500K documents
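
To illustrate the point above that each document lands in exactly one shard, here is a minimal sketch of hash-based routing. The function names (`shard_for`, `distribute`), the `doc-N` ids, and the use of MD5 are assumptions made for illustration only; this is not Solr's actual router, and it uses 100K documents rather than 1M just to keep it quick.

```python
import hashlib
from collections import Counter


def shard_for(doc_id: str, num_shards: int) -> int:
    """Pick exactly one shard for a document id.

    MD5 is only a stand-in hash for this sketch; Solr's own
    routing uses a different scheme.
    """
    digest = hashlib.md5(doc_id.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % num_shards


def distribute(num_docs: int, num_shards: int) -> list:
    """Count how many documents each shard would receive."""
    counts = Counter(
        shard_for(f"doc-{i}", num_shards) for i in range(num_docs)
    )
    return [counts[shard] for shard in range(num_shards)]


# Hypothetical corpus of 100K documents split over 2 shards.
counts = distribute(100_000, 2)
print(counts)
```

Every document is counted once, so the per-shard counts add up to the corpus size, and with a well-mixed hash each shard ends up with roughly half of the documents.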

Re: core,Collection,Shard,Replication

2016-01-06 Thread Binoy Dalal
The machines part may have been a bit misleading. I am sorry for that. What I actually meant was shards. Now, you can have multiple shards hosted on a single machine or multiple machines as in the example I gave. "I have to make sure that all those machines have solr server or gateway should be de

Re: core,Collection,Shard,Replication

2016-01-06 Thread vidya
Hi, You described that sharding is to distribute data over multiple machines. Do I have to make sure that all those machines have solr server or gateway should be deployed? And what multiple JVM processes run behind a running solr server? I wanted to know what a node is. -> I understood like a mchin

Re: core,Collection,Shard,Replication

2016-01-06 Thread Binoy Dalal
1) A collection is simply a logical group and can consist of multiple cores. The core is a representation of a single physical index or part of an index. Both cores and collections can be created in local as well as cloud modes. 2) Sharding is performed to distribute your index over multiple machin
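
As a rough picture of the relationship described in point 1, the sketch below models a collection as a logical group of shards, each backed by one or more cores. The class names, field names, and sample values (`Core`, `Shard`, `Collection`, `node`, `machine-a`, and so on) are hypothetical and chosen only for illustration; they are not Solr APIs.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Core:
    """One physical index, or one part of a larger index."""
    name: str
    node: str  # hypothetical label for the machine/JVM hosting this core


@dataclass
class Shard:
    """A logical slice of the collection, backed by one or more cores."""
    name: str
    cores: List[Core] = field(default_factory=list)


@dataclass
class Collection:
    """A logical group; it owns no index data itself."""
    name: str
    shards: List[Shard] = field(default_factory=list)

    def all_cores(self) -> List[Core]:
        # Flatten the shards into the list of cores that hold the data.
        return [core for shard in self.shards for core in shard.cores]


# Hypothetical layout: two shards, each hosted on a different machine.
collection = Collection(
    name="example_collection",
    shards=[
        Shard("shard1", [Core("example_shard1_core", "machine-a")]),
        Shard("shard2", [Core("example_shard2_core", "machine-b")]),
    ],
)
core_names = [core.name for core in collection.all_cores()]
```

In this toy layout the collection is only a grouping; the index data lives in the cores, which may sit on the same machine or on different ones.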