On 12/19/2010 2:07 AM, Tri Nguyen wrote:
Was wondering about the pro's and con's of using sharding versus cores.
An index can be split up to multiple cores or multilple shards.
So why one over the other?
If you split your index into multiple cores, you still have to use the
shards parameter to tell Solr where to find the parts. You can use
multiple servers, multiple cores, or even both. Which method to use
depends on why you've decided to split your index into multiple pieces.
If the primary motivating factor is index size, you'll probably want to
use separate servers. Unless the only reason for distributed search is
making build process easier (or possible), I personally would not have
multiple "live" cores on a single machine. An example where multiple
cores per server is entirely appropriate (creating a new core every five
minutes):
http://www.loggly.com/2010/08/our-solr-system/
I went to this guy's talk at Lucene Revolution. Amazing stuff.
Shawn