I'm trying to understand how splitting a monolithic index into shards improves query response time. Please tell me if I'm on the right track here. Where does the increase in performance come from? Is it that in-memory arrays are smaller when the index is partitioned into shards? Or is it due to the likelihood that the Solr process behind each shard is running on its own CPU on a multi-CPU box?
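To make my first guess concrete, here's a toy sketch (not Solr's internal logic; the shard count and hashing scheme are made up) of hash-based partitioning, which is what would make each shard's in-memory structures proportionally smaller:

```python
# Toy sketch of hash-based document partitioning: each shard indexes
# only a fraction of the corpus, so its term dictionary and posting
# lists are proportionally smaller. Illustrative only.
NUM_SHARDS = 4

def shard_for(doc_id: str) -> int:
    """Route a document to a shard by hashing its unique id."""
    return hash(doc_id) % NUM_SHARDS

shards = {i: [] for i in range(NUM_SHARDS)}
for n in range(10_000):
    shards[shard_for(f"doc-{n}")].append(f"doc-{n}")

# Each shard ends up holding roughly 1/NUM_SHARDS of the documents.
for i in range(NUM_SHARDS):
    print(i, len(shards[i]))
```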

And it must be the case that the overhead of merging results from several shards is still less than the expense of searching a monolithic index. True?
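My mental model of that merge step, as a toy sketch (the scores, ids, and shard count are invented; this is not Solr's merge code): each shard returns its local top-k already sorted, so combining s sorted lists into a global top-k costs roughly O(k log s), which is tiny next to scanning posting lists over the whole corpus.

```python
import heapq

# Each shard returns its local top hits, already sorted by
# descending score: (score, doc_id) pairs. Values are made up.
shard_results = [
    [(0.92, "a1"), (0.75, "a2"), (0.40, "a3")],
    [(0.88, "b1"), (0.61, "b2"), (0.33, "b3")],
    [(0.95, "c1"), (0.50, "c2"), (0.10, "c3")],
]

k = 5
# heapq.merge lazily merges the sorted per-shard lists; negating the
# score makes the ascending merge produce descending-score order.
merged = heapq.merge(*shard_results, key=lambda hit: -hit[0])
top_k = [doc_id for _, doc_id in list(merged)[:k]]
print(top_k)  # → ['c1', 'a1', 'b1', 'a2', 'b2']
```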

Given roughly 10 million documents in several languages, inducing perhaps 200K unique terms and averaging about 1 MB/doc, how many shards would you recommend, and how much RAM?
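For scale, a back-of-envelope calculation from the numbers above (the index-to-raw-text ratio and the shard count here are pure guesses on my part, just to show the arithmetic):

```python
# Back-of-envelope sizing from the figures in the question.
docs = 10_000_000
avg_doc_mb = 1.0
raw_tb = docs * avg_doc_mb / 1_000_000   # MB -> TB (decimal)

index_ratio = 0.3    # ASSUMED index size relative to raw text
index_tb = raw_tb * index_ratio

shard_count = 20     # purely illustrative
per_shard_gb = index_tb * 1000 / shard_count
print(f"raw ~{raw_tb:.0f} TB, index ~{index_tb:.1f} TB, "
      f"~{per_shard_gb:.0f} GB of index per shard")
```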

Is it correct that Distributed Search (shards) is new in 1.3, or does 1.2 support it as well?

If 1.3, is the nightly build the best one to grab, bearing in mind that we would want any protocols around distributed search to be as stable as possible? Or should we just wait for the 1.3 release?



Thanks very much,

Phil

------------------------------------------
Phillip Farber - http://www.umdl.umich.edu




