Hi Toke, Thanks for your answer.
We are using RAID 0 of 8 disk, I don't understand why it should give me the same performance as disk per drive. Below is an explanation as I see it please correct me if I'm wrong. RAID configuration each shard has data on each one of the 8 disks in the RAID, on each query to get 1K docs, each shard request to get data from the one RAID disk, so we get 8 request to get date from all of the disks and we get a queue. Shard per disk configuration each shard has data only on his own disk, each shard request to get data from his own disk and they don't block each other. If I'm wrong please correct me, I do want to get it. Regarding the response time, 2-3 seconds is good for our usage also getting better is always better, if we will get better we might run the analysis on more than 1K. Thanks for the help. NIMROD COHEN Software Engineer RTI (T) +972 (9) 775-3668 (M) +972 (0) 52-5522901 nimrod.co...@nice.com www.nice.com -----Original Message----- From: Toke Eskildsen [mailto:t...@statsbiblioteket.dk] Sent: יום ג, 20 ינואר 2015 19:43 To: solr-user@lucene.apache.org Subject: RE: shards per disk Nimrod Cohen [nimrod.co...@nice.com] wrote: > We need to get 1K documents out of 100M documents each time we query > solr and send them to text Analysis. > First configuration had 8 shards on one RAD (Disk F) we got the 1K in > around 15 seconds. > Second configuration we removed the RAD and work on 8 different disk > each shard on one disk and get the 1K documents in 2-3 seconds. Which RAID level? 0, 1, maybe 5 or 6? If you did a RAID 0, it should be about the same performance as shards on individual disks, due to striping. If you did a RAID 1 with, for example, 2*4 disks, your performance would be markedly worse. If you did a RAID 1 of 8*1 disk, it would be better than individual drives as it would mitigate the "slowest drive dictates overall speed" problem. If your RAID is not really a RAID but instead JBOD or similar (http://en.wikipedia.org/wiki/Non-RAID_drive_architectures#JBOD), then the poor performance is to be expected as chances are all your data would reside on the same physical disk. Please describe your RAID setup in detail. Also, is 2-3 second response time satisfactory to you? If not, what are you aiming at? - Toke Eskildsen