Re: Throughput Optimization

Erik Hatcher Wed, 05 Nov 2008 02:54:10 -0800

One quick question.... are you seeing any evictions from yourfilterCache? If so, it isn't set large enough to handle the facetingyou're doing.


        Erik



On Nov 4, 2008, at 8:01 PM, wojtekpia wrote:

I've been running load tests over the past week or 2, and I can'tfigure outmy system's bottle neck that prevents me from increasing throughput.FirstI'll describe my Solr setup, then what I've tried to optimize thesystem.
I have 10 million records and 59 fields (all are indexed, 37 arestored, 17have termVectors, 33 are multi-valued) which takes about 15GB ofdisk space.Most field values are very short (single word or number), andusually abouthalf the fields have any data at all. I'm running on an 8-core, 64-bit, 32GBRAM Redhat box. I allocate about 24GB of memory to the java process,and myfilterCache size is 700,000. I'm using a version of Solr between 1.3and thecurrent trunk (including the latest SOLR-667 (FastLRUCache) patch),and
Tomcat 6.0.
I'm running a ramp-test, increasing the number of users every fewminutes. Imeasure the maximum number of requests that Solr can handle persecond witha fixed response time, and call that my throughput. I'd like to seea singlephysical resource be maxed out at some point during my test so Iknow it ismy bottle neck. I generated random queries for my datasetrepresenting amore or less realistic scenario. The queries include faceting by upto 6
fields, and quering by up to 8 fields.
I ran a baseline on the un-optimized setup, and saw peak CPU usageof about50%, IO usage around 5%, and negligible network traffic.Interestingly, theCPU peaked when I had 8 concurrent users, and actually dropped downto about40% when I increased the users beyond 8. Is that because I have 8cores?
I changed a few settings and observed the effect on throughput:
1. Increased filterCache size, and throughput increased by about50%, but it
seems to peak.
2. Put the entire index on a RAM disk, and significantly reduced theaverageresponse time, but my throughput didn't change (i.e. even though myresponsetime was 10X faster, the maximum number of requests I could make perseconddidn't increase). This makes no sense to me, unless there is anotherbottle
neck somewhere.
3. Reduced the number of records in my index. The throughputincreased, butthe shape of all my graphs stayed the same, and my CPU usage wasidentical.
I have a few questions:
1. Can I get more than 50% CPU utilization?
2. Why does CPU utilization fall when I make more than 8 concurrent
requests?
3. Is there an obvious bottleneck that I'm missing?
4. Does Tomcat have any settings that affect Solr performance?

Any input is greatly appreciated.

--
View this message in context: 
http://www.nabble.com/Throughput-Optimization-tp20335132p20335132.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Throughput Optimization

Reply via email to