Just to check: your index is NOT sharded, correct? Assuming not sharded, is it SolrCloud? If not SolrCloud, how are the indexes kept in synch? Master/slave? Manual copy?
But for an unchanging index, this is definitely odd. Best, Erick On Thu, Oct 9, 2014 at 7:40 AM, Walter Underwood <wun...@wunderwood.org> wrote: > Is this a production log of queries, with lots of repeats? If so, you may be > seeing the normal effect of lower cache hit rates. > > Check the hit rate for the query result cache in the two setups. With a > single machine, the second occurrence of a query will be a cache hit. With > two machines, it will not be if the two queries are routed to different > machines. > > I was running some benchmarks here. With one machine, the query cache had a > 50% hit rate. With eight machines, it was 20%. > > You can address this with a reverse proxy HTTP cache in front of the cluster, > something like Varnish. > > wunder > Walter Underwood > wun...@wunderwood.org > http://observer.wunderwood.org/ > > > On Oct 9, 2014, at 7:21 AM, Yannick <yann1...@yahoo.com.INVALID> wrote: > >> Hi Toke, >> >> thanks for your suggestion - definitely an interesting idea. But >> unfortunately no, no indexing job is running; those are static indexes being >> queried. The execution time is also very consistent in each condition, I did >> quite a few tests. >> >> Yann >> >> >> On Thursday, October 9, 2014 3:56 PM, Toke Eskildsen >> <t...@statsbiblioteket.dk> wrote: >> >> >> >> On Thu, 2014-10-09 at 15:06 +0200, Yannick wrote: >> >> >>> I created a group of 2 Solr servers with a load-balancer in front >>> (Haproxy). I have a batch client that sends requests (read-only) >>> continuously to the load-balancer. The problem is: the performance is >>> slower with 2 servers than it is with a single server (still via the >>> load-balancer, with the second server down, so it's not the >>> load-balancer itself causing the slowdown). >> >> (speculating a lot here:) >> >> Is another job updating the indexes while you are batch-searching? >> If so, the slowdown could be explained by the servers disk caches being >> flushed by the indexing job. When a request arrives some cache is >> reclaimed, but is will be a battle between the update and the search >> jobs. With more machines, there will be fewer request/machine, so the >> search-cache has a lower chance of being used again before it is >> reclaimed by the updater. >> >> Still, worse performance for 2 machines sounds pretty bad. >> >> - Toke Eskildsen, State and University Library, Denmark >