I suspect that you're seeing a timeout issue, and the simplest fix might be to raise the timeouts, probably at the servlet level.
You might get some evidence that this is the issue if your log files for the time when this happens show some unusual activity; garbage collection is a common cause of this kind of thing.

Not all that helpful, but maybe a start.

Erick

On Wed, Dec 5, 2012 at 7:59 PM, Michael Ryan <mr...@moreover.com> wrote:

> We have a longstanding issue with "failed to respond" errors in Solr when
> our coordinator is querying our Solr shards.
>
> To elaborate further... we're using the built-in distributed capabilities
> of Solr 3.6, and using Jetty as our server. Occasionally, we will have a
> query fail due to an error like
> "org.apache.commons.httpclient.NoHttpResponseException: The server
> solr-shard-13 failed to respond" when the Solr coordinator is sending a
> request to one of its shards. Over the long term, this happens for about 1
> out of 3000 queries. The quick fix of simply retrying the query when such
> an intermittent error occurs works fine, but I'm trying to figure out what
> the root cause might be.
>
> I've got lots of theories and possible fixes, but was hoping someone had
> run into this before and knows the answer straight away :)
>
> -Michael
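
For reference, the retry workaround mentioned in the quoted message could look roughly like the sketch below. It is only an illustration: ShardRetry and withRetry are hypothetical names, not part of Solr or HttpClient, and it assumes the failure reaches the caller as a java.io.IOException (NoHttpResponseException is a subclass of it in commons-httpclient 3.x, as far as I know).

```java
import java.io.IOException;
import java.util.concurrent.Callable;

/** Hypothetical helper: retries a call that fails with an intermittent I/O error. */
public final class ShardRetry {
    private ShardRetry() {}

    /**
     * Runs the call, retrying only when it throws IOException.
     * maxAttempts is the total number of tries, including the first one.
     * Any other exception is propagated immediately without a retry.
     */
    public static <T> T withRetry(Callable<T> call, int maxAttempts, long backoffMillis)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        IOException last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return call.call();
            } catch (IOException e) {
                last = e; // remember the failure in case every attempt fails
                if (attempt < maxAttempts) {
                    Thread.sleep(backoffMillis); // brief pause before the next try
                }
            }
        }
        throw last; // all attempts failed with an I/O error
    }
}
```

Logging each retried failure with a timestamp and the shard name would also make it easier to line the errors up against garbage collection pauses in the shard logs.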