Hi all,
So the Solr tutorial recommends batching operations to improve
performance by avoiding multiple costly commits.
To implement this, I originally had a couple of methods in my Python
app reading from or writing to Solr, with a scheduled task blindly
committing every 15 seconds.
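In outline, the scheduled commit looks something like this (a simplified sketch; `schedule_commit`, `conn`, and `COMMIT_INTERVAL` are illustrative names, not my real module):

```python
import threading

COMMIT_INTERVAL = 15.0  # seconds between blind commits

def schedule_commit(conn):
    # Blindly commit whatever has been added since the last pass,
    # then re-arm the timer so this runs again in COMMIT_INTERVAL
    # seconds. `conn` stands in for my thin Solr HTTP wrapper.
    conn.commit()
    timer = threading.Timer(COMMIT_INTERVAL, schedule_commit, args=(conn,))
    timer.daemon = True  # don't keep the process alive just for commits
    timer.start()
    return timer
```

The reads and writes happen on other threads, against the same connection object, with no coordination beyond this.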
However, my logs were chock full of errors such as:
  File "/mnt/yelteam/server_dev/YelServer/yel/yel_search.py", line 73, in __add
    self.conn.add(**params)
  File "/mnt/yelteam/server_dev/YelServer/yel/solr.py", line 159, in add
    return self.doUpdateXML(xstr)
  File "/mnt/yelteam/server_dev/YelServer/yel/solr.py", line 106, in doUpdateXML
    rsp = self.doPost(self.solrBase+'/update', request, self.xmlheaders)
  File "/mnt/yelteam/server_dev/YelServer/yel/solr.py", line 94, in doPost
    return self.__errcheck(self.conn.getresponse())
  File "/usr/lib64/python2.4/httplib.py", line 856, in getresponse
    raise ResponseNotReady()
ResponseNotReady
and:
  File "/mnt/yelteam/server_dev/YelServer/yel/solr.py", line 159, in add
    return self.doUpdateXML(xstr)
  File "/mnt/yelteam/server_dev/YelServer/yel/solr.py", line 106, in doUpdateXML
    rsp = self.doPost(self.solrBase+'/update', request, self.xmlheaders)
  File "/mnt/yelteam/server_dev/YelServer/yel/solr.py", line 102, in doPost
    return self.__errcheck(self.conn.getresponse())
  File "/usr/lib64/python2.4/httplib.py", line 866, in getresponse
    response.begin()
  File "/usr/lib64/python2.4/httplib.py", line 336, in begin
    version, status, reason = self._read_status()
  File "/usr/lib64/python2.4/httplib.py", line 294, in _read_status
    line = self.fp.readline()
  File "/usr/lib64/python2.4/socket.py", line 317, in readline
    data = recv(1)
error: (104, 'Connection reset by peer')
and a few other variations.
I thought it might be to do with commit operations conflicting with
reads or writes, so I wrote an even dumber queueing system to hold
onto pending reads/writes while a commit went through.
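The serializing layer is essentially one lock around every call, so a commit never overlaps a pending read or write (a sketch; the class and method names here are illustrative, not my actual code):

```python
import threading

class SerializedSolr:
    """Dumb serialization: a single lock ensures a commit never
    runs concurrently with a read or write. `backend` stands in
    for the real Solr connection wrapper."""

    def __init__(self, backend):
        self.backend = backend
        self.lock = threading.Lock()

    def add(self, **params):
        # Writes wait for any in-flight commit to finish.
        with self.lock:
            return self.backend.add(**params)

    def search(self, query):
        # Reads are serialized the same way.
        with self.lock:
            return self.backend.search(query)

    def commit(self):
        # The commit holds the lock, blocking reads/writes meanwhile.
        with self.lock:
            return self.backend.commit()
```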
However, my logs are still full of those errors :) I doubt that
either Python's httplib library or Solr is buggy, so is it something
to do with the way I'm using the API?
How do people generally approach the deferred commit issue? Do I need
to queue index and search requests myself, or does Solr handle it? My
app indexes about 100 times more often than it searches, but searching
is more time-critical. Does that change anything?
Thanks!
James