Hey guys,
You might remember a bunch of emails going back and forth between me and the
very helpful Solr folks a few weeks back. I just wanted to let you know
about what I've learned about the problem in last week or so.

The problem was that I would run into a hard limit of how many documents I
could add to Solr in a single post on Tomcat. It was usually around 5000 -
6000 posts per add, before the system would hang indefinitely. I tried the
Java Solr client and the problem was exactly the same. I'm still unsure of
what exactly causes the problem... but I have now been able to work around
it.

The problem only occurs when adding docs that contain <![CDATA[]]> tags in
the body of the <field> tag. The problem also only seems to cause an add
limit on an individual post. I limited the size of my HTTP posts to 5000
documents per post, and the problem never showed up. You do not need to do a
commit after each batch as I previously thought.

So, like I said, I'm still unsure of what causes this problem. It does seem
to only happen on Tomcat. I've verified that the doc limit does not show up
when running on Jetty. It seems to be some sort of a problem when Solr
attempts to write to the response, but doesn't seem to be an issue with Solr
itself. Again, it only occurs if you have CData tags in your xml as well.

Strange one indeed, but I hope if any of you run into this problem this will
help you out.

-Sangraal

Reply via email to