On 8 March 2012 16:18, gabriel shen <xshco...@gmail.com> wrote:
> Our indexing process is to adding a bundle of solr documents(for example
> 5000) to solr each time, and we observed that before commiting(which might
> be io bounded) it uses less than half the CPU capacity constantly, which
> sounds strange to us why it doesn't use full cpu power. As for RAM, I don't
> know how much it will affect CPU utilization, we have assigned 14gb to the
> solr tomcat server on a 32 gb linux machine.
[...]

Are you hitting memory limits?

As Tanguy has already pointed out in nice detail, it probably
also does matter how you push documents to Solr, and how
often you commit.

In an apples-to-oranges comparison, we used to run a
large indexing task, but with only a single commit at the
end, while it sounds as if you are using smaller batches,
with more frequent commits. In our case, we could max
out CPU usage (well, we backed off at ~85% utilisation
on each core). Though we were fetching data over the
network, it was a relatively high-bandwidth internal connection,
and we were using DIH with multiple Solr cores.

Regards,
Gora

Reply via email to