Hi all.

Using the example setup of solr-4.4.0, I was able to easily feed 23 million documents from ClueWeb09.

Then I tried to split the one shard into two. The size on disk is:

% du -sh collection1
118G    collection1

I started Solr with 8GB for the JVM:

java -Xmx8000m -DzkRun -DnumShards=2 -Dbootstrap_confdir=./solr/collection1/conf -Dcollection.configName=myconf -jar start.jar

Then I asked for the split:

http://localhost:8983/solr/admin/collections?action=SPLITSHARD&collection=collection1&shard=shard1
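
In case anyone wants to reproduce this from a script, the same request can be built as in the minimal Python sketch below. The helper name split_shard_url is hypothetical, chosen only for illustration, and the function only constructs the URL; it does not send the request.

```python
from urllib.parse import urlencode


def split_shard_url(base, collection, shard):
    """Build the Collections API URL for a SPLITSHARD request.

    Hypothetical helper for illustration only; it does not contact Solr.
    """
    # Parameter order matches the URL used in this post.
    params = {
        "action": "SPLITSHARD",
        "collection": collection,
        "shard": shard,
    }
    return base + "/admin/collections?" + urlencode(params)


if __name__ == "__main__":
    print(split_shard_url("http://localhost:8983/solr", "collection1", "shard1"))
```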

After a while I got the OOM in the logs:

841168 [qtp614872954-17] ERROR org.apache.solr.servlet.SolrDispatchFilter – null:java.lang.RuntimeException: java.lang.OutOfMemoryError: Java heap space

My question: is it to be expected that the split needs huge amounts of RAM, or is there a chance that some change in configuration or procedure could get me past this?

Regards,
Harald.
--
Harald Kirsch
Raytion GmbH
Kaiser-Friedrich-Ring 74
40547 Duesseldorf
Fon +49-211-550266-0
Fax +49-211-550266-19
http://www.raytion.com
