Hi all.
Using the example setup of solr-4.4.0, I was able to easily feed 23
million documents from ClueWeb09.
The I tried to split the one shard into tqo. The size on disk is:
% du -sh collection1
118G collection1
I started Solr with 8GB for the JVM:
java -Xmx8000m -DzkRun -DnumShards=2
-Dbootstrap_confdir=./solr/collection1/conf
-Dcollection.configName=myconf -jar start.jar
Then I asked for the split
http://localhost:8983/solr/admin/collections?action=SPLITSHARD&collection=collection1&shard=shard1
After a while I got the OOM in the logs:
841168 [qtp614872954-17] ERROR
org.apache.solr.servlet.SolrDispatchFilter –
null:java.lang.RuntimeException: java.lang.OutOfMemoryError: Java heap space
My question: is it to be expected that the split needs huge amounts of
RAM or is there a chance that some configuration or procedure change
could get me past this?
Regards,
Harald.
--
Harald Kirsch
Raytion GmbH
Kaiser-Friedrich-Ring 74
40547 Duesseldorf
Fon +49-211-550266-0
Fax +49-211-550266-19
http://www.raytion.com