Hello Grant, 

>Were you overwriting the existing index or did you also clean out the  
>Solr data directory, too?  In other words, was it a fresh index, or an  
>existing one?  And was that also the case for the 22 minute time?

No in each case it was a new index. I store the indexes (the "data" dir)
outside the solr home directory. For the moment I, rm -rf the index dir
after each edit to the solrconfig.sml or schema.xml file and reindex
from scratch. The relaunch of tomcat recreates the index dir.

>Would it be possible to profile the two instance and see if you notice  
>anything different?
I dont understand this. Do mean run a profiler against the tomcat
image as indexing takes place, or somehow compare the indexes?

I was think of making a short script that replicates the results, 
and posting it here, would that help?

>
>Thanks,
>Grant
>
>On Nov 19, 2008, at 8:25 AM, Fergus McMenemie wrote:
>
>> Hello,
>>
>> I have a CSV file with 6M records which took 22min to index with
>> solr 1.2. I then stopped tomcat replaced the solr stuff inside
>> webapps with version 1.3, wiped my index and restarted tomcat.
>>
>> Indexing the exact same content now takes 69min. My machine has
>> 2GB of RAM and tomcat is running with $JAVA_OPTS -Xmx512M -Xms512M.
>>
>> Are there any tweaks I can use to get the original index time
>> back. I read through the release notes and was expecting a
>> speed up. I saw the bit about increasing ramBufferSizeMB and set
>> it to 64MB; it had no effect.
>> -- 
>>
>> ===============================================================
>> Fergus McMenemie               Email:[EMAIL PROTECTED]
>> Techmore Ltd                   Phone:(UK) 07721 376021
>>
>> Unix/Mac/Intranets             Analyst Programmer
>> ===============================================================

-- 

===============================================================
Fergus McMenemie               Email:[EMAIL PROTECTED]
Techmore Ltd                   Phone:(UK) 07721 376021

Unix/Mac/Intranets             Analyst Programmer
===============================================================

Reply via email to