Tracy,

Pay attention on the new benchmarks and new jvm behavior. Changing the heap
might not be solving your problem, but just delaying it from happening.

2008/5/11 David Pratt <[EMAIL PROTECTED]>:

> Hi Tracy. Can you advise the sort of difference in max heap space that
> resulted in the improvement, that is, your before and after max heap space.
> Many thanks.
>
> Regards,
> David
>
>
> Tracy Flynn wrote:
>
>> Thanks for the replies.
>>
>> For a completely different reason, I happened to look at the memory stats
>> for all processes including the SOLR instances. Noticed that the SLOW Solr
>> instance was maxing out with more virtual memory than allocated. After
>> boosting the maximum heap space and restarting, everything started to run at
>> 4x-5x the speed before the fix - and at the rate I reasonably thought it
>> should.
>>
>> Tracy
>>
>> On May 9, 2008, at 8:02 AM, Tracy Flynn wrote:
>>
>>  Hi,
>>>
>>> I'm starting to see significant slowdown in loading performance after I
>>> have loaded about 400K documents.  I go from a load rate of near 40 docs/sec
>>> to 20- 25 docs a second.
>>>
>>> Am I correct in assuming that, during indexing operations, Lucene/SOLR
>>> tries to hold as much of the indexex in memory as possible? If so, does the
>>> slowdown indicate need to increase JVM heap space?
>>>
>>> Any ideas / help would be appreciated
>>>
>>> Regards,
>>>
>>> Tracy
>>>
>>> ---------------------------------------------------------------------------------------------------------------------
>>>
>>>
>>> Details
>>>
>>> Documents loaded as XML via POST command in batches of 1000, commit after
>>> each batch
>>>
>>> Total current documents ~ 450,000
>>> Avg document size: 4KB
>>> One indexed text field contains 3KB or so. (body field below - standard
>>> type 'text')
>>>
>>> Dual XEON 3 GHZ 4 GB memory
>>>
>>> SOLR JVM Startup options
>>>
>>> java -Xms256m -Xmx1000m  -jar start.jar
>>>
>>>
>>> Relevant portion of the schema follows
>>>
>>>
>>>  <field name="document_id" type="string" indexed="true" stored="true"
>>> required="true"/>
>>>  <field name="language" type="string" indexed="true" stored="true"
>>> required="false"/>
>>>  <field name="languages" type="string" indexed="true" stored="true"
>>> required="false"/>
>>>  <!-- The value specified for folding_id must be a field of type
>>> "integer" -
>>>       type "sint" does not work -->
>>>  <field name="folding_id" type="integer" indexed="true" stored="true"
>>> required="false" default="0"/>
>>>  <field name="document_type" type="string" indexed="true" stored="true"
>>> required="true"/>
>>>  <field name="title" type="text" indexed="true" stored="true"
>>> required="false"/>
>>>  <field name="body" type="text" indexed="true" stored="true"
>>> required="false" compressed="true"/>
>>>  <field name="teaser" type="text" indexed="no" stored="true"
>>> required="false"/>
>>>  <field name="articles_in_category" type="sint" indexed="true"
>>> stored="true" required="false" default="0"/>
>>>  <field name="pen_name" type="text" indexed="true" stored="true"
>>> required="false"/>
>>>  <field name="article_id" type="sint" indexed="true" stored="true"
>>> required="false" default="0"/>
>>>  <field name="article_status_id" type="sint" indexed="true" stored="true"
>>> required="false" default="0"/>
>>>  <field name="user_id" type="sint" indexed="true" stored="true"
>>> required="false" default="0"/>
>>>  <field name="user_name" type="text" indexed="true" stored="true"
>>> required="false"/>
>>>  <field name="user_email" type="text" indexed="true" stored="true"
>>> required="false"/>
>>>  <field name="channel_context" type="sint" indexed="true" stored="true"
>>> required="false" multiValued="true"/>
>>>  <field name="category_id" type="sint" indexed="true" stored="true"
>>> required="false" default="0"/>
>>>  <field name="category_status_id" type="sint" indexed="true"
>>> stored="true" required="false" default="0"/>
>>>  <field name="category_title" type="text" indexed="true" stored="true"
>>> required="false"/>
>>>  <field name="category_keywords" type="text" indexed="true" stored="true"
>>> required="false" multiValued="true"/>
>>>  <field name="category_type" type="text" indexed="true" stored="true"
>>> required="false"/>
>>>  <field name="channel_id" type="sint" indexed="true" stored="true"
>>> required="false" default="0"/>
>>>  <field name="channel_title" type="text" indexed="true" stored="true"
>>> required="false"/>
>>>  <field name="helium_rank" type="sint" indexed="false" stored="true"
>>> required="false" default="0"/>
>>>  <field name="helium_rank_percentile" type="sfloat" indexed="false"
>>> stored="true" required="false"/>
>>>  <field name="helium_scaled_rank_boost" type="sfloat" indexed="true"
>>> stored="true" required="false"/>
>>>  <field name="helium_scaled_rank_boost_string" type="string"
>>> indexed="true" stored="true" required="false"/>
>>>   <!--
>>>   <field name="title_popularity" type="sint" indexed="true" stored="true"
>>> default="0"/>
>>>   <field name="title_recent_popularity" type="sint" indexed="true"
>>> stored="true" default="0"/>
>>>   <field name="title_views_measure" type="sint" indexed="true"
>>> stored="true" default="0"/>
>>>   <field name="title_recent_earnings_measure" type="sint" indexed="true"
>>> stored="true" default="0"/>
>>>   <field name="title_earnings_measure" type="sint" indexed="true"
>>> stored="true" default="0"/>
>>>  -->
>>>  <field name="created_date" type="date" indexed="true" stored="true"
>>> required="false" />
>>>
>>>
>>>
>>
>>
>


-- 
Alexander Ramos Jardim

Reply via email to