Can you tell from the logs what Solr is doing during that time?
Do you have any warming queries configured?
Also see this: https://issues.apache.org/jira/browse/SOLR-6679
  (comment out suggester related stuff if you aren't using it)

-Yonik
http://heliosearch.org - native code faceting, facet functions,
sub-facets, off-heap data


On Mon, Nov 3, 2014 at 11:03 AM, Po-Yu Chuang <ratbert.chu...@gmail.com> wrote:
> Hi,
>
> I am using Solr 4.9 with Tomcat and it works fine, except that deploying
> solr.war takes too long. While Solr is deploying, all webapps on Tomcat
> stop responding, which is unacceptable. Most articles I found say that this
> might result from a big transaction log caused by uncommitted documents,
> but this is not my case.
>
> At first, the Solr data was 280G and the start-up time was 30 minutes. Then I
> set a field to stored="false" and re-indexed all the data. The data size became
> 185G and the start-up time dropped to 17 minutes, but that is still too long.
>
> Here are some numbers I measured:
>
> 1)
> Solr home: 280G
> tlog: 500K
> 30 min to start up
> While starting up, disk read is constantly about 50MB/s (according to
> dstat). So it seems that Solr reads 30m * 60s * 50MB/s = 90GB of data while
> starting up, which is about 30% of the index data size.
>
> 2)
> Solr home: 185G
> tlog: 5M
> 17 minutes to start up
> While starting up, disk read is constantly about 5MB/s (according to
> dstat). So it seems that Solr reads 17m * 60s * 5MB/s = 5GB of data while
> starting up, which is about 3% of the index data size.
>
> p.s. I committed after every 1000 documents added and ran an optimize
> after all documents were added.
>
> Any ideas or suggestions would be appreciated.
>
> Thanks,
> Po-Yu
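
The read-volume estimates in the quoted message can be double-checked with a few lines of Python. This is only a sketch: the function name and the run labels are made up for illustration, the dstat read rate is assumed to be constant for the whole start-up, and 1GB is taken as 1000MB.

```python
def startup_read_gb(minutes, read_mb_per_s):
    """Estimate data read during start-up, in GB (assuming 1 GB = 1000 MB).

    Assumes the read rate reported by dstat stays constant for the
    whole start-up period, as described in the quoted message.
    """
    return minutes * 60 * read_mb_per_s / 1000


# (label, start-up minutes, read rate in MB/s, Solr home size in GB)
# Numbers are the two measurements from the quoted message.
runs = [
    ("run 1 (280G)", 30, 50, 280),
    ("run 2 (185G)", 17, 5, 185),
]

for label, minutes, rate, index_gb in runs:
    read_gb = startup_read_gb(minutes, rate)
    share = 100 * read_gb / index_gb
    print(f"{label}: ~{read_gb:.1f} GB read, ~{share:.0f}% of Solr home")
```

Under those assumptions the first run comes out at roughly 90GB (a little over 30% of the Solr home) and the second at roughly 5GB (about 3%), which matches the figures given above.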
