95+ % of the time problems like this are not Solr, but the
data acquisition, i.e. querying the DB, traversing the file system
etc.

We need to have an idea of what the indexing pipeline is all about
before saying anything coherent.

If you're using extractingrequesthandler for Word, PDFs, etc,
you probably want to rack some clients together and move the parsing
off to N clients.

See: http://searchhub.org/2012/02/14/indexing-with-solrj/

This link also demonstrates using a JDBC connection too..

But I'm guessing here...

Best,
Erick


On Tue, Jul 29, 2014 at 11:35 AM, Boogie Shafer <boogie.sha...@proquest.com>
wrote:

> when you say performance is very poor, what is happening at the system
> level?
>
> e.g.
>
> are cpu's pegged out?
>
> is there a lot of IO wait?
>
> is the storage busy?
>
> is the network busy?
>
>
> some easy tools to watch this stuff live if you arent sure and dont have
> full on system monitoring agents installed
>
> nmon
> htop
> iftop
>
> the old standby sar/vmstat/iostat tools can get you the info in a less
> "graphical" format
>
> ________________________________________
> From: Ameya Aware <ameya.aw...@gmail.com>
> Sent: Tuesday, July 29, 2014 10:49
> To: solr-user@lucene.apache.org
> Subject: Scaling Issues
>
> Hi,
>
> I am running Solr with below parameters:
>
> -XX:MaxPermSize=128m -Xms5120m -Xmx5120m -XX:+UseConcMarkSweepGC
> -XX:CMSInitiatingOccupancyFraction=70 -XX:NewRatio=3
> -XX:MaxTenuringThreshold=8 -XX:+CMSParallelRemarkEnabled
> -XX:+ParallelRefProcEnabled -XX:+UseLargePages -XX:+AggressiveOpts
> -XX:-UseGCOverheadLimit
>
>
> I need to index around 300000 documents but with above parameters
> performance is coming very poor around 15000-20000 documents per hour.
>
> This would take a lot of time to index all the documents.
>
> Also, my autocommit in solrconfig.xml is as below:
>
> <autoCommit>
>      <maxTime>150000</maxTime>
>       <openSearcher>false</openSearcher>
> </autoCommit>
>
> <documentCache class="solr.LRUCache" autowarmCount="0" initialSize="256"
> size="256"/>
>
>
>
> I am running Solr on machine having 12GB RAM.
>
>
> Please advice on how can i improve the performance.
>
>
> Thanks,
> Ameya
>

Reply via email to