Hi guys


I'm running a Solr instance (trunk) on my dev server to test my
configuration. I'm doing a DIH full import to index 49 PDF files together
with their corresponding database records. Both the PDF files and the
database are local to the server.

*Server : *

·         Windows 2008 R2

·         MS SQL Server 2008 R2

·         16-core processor

·         16 GB RAM

*Tomcat (7.0.5) : *

·         Set JAVA_OPTS = %JAVA_OPTS%  -Xms1024M  -Xmx8192M

*Solrconfig:*

·         Main index configurations
    <ramBufferSize>2048</ramBufferSize>
    <mergeFactor>50</mergeFactor>

*DIH configuration:*

·         2 data sources defined: JdbcDataSource and BinFileDataSource

·         One main entity with 3 sub entities

<entity dataSource="myJdbc" …>

    <entity dataSource="myBinFile" …> </entity>

    <entity dataSource="myJdbc" …> </entity>

    <entity dataSource="myJdbc" …> </entity>

</entity>

·         The schema has 8 fields in total, three of which are multivalued
text fields.

*My DIH import Status Messages:*

·         Total Requests made to DataSource = 99

·         Total Rows Fetched = 2124

·         Total DocumentsProcessed = 49

·         Time Taken = 0:2:3:880
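
To put these figures in per-document terms, here is a small Python sketch of the arithmetic. It assumes the time string reads as hours:minutes:seconds:milliseconds, exactly as pasted above, and that one of the 99 requests is the root entity query; the function names are just for illustration.

```python
def parse_dih_time(text):
    """Convert an 'h:m:s:ms' string (assumed format) into seconds."""
    hours, minutes, seconds, millis = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds + millis / 1000.0


def import_stats(requests, rows, docs, time_taken):
    """Derive per-document averages from the DIH status counters."""
    elapsed = parse_dih_time(time_taken)
    return {
        "elapsed_seconds": elapsed,
        "seconds_per_doc": elapsed / docs,
        "rows_per_doc": rows / docs,
        # Assumption: one request is the root entity query,
        # the remainder are sub-entity requests.
        "sub_requests_per_doc": (requests - 1) / docs,
    }


stats = import_stats(requests=99, rows=2124, docs=49, time_taken="0:2:3:880")
for name, value in stats.items():
    print(f"{name}: {value:.2f}")
```

Under those assumptions this works out to roughly 2.5 seconds per document, about 43 rows fetched per document, and 2 data source requests per document on top of the root query.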

*Is this time reasonable, or can it be improved?*
