I guess it would be fater if you run this program from a diffferent
box,.that way the disk IO will not be a bottleneck

2009/7/10 Noble Paul നോബിള്‍  नोब्ळ् <noble.p...@corp.aol.com>:
> are these xml files in the solr add xml format?
>
> When you post using curl, I guess it opens as many http connections as
> there are files. if you can write a small program to post all these
> files in one request, you should be able to get better perf.
>
> the following can be the pseudo-code
>
> open connection
> write "<root>"
> for each file
>  write filecontent
> write "</root>"
> close connection
>
>
>
>
> On Fri, Jul 10, 2009 at 10:23 AM, Francis Yakin<fya...@liquid.com> wrote:
>>
>> I have about 1000 folders, each folder consist 2581 xml files. Total of xml 
>> files is ~ 2.6 millions
>>
>> I developed perl script, inside my script it's executed this cmd:
>>
>>  curl http://localhost:7001/solr/update --data-binary "@0039000.xml" -H 
>> 'Content-type:text/plain; charset=utf-8'
>>
>> It tooks me about 4 1/2 hrs to load and commit.
>>
>> I would like to know the advantages using curl to posting/add/update the xml 
>> files to solr comparing with using WebService::Solr module?
>>
>> Is using WebService::Solr faster?
>>
>> The XML files are local on the Solr Master box, so I posting it locally( not 
>> using wan or lan).
>>
>> Any input will be much appreciated.
>>
>> Thanks
>>
>> Francis
>>
>>
>>
>
>
>
> --
> -----------------------------------------------------
> Noble Paul | Principal Engineer| AOL | http://aol.com
>



-- 
-----------------------------------------------------
Noble Paul | Principal Engineer| AOL | http://aol.com

Reply via email to