what should be the fastest way to index a documents , I am indexing huge
collection of data after extracting certain meta - data information
for example author and filename of each files

i am extracting these information and storing in XML format

for example :   <fileid> 1<fileid><author>abc </author>
<filename>abc.doc</filename>
                     <fileid> 2<fileid><author>abc 1111</author>
<filename>abc11111.doc</filename>

I can not index these documents directly to solr as it is not in the format
required by solr ( i can not change the format as its used in other modules)

should converting these file to CSV will be better and faster approach
compared to XML?



please  suggest




-- 
Nipen Mark

Reply via email to