what should be the fastest way to index a documents , I am indexing huge
collection of data after extracting certain meta - data information
for example author and filename of each files
i am extracting these information and storing in XML format
for example : <fileid> 1<fileid><author>abc </author>
<filename>abc.doc</filename>
<fileid> 2<fileid><author>abc 1111</author>
<filename>abc11111.doc</filename>
I can not index these documents directly to solr as it is not in the format
required by solr ( i can not change the format as its used in other modules)
should converting these file to CSV will be better and faster approach
compared to XML?
please suggest
--
Nipen Mark