What would be the fastest way to index documents? I am indexing a huge collection of data after extracting certain metadata from it, for example the author and filename of each file.
I am extracting this information and storing it in XML format, for example:

<fileid>1</fileid><author>abc</author><filename>abc.doc</filename>
<fileid>2</fileid><author>abc 1111</author><filename>abc11111.doc</filename>

I cannot index these documents directly into Solr, because they are not in the format Solr requires (and I cannot change the format, as it is used in other modules). Would converting these files to CSV be a better and faster approach than converting them to Solr's XML format? Please suggest.

-- Nipen Mark
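For reference, the CSV conversion being asked about could look roughly like the sketch below. It is only an illustration under assumptions: the closing tags in the sample are assumed to be well-formed, each <fileid> is assumed to start a new record, and the function names (parse_records, to_csv) and the column order are made up for this example, not part of any existing module.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical sample in the flat layout from the post (well-formed closing tags assumed).
SAMPLE = (
    "<fileid>1</fileid><author>abc</author><filename>abc.doc</filename>"
    "<fileid>2</fileid><author>abc 1111</author><filename>abc11111.doc</filename>"
)


def parse_records(fragment):
    """Group the flat element sequence into one dict per file."""
    # The fragment has no single root element, so wrap it in one before parsing.
    root = ET.fromstring("<root>" + fragment + "</root>")
    records = []
    for elem in root:
        if elem.tag == "fileid":
            records.append({})  # assumption: each <fileid> starts a new record
        if not records:
            continue  # ignore anything that appears before the first <fileid>
        records[-1][elem.tag] = (elem.text or "").strip()
    return records


def to_csv(records, columns=("fileid", "author", "filename")):
    """Render the records as CSV text with a header row."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=columns, lineterminator="\n")
    writer.writeheader()
    for rec in records:
        # Missing fields become empty cells; unknown tags are dropped.
        writer.writerow({col: rec.get(col, "") for col in columns})
    return out.getvalue()


if __name__ == "__main__":
    print(to_csv(parse_records(SAMPLE)), end="")
```

The header row here simply reuses the tag names; for Solr's CSV update handler the column names would have to match fields defined in the schema, which is not known from the sample above. Whether this ends up faster than generating Solr's XML format would need to be measured on the real data.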