I have a local data store containing a host of different document types. This data store is separate from a remote Solr install making streaming not an option. Instead I'd like to generate an XML file that contains all of the documents including content and metadata.

What would be the most appropriate way to accomplish this? I could use the Tika CLI to generate XML but I'm not sure it would work or that its the most efficient way to handle things. Can anyone offer some suggestions?


Thanks - Tod

Reply via email to