I have a local data store containing a host of different document types.
This data store is separate from a remote Solr install making
streaming not an option. Instead I'd like to generate an XML file that
contains all of the documents including content and metadata.
What would be the most appropriate way to accomplish this? I could use
the Tika CLI to generate XML but I'm not sure it would work or that its
the most efficient way to handle things. Can anyone offer some suggestions?
Thanks - Tod
- Indexing Using XML Message Tod
-