So you can't even communicate with the remote Solr process by HTTP?
Because if you can, SolrJ would work.

Otherwise, you're stuck with creating a bunch of Solr-style XML
documents, they have a simple format. See the example/exampleDocs
directory in the standard distribution. You'll have to parse the
separate document types and put your required data into the Solr
XML format...

But I really don't understand why you need to. A Solr installation
that you can't get to via http is pretty useless, although I suppose
there can be security setups that preclude this. Assuming you can
get there via http, consider a SolrJ program combined with Tika to
parse the docs you have in all these formats and send them to Solr
via SolrJ...

Best
Erick

On Wed, Jan 25, 2012 at 7:41 AM, Tod <listac...@gmail.com> wrote:
> I have a local data store containing a host of different document types.
>  This data store is separate from a remote Solr install making streaming not
> an option.  Instead I'd like to generate an XML file that contains all of
> the documents including content and metadata.
>
> What would be the most appropriate way to accomplish this?  I could use the
> Tika CLI to generate XML but I'm not sure it would work or that its the most
> efficient way to handle things.  Can anyone offer some suggestions?
>
>
> Thanks - Tod

Reply via email to