I've been able to run Solr on Azure with this

https://github.com/MSOpenTech/Windows-Azure-Solr

Had to do some modification as we didn't need multiple slaves etc. but
otherwise this was our jumping off point.

If your Azure instance can mount the drive with files than you should be
able to expose it to Tika. I don't think that it will be any different than
running Solr on Azure and indexing DB or XML.

Above project "boostraps" itself with all of the Java and Solr files it
needs to run and starts Solr using bundled in Jetty web server, so as long
as you have Tika in your libs and a configured handler you should be able
to use it.

Radek.

On Tue, Oct 30, 2012 at 4:31 AM, Aloke Ghoshal <alghos...@gmail.com> wrote:

> Hi,
>
> Looking for feedback on running Solr Core/ Tika parsing engine on Azure.
> There's one offering for Solr within Azure from Lucid works. This offering
> however doesn't mention Tika.
>
> We are looking at options to make content from files (doc, excel, pdfs,
> etc.) stored within Azure storage search-able. And whether the parser could
> run against our Azure store directly to index the content. The other option
> could be to write a separate connector that streams in the files. Let me
> know if you have experience along these lines.
>
> Regards,
> Aloke
>

Reply via email to