I've been able to run Solr on Azure with this https://github.com/MSOpenTech/Windows-Azure-Solr
Had to do some modification as we didn't need multiple slaves etc. but otherwise this was our jumping off point. If your Azure instance can mount the drive with files than you should be able to expose it to Tika. I don't think that it will be any different than running Solr on Azure and indexing DB or XML. Above project "boostraps" itself with all of the Java and Solr files it needs to run and starts Solr using bundled in Jetty web server, so as long as you have Tika in your libs and a configured handler you should be able to use it. Radek. On Tue, Oct 30, 2012 at 4:31 AM, Aloke Ghoshal <alghos...@gmail.com> wrote: > Hi, > > Looking for feedback on running Solr Core/ Tika parsing engine on Azure. > There's one offering for Solr within Azure from Lucid works. This offering > however doesn't mention Tika. > > We are looking at options to make content from files (doc, excel, pdfs, > etc.) stored within Azure storage search-able. And whether the parser could > run against our Azure store directly to index the content. The other option > could be to write a separate connector that streams in the files. Let me > know if you have experience along these lines. > > Regards, > Aloke >