Thanks Radek. I have this as this one on my list as one option to be tried out.
Regards, Aloke On Tue, Oct 30, 2012 at 5:26 PM, Radek Zajkowski <rad...@gmail.com> wrote: > I've been able to run Solr on Azure with this > > https://github.com/MSOpenTech/Windows-Azure-Solr > > Had to do some modification as we didn't need multiple slaves etc. but > otherwise this was our jumping off point. > > If your Azure instance can mount the drive with files than you should be > able to expose it to Tika. I don't think that it will be any different than > running Solr on Azure and indexing DB or XML. > > Above project "boostraps" itself with all of the Java and Solr files it > needs to run and starts Solr using bundled in Jetty web server, so as long > as you have Tika in your libs and a configured handler you should be able > to use it. > > Radek. > > On Tue, Oct 30, 2012 at 4:31 AM, Aloke Ghoshal <alghos...@gmail.com> > wrote: > > > Hi, > > > > Looking for feedback on running Solr Core/ Tika parsing engine on Azure. > > There's one offering for Solr within Azure from Lucid works. This > offering > > however doesn't mention Tika. > > > > We are looking at options to make content from files (doc, excel, pdfs, > > etc.) stored within Azure storage search-able. And whether the parser > could > > run against our Azure store directly to index the content. The other > option > > could be to write a separate connector that streams in the files. Let me > > know if you have experience along these lines. > > > > Regards, > > Aloke > > >