Hi: Just want to say that my tiny experiment with Sami's Solr/Nutch integration worked :-!) Super thanks for the pointer. Which leads me to write the following..
It would be great if I could use this in my current project. This way I can eliminate my current python based aggregator/crawler which was used to submit docs to Solr. This solution works but the crawler is not as robust as I wanted it to be. As far as I understand SOLR-20 seems to be good to go for trunk? no? So I am lobbying for SOLR-20 :-) Cheers On 2/7/07, rubdabadub <[EMAIL PROTECTED]> wrote:
This is really interesting. You mean to say i could give the patch a try now i.e. the patch in the blog post :-) I am looking forward to it. I hope it will be standalone i.e. you don't need "the whole nutch" to get a standalone crawler working.. I am not sure if this is how you planned. Regards On 2/7/07, Sami Siren <[EMAIL PROTECTED]> wrote: > rubdabadub wrote: > > Hi: > > > > Are there relatively stand-alone crawler that are > > suitable/customizable for Solr? has anyone done any trials.. I have > > seen some discussion about coocon crawler.. was that successfull? > > There's also integration path available for Nutch[1] that i plan to > integrate after 0.9.0 is out. > > -- > Sami Siren > > [1]http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html >