Re: crawler feed?

rubdabadub Wed, 07 Feb 2007 10:12:27 -0800

Hi:

Just want to say that my tiny experiment with Sami's Solr/Nutch
integration worked :-)  Super thanks for the pointer. Which leads me
to write the following.


It would be great if I could use this in my current project. This way
I can eliminate my current Python-based aggregator/crawler, which was
used to submit docs to Solr. This solution works, but the crawler is
not as robust as I wanted it to be. As far as I understand, SOLR-20
seems to be good to go for trunk, no?

So I am lobbying for SOLR-20 :-)

Cheers


On 2/7/07, rubdabadub <[EMAIL PROTECTED]> wrote:

This is really interesting. You mean to say I could give the patch a
try now, i.e. the patch in the blog post? :-)

I am looking forward to it. I hope it will be standalone, i.e. you
don't need "the whole Nutch" to get a standalone crawler working. I
am not sure if this is how you planned it.

Regards

On 2/7/07, Sami Siren <[EMAIL PROTECTED]> wrote:
> rubdabadub wrote:
> > Hi:
> >
> > Are there any relatively stand-alone crawlers that are
> > suitable/customizable for Solr? Has anyone done any trials? I have
> > seen some discussion about the Cocoon crawler. Was that successful?
>
> There's also an integration path available for Nutch[1] that I plan to
> integrate after 0.9.0 is out.
>
> --
>  Sami Siren
>
> [1] http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html
>
