2007/4/30, Graeme Merrall <[EMAIL PROTECTED]>:
> I want to crawl http://www.amazone.com/ and extract only the product title,
> product information, writer, and publisher.
>
> I want to ignore all other data.
How about
http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html
or if you're prepared to wait or help out there's
http://svn.apa
You mean they just use RSS feeds for their data, rather than crawling?
2007/4/30, Fuad Efendi <[EMAIL PROTECTED]>:
H.
Even PriceGrabber, CNET, and Froogle can't do it!!! They simply
publish RSS feeds.
http://www.tokenizer.org
-----Original Message-----
From: James liu
Sent: Saturday, April 28, 2007 11:12 PM
To: solr-user@lucene.apache.org
Subject: i wanna find one crawl that can crawl with define