Hmmmm..... Even PriceGrabber, CNET, and Froooooooogle can't do it!!! They simply publish RSS feeds. http://www.tokenizer.org
-----Original Message----- From: James liu Sent: Saturday, April 28, 2007 11:12 PM To: solr-user@lucene.apache.org Subject: i wanna find one crawl that can crawl with defined urls and defined data for example, i wanna crawl http://www.amazone.com/ and just wanna product title , product information, writer, publisher. and other data i wanna ignore. Maybe someone can recommend it, and it will be appreciate -- regards jl