I thought I would get some more experts giving me more insight about the
methods of scraping.

I want to grab the body content of pages say of Wordpress but not through
RSS. I would assume the pages are static only. And try to scrape the  body
content but avoiding  sidebar, footer, header etc.

I tried with the DOM and its fun. But just wanting to know some expert
experience on specific to my problem.

Thanks in advance.

Reply via email to