Hi, I want to crawl links and want to identify if link is company website. For example, If I use word 'financial advisory' in google search engine. I will get list of urls in search result. Some of links are company website. I want to identify those links which are company website and index them into solr. Does any body know some api/tools which can identify if link is company website or not, or api/tool which can identify url genre/type on the basis of taxonomy.
Thanks Vineet Yadav