Another approach is to do a htdig -v   ...  > logs
In the file "logs" there will be lines like:

Not found: http://www.bc.ic.ac.uk/research/selkirk/selkrefs.html Ref:
http://www.cmmi.ic.ac.uk/selkirk.html

If you do    grep "Not found: " logs   > missingpages
there will be all the missing pages and their referrers.






-----Original Message-----
From: Gabriele Bartolini [mailto:[EMAIL PROTECTED]]
Sent: Friday, November 22, 2002 3:24 PM
To: Malcolm Austen; Cliff Addy
Cc: [EMAIL PROTECTED]
Subject: Re: [htdig] Bad link information


At 14.31 22/11/2002 +0000, Malcolm Austen wrote:
>On Fri, 22 Nov 2002, Cliff Addy wrote:
>
>+ By using the -t option to htdig, I can see all the "not found" links and
>+ the text of that link. But how do I find out *which* document has the
>+ link?  BTW, we're talking about an search system that is indexing ~150
>+ sites, so a hand search is not practical.
>
>-s is what you want Cliff

Otherwise, if you are looking for more info about links, you could probably 
find interesting a related project to ht://Dig, that is to say ht://Check. 
You can find more info on the related projects page of ht://Dig or directly 
on htcheck.sf.net .

Cheers,
-Gabriele
--
Gabriele Bartolini - Web Programmer - ht://Dig & IWA Member - ht://Check 
maintainer
Current Location: Prato, Tuscany, Italia
[EMAIL PROTECTED] | http://www.prato.linux.it/~gbartolini | ICQ#129221447
 > find bin/laden -name osama -exec rm {} \;



-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to
<[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html


-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to