If I remember correctly, in htdig 3.1.6 you can use htpurge to remove documents from the
database. Build a list of the URLs you want to remove and pass it to htpurge with
"htpurge -U list_of_URLs".
In any case, check the available htpurge options with "htpurge -?".
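
In case it is useful, here is a rough sketch of how the list of URLs could be built. It is only an illustration under my own assumptions: the function names, the example URLs and the one-URL-per-line file format are all hypothetical, and you would need to supply the indexed and still-referenced URL sets yourself (for example from a database dump and a link check of your site).

```python
from pathlib import Path


def unreferenced_urls(indexed, referenced):
    """Return the indexed URLs that are no longer referenced, sorted."""
    return sorted(set(indexed) - set(referenced))


def write_url_list(urls, path):
    """Write one URL per line (assumed format; verify with htpurge -?)."""
    Path(path).write_text("".join(url + "\n" for url in urls), encoding="utf-8")


if __name__ == "__main__":
    # Hypothetical sample data; replace with your own URL sets.
    indexed = [
        "http://intranet.example/index.html",
        "http://intranet.example/old/report.html",
    ]
    referenced = ["http://intranet.example/index.html"]

    stale = unreferenced_urls(indexed, referenced)
    write_url_list(stale, "list_of_URLs")
    print(len(stale), "stale URL(s) written to list_of_URLs")
```

The resulting file is what you would hand to htpurge as described above, once you have confirmed the exact option and the expected file format for your version.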
Hope it helps.
Angel Luis Pérez Hernández
Centro de Competencia Arquitectura Avanzadas
> -----Original Message-----
> From: Wanrong Qiu [mailto:[EMAIL PROTECTED]]
> Sent: Thursday, 21 November 2002 21:22
> To: [EMAIL PROTECTED]
> Subject: [htdig] No more referenced pages gotten indexed
>
>
> Hi htdig folks,
>
> I have used htdig to index our intranet for a couple of years. I use
> incremental indexing to avoid re-indexing everything and to save indexing
> time. Recently, though, I have found more and more outdated pages being
> indexed: pages that are no longer referenced from any other page but have
> not been deleted from the file system. Is there any way to tell htdig not
> to index those pages and, even better, to remove them from the database,
> while still avoiding the -i flag, which starts a completely new dig?
> I use htdig 3.1.6 on Solaris 2.8.
>
> Any help would be much appreciated.
>
> Wayne Qiu
> Senior Web Developer
> IT Application and Web development
> IDT
>
>
>
> -------------------------------------------------------
> This sf.net email is sponsored by:ThinkGeek
> Welcome to geek heaven.
> http://thinkgeek.com/sf
> _______________________________________________
> htdig-general mailing list <[EMAIL PROTECTED]>
> To unsubscribe, send a message to
> <[EMAIL PROTECTED]> with a subject
> of unsubscribe
> FAQ: http://htdig.sourceforge.net/FAQ.html
>