Wouldn't it be easier to just add ?= to the exclude_urls: list in htdig.conf?
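
A rough sketch of how that suggestion might behave, assuming (as the ht://Dig documentation describes) that exclude_urls rejects any URL containing one of its space-separated patterns as a substring. The helper `is_excluded`, the example URLs, and the starting list `/cgi-bin/ .cgi` are illustrative assumptions, not ht://Dig code. It also shows that `?=` written as one token only matches URLs with that literal two-character sequence, while `?` and `=` as separate tokens match any URL with a query string:

```python
# Hypothetical helper: mimics substring-style exclusion, not ht://Dig's own code.
def is_excluded(url, exclude_patterns):
    """Return True if the URL contains any of the patterns as a substring."""
    return any(pattern in url for pattern in exclude_patterns)

# Two ways the exclude_urls value might be written (assumed starting list).
separate = "/cgi-bin/ .cgi ? =".split()   # '?' and '=' as separate patterns
combined = "/cgi-bin/ .cgi ?=".split()    # '?=' as a single pattern

urls = [
    "http://www.example.com/index.html",
    "http://www.example.com/page.php?id=5",
    "http://www.example.com/search?=foo",
]

for url in urls:
    # Print the URL followed by the verdict for each variant of the list.
    print(url, is_excluded(url, separate), is_excluded(url, combined))
```

If the goal is to skip every URL with a query string, listing `?` and `=` separately is probably what is wanted.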

Hugh

Stefan Seiz wrote:

On 01.12.2004 11:45 Uhr, Andre <[EMAIL PROTECTED]> wrote:

> Hi,
>
> Can htdig skip URLs containing characters like '?', '=' or '*' when it is
> crawling?


Here's what I have in my .conf file to ignore changing session IDs in my URLs:

    url_rewrite_rules: (.*)&pb-id=.* \\1

This should give you a starting point, I assume.
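
To see what a rule like this does to a URL, here is a small Python approximation. It is only a sketch: ht://Dig applies its own regex engine, so Python's `re` module stands in for it here, and the function name `rewrite` and the example URLs are made up for illustration.

```python
import re

# Python stand-in for the rule "(.*)&pb-id=.* \\1" (an approximation only).
RULE = re.compile(r"(.*)&pb-id=.*")

def rewrite(url):
    """Drop '&pb-id=...' and everything after it; leave other URLs untouched."""
    return RULE.sub(r"\1", url, count=1)

examples = [
    "http://www.example.com/page.php?id=5&pb-id=abc123",
    "http://www.example.com/page.php?id=5&pb-id=abc123&x=1",
    "http://www.example.com/page.php?pb-id=abc123",
]

for url in examples:
    # Show each original URL next to its rewritten form.
    print(url, "->", rewrite(url))
```

Two things worth noting under these assumptions: anything after the session ID is dropped along with it, and a URL where pb-id is the first parameter (after `?` rather than `&`) is left unchanged, so a second rule may be needed for that case.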
--
Stefan Seiz <http://www.StefanSeiz.com>
Spamto: <[EMAIL PROTECTED]>




-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://productguide.itmanagersjournal.com/
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general




--
Hugh Caley | Unix Systems Administrator | CIS
AFFYMETRIX, INC. | 6550 Vallejo St. Ste 100 | Emeryville, CA 94608
Tel: 510-428-8537 | [EMAIL PROTECTED]



