According to Krzysztof Gorgolewski:
> i try to index some pages generated by a php-script. ok, that works,
> but i want to index only such pages which matches one or more regular
> expressions. for example i used
> 
> -include "^http://www.tu-clausthal.de/geomuseum/[^&]+&[^&]+$";
> 
> in an other indexer, but i can't use this with htdig with the
> -limit_urls_to attribute. i've tried different variations of the
> regular expression, but nothing works.
> does any one knows, how to solve this problem? 
> please mail me ([EMAIL PROTECTED]).

I don't know if you already got an answer privately, but I didn't see it
on the list.  I assume you're using one of the 3.2 beta releases (hopefully
a recent 3.2.0b4 snapshot), and not a 3.1.x release.  If so, try this in
your htdig.conf file...

limit_urls_to: [http://www.tu-clausthal.de/geomuseum/[^&]+&[^&]+$]

See also http://www.htdig.org/dev/htdig-3.2/cf_types.html and
http://www.htdig.org/dev/htdig-3.2/attrs.html#limit_urls_to

Regular expressions in pattern lists aren't supported in the 3.1.x
releases, only in 3.2 betas.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/
Dept. Physiology, U. of Manitoba  Winnipeg, MB  R3E 3J7  (Canada)


-------------------------------------------------------
This sf.net email is sponsored by: See the NEW Palm 
Tungsten T handheld. Power & Color in a compact size!
http://ads.sourceforge.net/cgi-bin/redirect.pl?palm0001en
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to