Hi,
I've got a cron job running every night. It executes rundig.sh. The
.conf file is specified as the conf file we use (not the default .conf
file). htdig indexes our intranet. The url for the intranet (and the
start url) is http://inside.hinshawlaw.com/
The problem is, however, htdig is also indexing our web site at
http://www.hinshawlaw.com/
I've checked to see if there are any other cron jobs running that might
be executing a different .conf file and can't find any. I've checked and
double-checked the .conf file we are using and it appears to be correct.
Is it possible that in 3.1.6 the start url drops the beginning of the
url string and only starts parsing at the domain? If not, any ideas what
might be going on here?
Thanks in advance!
Ted
------------------------------------------------------------------------------------
Homepage: http://www.tedmasterweb.com/
My JavaScript Window Management Tool: http://www.tedmasterweb.com/wmo/
-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
- Re: [htdig] scope of start_url in .conf (htdig 3.1.6) Ted Stresen-Reuter

