> On Wed, 14 Apr 2004, Douglas Kline wrote:
> 
> > A scan of the source code in the htdig and htsearch directories confirms th
at
> > the string "search_algorithm" is found only in the latter directory (and th
ere
> > only in the file htsearch.cc and in that file in only one command line).  T
hat
> > doesn't prove that the htdig command isn't affected by the definition of th
at
> > attribute.
> 
> Actually I think it does ;) If htdig doesn't know about it, it isn't going
> to do anything special because of it. In any case there are very few people
> who know the code base even half as well as Gilles, so unless you have some
> really strong evidence to the contrary, I would accept his opinion on this
> one.


I was thinking that the implication of the attribute in the htdig command could
come about by means of a library or perhaps some other file which referenced it
and was part of htdig although not in the htdig sub-directory's source files
but, yes, it doesn't really affect htdig.


> > A repetition of the tests I ran before confirms that a database created by
> > "rundig -a" with an htdig.conf file with the line
> > 
> > search_algorithm:     substring:0.5
> > 
> > can be searched for strings other than whole words by htsearch using the sa
me
> 
> That is because *htsearch* is now using the htdig.conf *with* 
> substring:0.5.
> 
> > htdig.conf file while a database created by "rundig -a" with an htdig.conf 
file

<snip>

> > rundig with these two htdig.conf files are the same size.  However I ran ck
sum
> > on them and got different results.  So their content is different.
> 
> As it should be. There will be fields that change from one dig to the
> next (e.g. timestamps).


The files being scanned hadn't changed but I thought that a time of execution
of the htdig might affect the database.


> > rundig calls htdig, htpurge, htnotify, and htfuzzy.  Might one of the other
s be
> > relevant?  htnotify does not affect the database.  If I understand correctl
y,
> > htfuzzy doesn't either and when I ran it the files in the database weren't
> > changed.
> 
> Not for substring, though there are dependencies on htfuzzy for a number
> of the other algorithms. However I believe everything generated by htfuzzy
> lives in separate databases generated by htfuzzy.


That seems to be the upshot of the documentation and I found that htfuzzy
didn't change the modification date or the cksums of the database files.


> > In trying to pin down whether htdig or htpurge or both had to be run with t
he
> > htdig.conf file with the search_algorithm: line, I found that specifying th
e
> > alternate htdig.conf file with "-c <file name>" didn't have the same effect


> It is hard to say what exactly is going on here because the important bit
> is which configuration file you used with htsearch, not rundig. My guess
> is that in both cases htsearch was using htdig.conf. In the first case the
> htdig.conf file contained the stock search_algorithm line and therefore
> substring searching was disabled. In the second case, htdig.conf was then
> pointing to htdig-substring.conf, which did have the substring algorithm
> specified and thus htsearch was able to perform substring searches.
> 
> It is a safe bet that there is nothing fundamentally wrong with the -c
> option. There are a lot of us who use it on a regular basis with no problem
> whatsoever.


That became clear to me when I ran some more tests and eventually found that
the form I was using for htsearch wasn't having the intended effect of using
the sub-string version of htdig.conf regardless of which file had been named
"htdig.conf".  My earlier tests had been based on that assumption.  The
confusion resulted from the fact that I used the "-c" flag to specify an
alternate conf file for htdig, htpurge, and rundig while for htsearch I used
the HTML form.  Would it be possible to call htsearch with the "-c" flag from
an HTML form?  I suppose that, if that is possible, doing so would preclude the
possibility of the user's deciding on the search form whether he wants string
or whole-word searching.

Douglas

========
Douglas Kline
[EMAIL PROTECTED]




-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to