> On Wed, 14 Apr 2004, Douglas Kline wrote:
>
> > A scan of the source code in the htdig and htsearch directories confirms that
> > the string "search_algorithm" is found only in the latter directory (and there
> > only in the file htsearch.cc and in that file in only one command line). That
> > doesn't prove that the htdig command isn't affected by the definition of that
> > attribute.
>
> Actually I think it does ;) If htdig doesn't know about it, it isn't going
> to do anything special because of it. In any case there are very few people
> who know the code base even half as well as Gilles, so unless you have some
> really strong evidence to the contrary, I would accept his opinion on this
> one.

I was thinking that the implication of the attribute in the htdig command
could come about by means of a library or perhaps some other file which
referenced it and was part of htdig although not in the htdig sub-directory's
source files but, yes, it doesn't really affect htdig.

> > A repetition of the tests I ran before confirms that a database created by
> > "rundig -a" with an htdig.conf file with the line
> >
> >     search_algorithm: substring:0.5
> >
> > can be searched for strings other than whole words by htsearch using the same
>
> That is because *htsearch* is now using the htdig.conf *with*
> substring:0.5.
>
> > htdig.conf file while a database created by "rundig -a" with an htdig.conf file

<snip>

> > rundig with these two htdig.conf files are the same size. However I ran cksum
> > on them and got different results. So their content is different.
>
> As it should be. There will be fields that change from one dig to the
> next (e.g. timestamps).

The files being scanned hadn't changed but I thought that a time of execution
of the htdig might affect the database.

> > rundig calls htdig, htpurge, htnotify, and htfuzzy. Might one of the others be
> > relevant? htnotify does not affect the database. If I understand correctly,
> > htfuzzy doesn't either and when I ran it the files in the database weren't
> > changed.
>
> Not for substring, though there are dependencies on htfuzzy for a number
> of the other algorithms. However I believe everything generated by htfuzzy
> lives in separate databases generated by htfuzzy.

That seems to be the upshot of the documentation and I found that htfuzzy
didn't change the modification date or the cksums of the database files.

> > In trying to pin down whether htdig or htpurge or both had to be run with the
> > htdig.conf file with the search_algorithm: line, I found that specifying the
> > alternate htdig.conf file with "-c <file name>" didn't have the same effect
>
> It is hard to say what exactly is going on here because the important bit
> is which configuration file you used with htsearch, not rundig. My guess
> is that in both cases htsearch was using htdig.conf. In the first case the
> htdig.conf file contained the stock search_algorithm line and therefore
> substring searching was disabled. In the second case, htdig.conf was then
> pointing to htdig-substring.conf, which did have the substring algorithm
> specified and thus htsearch was able to perform substring searches.
>
> It is a safe bet that there is nothing fundamentally wrong with the -c
> option. There are a lot of us who use it on a regular basis with no problem
> whatsoever.

That became clear to me when I ran some more tests and eventually found that
the form I was using for htsearch wasn't having the intended effect of using
the sub-string version of htdig.conf regardless of which file had been named
"htdig.conf". My earlier tests had been based on that assumption. The
confusion resulted from the fact that I used the "-c" flag to specify an
alternate conf file for htdig, htpurge, and rundig while for htsearch I used
the HTML form.

Would it be possible to call htsearch with the "-c" flag from an HTML form?
I suppose that, if that is possible, doing so would preclude the possibility
of the user's deciding on the search form whether he wants string or
whole-word searching.

Douglas

========
Douglas Kline
[EMAIL PROTECTED]
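
Since the confusion in this thread came down to which configuration file htsearch actually read, a quick way to check what a given file sets for search_algorithm may be useful. The following is only a minimal Python sketch: the helper names read_attribute and uses_substring are hypothetical, the "exact:1" value stands in for the stock line (the thread does not quote it), and the parser handles only simple "name: value" lines with "#" comments, not includes or line continuations.

```python
def read_attribute(conf_text, name):
    """Return the last value assigned to `name` in htdig.conf-style text, or None.

    Simplified assumption: one "name: value" pair per line, "#" starts a comment line.
    """
    value = None
    for raw in conf_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, rest = line.partition(":")
        if sep and key.strip() == name:
            value = rest.strip()
    return value


def uses_substring(conf_text):
    """True if the search_algorithm line lists the substring algorithm."""
    value = read_attribute(conf_text, "search_algorithm")
    if value is None:
        return False
    # Each token looks like "algorithm:weight", e.g. "substring:0.5".
    return any(tok.split(":")[0] == "substring" for tok in value.split())


# Illustrative file contents; the stock value is an assumption.
stock = "search_algorithm: exact:1\n"
substring = "# alternate config\nsearch_algorithm: substring:0.5\n"

print(uses_substring(stock))
print(uses_substring(substring))
```

Running this against the file that htsearch really reads (rather than the one passed to rundig with "-c") would show directly whether substring matching is enabled for searches.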

