Hi,
I've been trying to understand Lucene's file format and I keep getting hung up 
on one detail - how can Lucene quickly find the frequency data (or proximity 
data) for a particular term? According to the file formats page on the Lucene 
website<http://lucene.apache.org/java/2_2_0/fileformats.html#Term%20Dictionary>,
 the FreqDelta field in the Term Info file (.tis) is relative to the previous 
term. How is this helpful? The few references I've found on the web for this 
subject make it sound like the Term Dictionary has direct pointers to the 
frequency data for a given term, but that isn't consistent with the 
aforementioned reference.

Thanks for your help,
Gio.

Reply via email to