Chris Harris <[EMAIL PROTECTED]> wrote:
> It's my understanding that if my mergeFactor is 10, then there
> shouldn't be more than 11 segments in my index directory (10 segments,
> plus an additional segment if a merge is in progress).

Actually, mergeFactor 10 means each *level* will have <= 10 segments,
where a level is roughly 10X the size of the previous level.

EG after 10 segments (level 0) are flushed, they get merged into a
single level 1 segment.  Another 10 produces another level 1 segment.
Etc.  Until you have 10 level 1 segments, which then get merged into a
single level 2 segment.

The number of levels you have is logarithmic in the size of your index.

> I'm noticing that _2pk, _2pl, _2pm, _2pn, _2po are sequential file
> names, alphabetically speaking, and their last modified times are very
> close to one another. Does this mean they're actually part of the same
> segment, even though they are in separate files?

No, these are different segments, just flushed shortly after one
another in time.

> I just discovered that one of my other indexes has over 11,000 tis
> files. That's disturbing. I'm not sure if it would have the same
> underlying cause.

That does NOT sound right.  Can you provide more details how this
index is created/maintained?

Mike

Reply via email to