On 3/26/07, nick19701 <[EMAIL PROTECTED]> wrote:

But here the "signature" field has field type "string". When you index it,
you put the whole string somewhere and give it an id, for example, 323454.

In a doc, you only need to reference this id 323454 if the doc happens to
contain the same signature value.

Now suppose I have a lot of docs with the same signature, and the signature
is a very long string. It seems to me indexing the signature will save me
hard disk space.

In short, what I mean is that if you index a "string" field, you can
retrieve it without loss. So you don't need to store it separately.
What do you think?

In theory that might be true, but I'm afraid Lucene is not implemented
that way. If you know up front that this is your situation, it is
probably easier to implement this outside of Lucene and "store" the id
in your external index.
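
A minimal sketch of that "outside of Lucene" approach, written in plain
Python with no Lucene calls at all. The class name SignatureTable, its
intern/lookup methods, and the "signature_id" field are hypothetical names
made up for illustration. The idea is only that each distinct signature is
kept once in a side table and each doc carries the small id:

```python
class SignatureTable:
    """Hypothetical side table: each distinct signature is kept exactly once."""

    def __init__(self):
        self._id_by_signature = {}  # signature string -> small integer id
        self._signature_by_id = []  # integer id -> signature string

    def intern(self, signature):
        """Return the id for a signature, assigning a new id the first time it is seen."""
        existing = self._id_by_signature.get(signature)
        if existing is not None:
            return existing
        new_id = len(self._signature_by_id)
        self._signature_by_id.append(signature)
        self._id_by_signature[signature] = new_id
        return new_id

    def lookup(self, signature_id):
        """Recover the full signature string from its id."""
        return self._signature_by_id[signature_id]


table = SignatureTable()
long_signature = "x" * 10000  # stand-in for a very long signature value

# Each doc keeps only the small id; this id is what would be stored with the doc.
docs = [
    {"title": "doc %d" % i, "signature_id": table.intern(long_signature)}
    for i in range(3)
]
```

At retrieval time you would read the id back from the doc and call
table.lookup(...) to get the full signature. In practice the table would
have to be persisted somewhere, for example in a database.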

-Mike
