You can think of it as the latter but it's quite a bit more
complicated than that. For details on how lucene stores it's index
check out the file formats page on lucene.
http://lucene.apache.org/java/docs/fileformats.html
Cheers
Rob
On Jan 4, 2008 4:59 PM, Jae Joo <[EMAIL PROTECTED]> wrote:
> ti
title of Document 1 - "This is document 1 regarding china" - fieldtype =
text
title of Document 2 - "This is document 2 regarding china" fieldtype=text
Once it is indexed, will index hold 2 "china" text fields or just 1 china
word which is pointing document1 and document2?
Jae
On Jan 4, 2008
I don't quite understand what you're getting at. What is the problem
you're encountering or what are you trying to achieve?
Cheers
Rob
On Jan 4, 2008 3:26 PM, Jae Joo <[EMAIL PROTECTED]> wrote:
> Hi,
>
> Is there any way to dedup the keyword cross the document?
>
> Ex.
>
> "china" keyword is in d