Yes, this may be my problem.

But is there any way to have only one "men" keyword indexed when I've
got something like this:

1 - k1_en = men;business;Men
or :
2 - k1_en = man,business,men
or :
3 - k1_en = Man,men,business,Men,man
...
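
One workaround I can imagine, outside the Solr analyzer chain, is to
deduplicate the keyword list on the client before sending the document.
A minimal Python sketch, assuming the only separators are ';' and ',' and
that case-insensitive matching is enough (unique_keywords is a hypothetical
helper name, not something Solr provides):

```python
import re


def unique_keywords(raw):
    """Split on ';' or ',', lowercase, and keep the first occurrence of each keyword."""
    seen = set()
    result = []
    for part in re.split(r"[;,]", raw):
        keyword = part.strip().lower()
        # Skip empty pieces and anything already seen (case-insensitively).
        if keyword and keyword not in seen:
            seen.add(keyword)
            result.append(keyword)
    return result


if __name__ == "__main__":
    for raw in ("men;business;Men", "man,business,men", "Man,men,business,Men,man"):
        # The joined string is what would be sent as the k1_en field value.
        print(";".join(unique_keywords(raw)))
```

With the field value cleaned this way, each keyword would appear at most
once per document, so termFreq for "men" should be the same for every
document that contains it. Note that "man" and "men" stay separate
keywords here.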

Thx in advance,

On Wed, Oct 1, 2008 at 5:12 PM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote:

> Hi,
>
> Note that RemoveDuplicatesTokenFilterFactory "filters out any tokens which
> are at the same logical position in the tokenstream as a previous token with
> the same text."
>
> So if you have "men in black are real men" then
> RemoveDuplicatesTokenFilterFactory will not remove duplicate "men".
>
> This may or may not be your problem.
>
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
>
> ----- Original Message ----
> > From: KLessou <[EMAIL PROTECTED]>
> > To: solr-user@lucene.apache.org
> > Sent: Wednesday, October 1, 2008 9:48:28 AM
> > Subject: termFreq always = 1 ?
> >
> > Hi,
> >
> > I want to index a list of keywords.
> >
> > When I search "k1_en:men", I find a lot of documents like that :
> >
> > DocA :
> > (k1_en = man;men;Men;business... termFreq=2)
> > DocB :
> > (k1_en = man;Men;business... termFreq=1)
> > DocC :
> > ...
> > DocD :
> > ...
> > DocE :
> > ...
> >
> > But I don't want to have a different termFreq for DocA & DocB.
> >
> > I tried RemoveDuplicatesTokenFilterFactory, but it doesn't seem to help
> > me :-/
> >
> > ignoreCase="true"/>
> >
> > protected="protwords.txt" />
> >
> >                     generateWordParts="0"
> >                     generateNumberParts="0"
> >                     catenateWords="0"
> >                     catenateNumbers="0"
> >                     catenateAll="0"
> >                     />
> >
> > />
> >
> > ignoreCase="true"/>
> >
> > protected="protwords.txt" />
> >
> >                     generateWordParts="0"
> >                     generateNumberParts="0"
> >                     catenateWords="0"
> >                     catenateNumbers="0"
> >                     catenateAll="0"
> >                     />
> >
> > ...
> >
> > required="false" />
> >
> > If you have any idea, thx in advance.
> >
> > --
> > ~~~~~
> > | klessou |
> > ~~~~~
>
>


-- 
~~~~~
| klessou |
~~~~~
