paulosalamat wrote:
Hi I'm new to this group,
I would like to ask a question:
What does it mean when you see a plus sign in between two words inside
synonyms.txt?
e.g.
macbookair => macbook+air
Thanks,
Paulo
Welcome, Paulo!
It depends on your tokenizer. You can specify a tokenizer via
tokenizerFactory attribute when you use SynonymFilterFactory.
The tokenizer is used when SynonymFilterFactory reads the
synonyms.txt. If you do not specify it, WhitespaceTokenizer
will be used as default.
In the above example, the term text "macbookair" will be
normalized to the term text "macbook+air", if WhitespaceTokenizer
is used.
Koji
--
http://www.rondhuit.com/en/