Re: Synonym and Whitespaces and optional TokenizerFactory

2011-08-18 Thread Ravi Solr
If you have multi-word synonyms, you could use tokenizerFactory="solr.KeywordTokenizerFactory" in the SynonymFilterFactory declaration. This assumes that the tokenizer for that field keeps phrases as a single token (achieved by using solr.KeywordTokenizerFactory
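
For readers skimming the thread, the suggestion above might look roughly like the following in schema.xml. This is only a sketch, not the poster's actual configuration: the field type name "text_syn" and the file name "synonyms.txt" are illustrative assumptions, and the ignoreCase and expand values are placeholders to adjust.

```xml
<!-- Hypothetical field type; "text_syn" and "synonyms.txt" are illustrative names. -->
<fieldType name="text_syn" class="solr.TextField">
  <analyzer>
    <!-- Field tokenizer: emits the whole field value as one token,
         which is the precondition the reply mentions. -->
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <!-- tokenizerFactory tells the synonym filter how to split entries
         when it parses the synonyms file; with the keyword tokenizer a
         multi-word entry stays a single token. -->
    <filter class="solr.SynonymFilterFactory"
            synonyms="synonyms.txt"
            ignoreCase="true"
            expand="true"
            tokenizerFactory="solr.KeywordTokenizerFactory"/>
  </analyzer>
</fieldType>
```

With a setup like this, a synonyms.txt line such as "lms, learning management system" (the example from the original question) would be read as two entries, the second one containing spaces. Whether that matches at index or query time still depends on the field's own tokenizer producing the same single token.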

Re: Synonym and Whitespaces and optional TokenizerFactory

2011-08-18 Thread Markus Jelsma
How about escaping white\ space?

cheers

> Hmmm, why doesn't the multi word synonym syntax in your
> synonym.txt handle this case? Or am I missing something
> totally?
>
> Best
> Erick
>
> On Wed, Aug 17, 2011 at 10:02 PM, Will Milspec wrote:
> > Hi all,
> >
> > This may be obvious. My quest

Re: Synonym and Whitespaces and optional TokenizerFactory

2011-08-18 Thread Erick Erickson
Hmmm, why doesn't the multi word synonym syntax in your synonym.txt handle this case? Or am I missing something totally?

Best
Erick

On Wed, Aug 17, 2011 at 10:02 PM, Will Milspec wrote:
> Hi all,
>
> This may be obvious. My question pertains to use of tokenizerFactory
> together with SynonymFilt

RE: Synonym and Whitespaces and optional TokenizerFactory

2011-08-18 Thread Jaeger, Jay - DOT
nt: Wednesday, August 17, 2011 9:02 PM
To: solr-user@lucene.apache.org
Subject: Synonym and Whitespaces and optional TokenizerFactory

Hi all,

This may be obvious. My question pertains to use of tokenizerFactory together with SynonymFilterFactory. Which tokenizerFactory does one use to treat "

Synonym and Whitespaces and optional TokenizerFactory

2011-08-17 Thread Will Milspec
Hi all,

This may be obvious. My question pertains to the use of tokenizerFactory together with SynonymFilterFactory. Which tokenizerFactory does one use to treat "synonyms with spaces" as one token? For example, these two entries are synonyms: "lms", "learning management system" index time expansion wou