Re: Synonym and Whitespaces and optional TokenizerFactory

2011-08-18 Thread Ravi Solr
If you have multi-word synonyms you could use - tokenizerFactory="solr.KeywordTokenizerFactory" - in the SynonymFilterFactory filter factory declaration. This assumes that your tokenizer for that field allows for keeping the phrases as a single token (achieved by using solr.KeywordTokenizerFactory

Re: Synonym and Whitespaces and optional TokenizerFactory

2011-08-18 Thread Ravi Solr
If you have multi-word synonyms you could use - tokenizerFactory="solr.KeywordTokenizerFactory" - in the SynonymFilterFactory filter factory declaration. This assumes that your tokenizer for that field allows for keeping the phrases as a single token (achieved by using solr.KeywordTokenizerFactory

Re: Synonym and Whitespaces and optional TokenizerFactory

2011-08-18 Thread Markus Jelsma
How about escaping white\ space? cheers > Hmmm, why doesn't the multi word synonym syntax in your > synonym.txt handle this case? Or am I missing something > totally? > > Best > Erick > > On Wed, Aug 17, 2011 at 10:02 PM, Will Milspec wrote: > > Hi all, > > > > This may be obvious. My quest

Re: Synonym and Whitespaces and optional TokenizerFactory

2011-08-18 Thread Erick Erickson
Hmmm, why doesn't the multi word synonym syntax in your synonym.txt handle this case? Or am I missing something totally? Best Erick On Wed, Aug 17, 2011 at 10:02 PM, Will Milspec wrote: > Hi all, > > This may be obvious. My question pertains to use of tokenizerFactory > together with SynonymFilt

RE: Synonym and Whitespaces and optional TokenizerFactory

2011-08-18 Thread Jaeger, Jay - DOT
You could presumably do it with solr.PatternTokenizerFactory with the pattern set to .* as your Or, maybe, if Solr allows it, you don't use any tokenizer at all? Or, maybe you could use solr.WhitespaceTokenizerFactory, allowing it to split up the words, along with solr.WordDelimiterFilterFacto