Re: Multiple Words in String

2011-04-03 Thread Erick Erickson
Short form: I think you're going down a rabbit hole and should just use synonyms and forget about it. In particular, I think a general-purpose solution that somehow breaks up or combines adjacent tokens will have consequences that pop up in other places where you don't want them, and you'll have to

Re: Multiple Words in String

2011-04-03 Thread Chris Fauerbach
It's not only a specific case (e.g. microsoft.com); it's really a multi-word issue: carwash, bookkeeper, etc. I'm ultimately looking for a schema for search and retrieval that's heavily focused on 'names'. These are people's names, business names, etc., not content like large text fields,

Re: Multiple Words in String

2011-04-03 Thread Erick Erickson
Is this a general question or a specific one? You can handle specific cases by using synonyms. The general case, that is, treating any pair of adjacent tokens as a single token, seems fraught with unintended consequences, though you know your problem space better than I do. Best, Erick On Sat, Apr 2, 2011 at
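
To make the synonym suggestion concrete, here is a minimal Python sketch of the idea, not Solr's actual synonym filter: a hand-maintained table maps each known compound to its split form, and both forms are emitted so that a query on either one can match. The table contents and the name `expand_tokens` are hypothetical, chosen only for illustration.

```python
# Hypothetical synonym table: each known compound maps to its split form.
# Only listed compounds are expanded, which is what keeps this approach
# free of the side effects a general token-joining rule would have.
SYNONYMS = {
    "carwash": ["car", "wash"],
    "bookkeeper": ["book", "keeper"],
}


def expand_tokens(tokens, synonyms=SYNONYMS):
    """Lowercase each token and follow listed compounds with their split form."""
    expanded = []
    for token in tokens:
        key = token.lower()
        expanded.append(key)
        # Unlisted tokens pass through unchanged.
        expanded.extend(synonyms.get(key, []))
    return expanded


print(expand_tokens("Joes Carwash".split()))
```

Because only the compounds in the table are touched, unrelated names are indexed exactly as before, which is the trade-off being recommended here over a general rule.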

Re: Multiple Words in String

2011-04-03 Thread lboutros
://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.PositionFilterFactory Ludovic. - Jouve France.

Multiple Words in String

2011-04-02 Thread Chris Fauerbach
Good afternoon everyone! I am stumped, and I would love some help. I'm new to Solr/Lucene, but I have thrown myself into it, so I think I have a solid understanding. Using the analysis tool in the admin interface, I see these words stemmed and processed as I assume they would be, so I'm stuck.