Short form:
I think you're going down a rabbit-hole and should just
use synonyms and forget about it.
I'm particularly thinking that a general-purpose solution
that somehow breaks up or combines adjacent tokens
will have consequences that pop out other places that
you don't want and you'll have to
It's not a specific case only ( e.g. microsoft.com), but it's really a
multi word issue.
carwash, bookkeeper etc...
I'm ultimately looking for a schema for search and retrieve that's heavily
focused on 'names'.. these are peoples names, business names etc.. not
content like large text fields,
Is this a general question or specific? You can handle specific ones by
using synonyms.
But the general case, that is treating any two pairs of tokens as
a single pair seems fraught with unintended consequences, but
you know your problem space better than I do.
Best
Erick
On Sat, Apr 2, 2011 at
://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.PositionFilterFactory
Ludovic.
-
Jouve
France.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Multiple-Words-in-String-tp2767964p2770713.html
Sent from the Solr - User mailing list archive at Nabble.com.
Good afternoon everyone!
I am stumped, and I would love some help.I'm new to solr/lucene,
but I have thrown myself into it, so I think I have a solid
understanding. Using the analysis tool in the admin interface, I see
these words stemmed and processed as I assume they would be, so I'm
stuck.