> I have a "host" field in my documents which keep the host > from which the page > was crawled. for example, yahoo.com, or sports.yahoo.com. I > want this field to > be searchable so if I search yahoo, I can find > sports.yahoo.com. > > I have used these tokenizers and it does not work: > <tokenizer class="solr.StandardTokenizerFactory"/> > <filter class="solr.LowerCaseFilterFactory"/> > <filter > class="solr.RemoveDuplicatesTokenFilterFactory"/> > Now, it seems they do not break the host name at the dots > and does not match > find yahoo in sports.yahoo.com. > What tokenizer should I use so it breaks the host name at > dots?
LetterTokenizerFactory or MappingCharFilterFactory with "."=> " "