> I have a "host" field in my documents which keep the host
> from which the page 
> was crawled. for example, yahoo.com, or sports.yahoo.com. I
> want this field to 
> be searchable so if I search yahoo, I can find
> sports.yahoo.com. 
> 
> I have used these tokenizers and it does not work:
> <tokenizer class="solr.StandardTokenizerFactory"/>
> <filter class="solr.LowerCaseFilterFactory"/>
> <filter
> class="solr.RemoveDuplicatesTokenFilterFactory"/>
> Now, it seems they do not break the host name at the dots
> and does not match 
> find yahoo in sports.yahoo.com.
> What tokenizer should I use so it breaks the host name at
> dots?

LetterTokenizerFactory or MappingCharFilterFactory with "."=> " "


      

Reply via email to