My fear is what will then happen with highlighting if I use re-mapping?


On Mon, 6 Feb 2012 03:33:03 -0800 (PST), Ahmet Arslan
<iori...@yahoo.com> wrote:
>> I need to tokenise on whitespace, full-stop, and comma
>> ONLY.
>>
>> Currently using solr.WhitespaceTokenizerFactory with
>> WordDelimiterFilterFactory but this is also splitting on
>> &, /, new-line, etc.
> 
> WDF is customizable via types="wdftypes.txt" parameter. 
> 
> https://svn.apache.org/repos/asf/lucene/dev/trunk/solr/core/src/test-files/solr/conf/wdftypes.txt
> 
> Alternatively you can convert . and , to whitespace (before
> tokenizer) by MappingCharFilterFactory.
> 
> http://lucene.apache.org/solr/api/org/apache/solr/analysis/MappingCharFilterFactory.html

Reply via email to