Hello,

I have some problem with the wordDelimiter. My data looks like this:

mcdonald's#burgerking#Free record shop#h&m

I want to tokenize this on #. After that it has to split on whitespace. I
use the
wordDelimiter for that(can't use 2 tokenizers)

Now this works but there is one problem, it removes the '. My index looks
like this:

mcdonald
burgerking
free
record
shop
h&m

I don't want this so i use the stemEnglishPossessive. The description from
this part of the filter looks like this:

stemEnglishPossessive="1" causes trailing "'s" to be removed for each
subword.
    "Doug's" => "Doug"
    default is true ("1"); set to 0 to turn off 

My Field looks like this:

<fieldType name="Test_field" class="solr.TextField">
  <analyzer>
    <charFilter class="solr.HTMLStripCharFilterFactory"/>
    <tokenizer class="solr.PatternTokenizerFactory" pattern="#" />
    <filter class="solr.WordDelimiterFilterFactory" 
                        splitOnCaseChange="0" 
                        splitOnNumerics="0"
                        stemEnglishPossessive="0"
                        catenateWords="0"
     />
  </analyzer>
</fieldType>

It looks like the stemEnglishPossessive=0 is not working. How can i fix this
problem? Other filter? Did i forget something?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/WordDelimiter-and-stemEnglishPossessive-doesn-t-work-tp3047678p3047678.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to