Hmmm, that is confusing. the stemEnglishPossessive=0
actually leaves the 's' in the index, just not attached to the
word. The admin/analysis page can help show this....

Setting it equal to 1 removes it entirely from the stream.

If you set catenateWords=1, you'll get "mcdonalds" in
your index if stemEnglishPosessive=0 but not if you
set stemEnglishPosessive=1.

Hope that helps
Erick

On Fri, Jun 10, 2011 at 3:51 AM, roySolr <royrutten1...@gmail.com> wrote:
> Hello,
>
> I have some problem with the wordDelimiter. My data looks like this:
>
> mcdonald's#burgerking#Free record shop#h&m
>
> I want to tokenize this on #. After that it has to split on whitespace. I
> use the
> wordDelimiter for that(can't use 2 tokenizers)
>
> Now this works but there is one problem, it removes the '. My index looks
> like this:
>
> mcdonald
> burgerking
> free
> record
> shop
> h&m
>
> I don't want this so i use the stemEnglishPossessive. The description from
> this part of the filter looks like this:
>
> stemEnglishPossessive="1" causes trailing "'s" to be removed for each
> subword.
>    "Doug's" => "Doug"
>    default is true ("1"); set to 0 to turn off
>
> My Field looks like this:
>
> <fieldType name="Test_field" class="solr.TextField">
>  <analyzer>
>    <charFilter class="solr.HTMLStripCharFilterFactory"/>
>    <tokenizer class="solr.PatternTokenizerFactory" pattern="#" />
>    <filter class="solr.WordDelimiterFilterFactory"
>                        splitOnCaseChange="0"
>                        splitOnNumerics="0"
>                        stemEnglishPossessive="0"
>                        catenateWords="0"
>     />
>  </analyzer>
> </fieldType>
>
> It looks like the stemEnglishPossessive=0 is not working. How can i fix this
> problem? Other filter? Did i forget something?
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/WordDelimiter-and-stemEnglishPossessive-doesn-t-work-tp3047678p3047678.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>

Reply via email to