Excellent. Thanks, Robert!
-- Avi
On Mon, Feb 21, 2011 at 19:24, Robert Muir wrote:
> On Mon, Feb 21, 2011 at 12:16 PM, Avi Rosenschein
> wrote:
> > Is there any analyzer that can do full Unicode case folding (for example,
> as
> > described at
> >
> http:/
Is there any analyzer that can do full Unicode case folding (for example, as
described at
http://www.w3.org/International/wiki/Case_folding#Recommendations_for_Case_Folding
)?
Specifically, in a German index, I would like the sharp s character (ß) to
be normalized into ss, which isn't done by any
This sounds like https://issues.apache.org/jira/browse/SOLR-1711. It is a
known issue in Solr 1.4.0, which is apparently fixed in Solr 1.4.1. We also
encountered it when indexing large numbers of documents with SolrJ, and are
therefore in the process of upgrading to 1.4.1.
-- Avi
On Wed, Sep 29,
>
>
> > I kind of suspected stemming to be the reason behind this.
> > But I consider stemming to be a good feature.
>
> This is the side effect of stemming. Stemming increases recall while
> harming precision.
>
This is a side effect of stemming, the way it is currently implemented in
Lucene. Ste