Re: Unicode case folding

2011-02-21 Thread Avi Rosenschein
Excellent. Thanks, Robert!
-- Avi

On Mon, Feb 21, 2011 at 19:24, Robert Muir wrote:
> On Mon, Feb 21, 2011 at 12:16 PM, Avi Rosenschein wrote:
> > Is there any analyzer that can do full Unicode case folding (for example, as
> > described at
> > http:/

Unicode case folding

2011-02-21 Thread Avi Rosenschein
Is there any analyzer that can do full Unicode case folding (for example, as described at http://www.w3.org/International/wiki/Case_folding#Recommendations_for_Case_Folding )? Specifically, in a German index, I would like the sharp s character (ß) to be normalized into ss, which isn't done by any
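
For readers unfamiliar with the distinction the question draws, here is a minimal sketch of what full Unicode case folding does to the sharp s, compared with simple lowercasing. It uses Python's built-in `str.casefold()` purely as an illustration; it is not a Lucene or Solr analyzer, and the helper names `fold` and `matches_ignoring_case` are hypothetical.

```python
def fold(text: str) -> str:
    """Apply full Unicode case folding (may change string length, e.g. ß -> ss)."""
    return text.casefold()


def matches_ignoring_case(a: str, b: str) -> bool:
    """Compare two strings caselessly using full case folding (hypothetical helper)."""
    return fold(a) == fold(b)


if __name__ == "__main__":
    word = "Straße"
    # Simple lowercasing leaves the sharp s untouched.
    print(word.lower())
    # Full case folding expands the sharp s to "ss".
    print(fold(word))
    # With folding, the uppercase spelling matches the sharp-s spelling.
    print(matches_ignoring_case("STRASSE", word))
```

An index-time filter with this behavior would have to apply the same folding to query terms, since the folded form can differ in length from the original.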

Re: deadlock in solrj?

2010-09-29 Thread Avi Rosenschein
This sounds like https://issues.apache.org/jira/browse/SOLR-1711. It is a known issue in Solr 1.4.0, which is apparently fixed in Solr 1.4.1. We also encountered it when indexing large numbers of documents with SolrJ, and are therefore in the process of upgrading to 1.4.1.
-- Avi

On Wed, Sep 29,

Re: Confused by Solr Ranking

2010-03-09 Thread Avi Rosenschein
> > I kind of suspected stemming to be the reason behind this.
> > But I consider stemming to be a good feature.
>
> This is the side effect of stemming. Stemming increases recall while
> harming precision.

This is a side effect of stemming, the way it is currently implemented in Lucene. Ste