Using XSLT with DIH for a URLDataSource

2010-02-21 Thread Roland Villemoes
Hi, I have to load data for Solr from a UrlDataSource supplying me with a XML feed. In the simple case where I just do simple XSLT select this works just fine. Just as shown on the wiki (http://wiki.apache.org/solr/DataImportHandler) But I need to do some manipulation of the XML feed first, So

Sorting by a function that depends on the current result set

2010-02-21 Thread Daniel Dourvaris
When sorting by (an integer) price field I need prices under 1 standard deviation from the mean of the current result set to be pushed to the end of the list. For example with these values: 0, 20, 40, 100, 2000, 2000, 2000, 2000, 2000, 3000, 3000, 3000, 3000, 3000, 4000, 5000, 5000, 9000 Mean ~

Re: Why ASCIIFoldingFilter is not a CharFilter

2010-02-21 Thread Robert Muir
right, most stemmers expect the diacritics to be in their input to work correctly, too. On Sun, Feb 21, 2010 at 5:19 PM, Erik Hatcher wrote: > won't some stemmers leave diacritics in the terms that ought to be removed > before indexing? > > > > On Feb 21, 2010, at 4:57 PM, Shalin Shekhar Mangar w

Re: Why ASCIIFoldingFilter is not a CharFilter

2010-02-21 Thread Erik Hatcher
won't some stemmers leave diacritics in the terms that ought to be removed before indexing? On Feb 21, 2010, at 4:57 PM, Shalin Shekhar Mangar wrote: Hello, Looking over the CharFilter franchise, it seems to me that the ASCIIFoldingFilter is a perfect candidate for being a CharFilter as it

Why ASCIIFoldingFilter is not a CharFilter

2010-02-21 Thread Shalin Shekhar Mangar
Hello, Looking over the CharFilter franchise, it seems to me that the ASCIIFoldingFilter is a perfect candidate for being a CharFilter as it performs character level substitutions like MappingCharFilter. However it is not a CharFilter. Is there a reason why? -- Regards, Shalin Shekhar Mangar.

Re: optimize is taking too much time

2010-02-21 Thread David Smiley @MITRE.org
I've always thought that these two events were effectively equivalent. -- the results of an optimize vs the results of Lucene _naturally_ merging all segments together into one. If they don't have the safe effect then what is the difference? ~ David Smiley Otis Gospodnetic wrote: > > Hello,