Re: Solr stemming -> preserve original words

2009-01-23 Thread Thushara Wijeratna
Chris, Ahmet - thanks for the responses. Ahmet - yes, i want to see "run" as a top term + the original words that formed that term The reason is that due to mis-stemming, the terms could become non-english. ex: "permanent" would stem to "perm", "archive" would become "archiv". I need to extract

Re: Solr stemming -> preserve original words

2009-01-23 Thread Thushara Wijeratna
hi Ahmet, thanks. when i look at the non_stemmed_text field to get the top terms, i will not be getting the useful feature of aggregating many related words into one (which is done by stemming). for ex: if a document has run(10), running(20), runner(2), runners(8) - i would like to see a a "top t

Solr stemming -> preserve original words

2009-01-23 Thread Thushara Wijeratna
hello, Is it possible to retrieve the original words once solr (Porter algorithm) stems them? I need to index a bunch of data, store it in solr, and get back a list of most frequent terms out of solr. and i want to see the non-stemmed version of this data. so basically, i want to enhance this: ht