Re: Unstemming after solr.PorterStemFilterFactory

2010-01-20 Thread Bogdan Vatkov
Thanks! It is good to know I did not do something in vаin :) On Wed, Jan 20, 2010 at 6:54 PM, Erick Erickson wrote: > Ah, OK. I take the "unnecessary" comment back. If you require > the original form of the tokens (not just the original text), then you > do have to do something to preserve them,

Re: Unstemming after solr.PorterStemFilterFactory

2010-01-20 Thread Erick Erickson
Ah, OK. I take the "unnecessary" comment back. If you require the original form of the tokens (not just the original text), then you do have to do something to preserve them, so I think you're on the right track FWIW Erick On Wed, Jan 20, 2010 at 9:38 AM, Bogdan Vatkov wrote: > Hi Eric, > >

Re: Unstemming after solr.PorterStemFilterFactory

2010-01-20 Thread Bogdan Vatkov
Hi Eric, I think I realize that and I am actually using this - I am using the stemmed, cased etc. token from the stored "term vectors" and additionally I am using the field values. But the fields values are different from the tokens in the level of granularity. When I access the term vector for my

Re: Unstemming after solr.PorterStemFilterFactory

2010-01-19 Thread Erick Erickson
This is completely unnecessary. Fields can be both indexed and stored, and the operations are orthogonal. That is, when you specify that a field is indexed, it is run through an analyzer and the *tokens* are indexed, after any stemming, casing, etc. Stored means that the original value, before an

Re: Unstemming after solr.PorterStemFilterFactory

2010-01-19 Thread Bogdan Vatkov
I am using fields like: which contain multi-line text, not just single strings, what does "stored values" mean? I am relatively new to Solr I solved my issue by copy/pasting and enhancing the SnowballPorterFilterFactory class by creating SnowballPorterWithUnstemLowerCaseFilterFactory I added lo

Re: Unstemming after solr.PorterStemFilterFactory

2010-01-19 Thread Otis Gospodnetic
Bogdan, You can get them from stored values of your fields, if you are storing them. Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message > From: Bogdan Vatkov > To: solr-user@lucene.apache.org > Sent: Tue, January 19, 2010 5:28:51 PM > Subject: Unstem