Re: TokenFilter not working at index time

2014-06-26 Thread Erlend Garåsen
I found the root of the problem. This is very strange, but I guess someone can explain to me why this happens. Take a look at the static block in my factory: http://folk.uio.no/erlendfg/solr/NorwegianLemmatizerFilterFactory.java static { ... } If I remove this block and return a stemmed du

Re: TokenFilter not working at index time

2014-06-25 Thread Erlend Garåsen
On 24.06.14 17:33, Erick Erickson wrote: Hmmm. It would help if you posted a couple of other pieces of information. BTW, if this is new code, are you considering donating it back? If so, please open a JIRA so we can track it, see: http://wiki.apache.org/solr/HowToContribute All my other langua

Re: TokenFilter not working at index time

2014-06-24 Thread Dmitry Kan
By quickly looking at it, I think you have unreachable code in the NorwegianLemmatizerFilter class (certainly, attaching & debugging would be your best bet): @Override public boolean incrementToken() throws IOException { if (input.incrementToken()) { if (!keywordAttr.is

Re: TokenFilter not working at index time

2014-06-24 Thread Ahmet Arslan
Hi Erlend, After a quick look, I have implemented a similar TokenFilter that injects several tokens at the same position. Please see the source code of Zemberek2DeasciifyFilter in https://github.com/iorixxx/lucene-solr-analysis-turkish You can insert your line: final String[] values = stemmer.ste
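To illustrate what "injecting several tokens at the same position" means, here is a minimal sketch in plain Java. It is not the code from either filter mentioned in this thread and it does not use the Lucene API: the `Token` class, the `expand` method and the dictionary entries are all hypothetical stand-ins. The only idea carried over is that the first lemma of a word advances the position by 1 and every further lemma gets a position increment of 0, so that all lemmas are stacked on the original word's position.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

public class StackedTokenSketch {

    /** Hypothetical stand-in for an analyzed token: term text plus position increment. */
    public static final class Token {
        public final String term;
        public final int positionIncrement;

        public Token(String term, int positionIncrement) {
            this.term = term;
            this.positionIncrement = positionIncrement;
        }

        @Override
        public String toString() {
            return term + "(+" + positionIncrement + ")";
        }
    }

    /**
     * Replaces each term by its dictionary lemmas. The first lemma keeps
     * increment 1; additional lemmas get increment 0, i.e. the same position.
     * Terms missing from the dictionary pass through unchanged.
     */
    public static List<Token> expand(List<String> terms, Map<String, List<String>> dictionary) {
        List<Token> out = new ArrayList<>();
        for (String term : terms) {
            List<String> lemmas = dictionary.get(term);
            if (lemmas == null || lemmas.isEmpty()) {
                out.add(new Token(term, 1));
                continue;
            }
            int increment = 1;
            for (String lemma : lemmas) {
                out.add(new Token(lemma, increment));
                increment = 0; // later lemmas share the first lemma's position
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up dictionary entry, for illustration only.
        Map<String, List<String>> dictionary = Map.of("bilene", List.of("bil", "bile"));
        System.out.println(expand(List.of("mange", "bilene"), dictionary));
    }
}
```

In an actual Lucene TokenFilter the same effect is normally obtained inside incrementToken() by buffering the extra lemmas and emitting them on subsequent calls with PositionIncrementAttribute set to 0; the sketch above only models the resulting token stream.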

Re: TokenFilter not working at index time

2014-06-24 Thread Erick Erickson
Hmmm. It would help if you posted a couple of other pieces of information. BTW, if this is new code, are you considering donating it back? If so, please open a JIRA so we can track it, see: http://wiki.apache.org/solr/HowToContribute But to your question: First couple of things I'd do: 1> see wha

TokenFilter not working at index time

2014-06-24 Thread Erlend Garåsen
I'm trying to create a Norwegian Lemmatizer based on a dictionary, but for some odd reason I don't get any search results even though the Analyzer in Solr Admin shows that it does the right thing. It works at query time if I have reindexed everything based on another stemmer, e.g. NorwegianM