Re: French and SpellingQueryConverter

2009-05-19 Thread Michael Ludwig
Jonathan Mamou schrieb: Thanks Michael for your answer! I think that (?:(?!(\w+:|\d+)))[\p{L}]+ should also be OK. Oh yes, that's much simpler and clearer than my suggestion. (Newbieness factor for Java style regular expressions, too.) Or maybe this:(?:(?!(\w+:|\d+)))[\p{L}\d_]+:-) Mi

Re: French and SpellingQueryConverter

2009-05-19 Thread Jonathan Mamou
Thanks Michael for your answer! I think that (?:(?!(\w+:|\d+)))[\p{L}]+ should also be OK. Jonathan Michael Ludwig

Re: French and SpellingQueryConverter

2009-05-19 Thread Michael Ludwig
Shalin Shekhar Mangar schrieb: On Mon, May 11, 2009 at 2:46 PM, Michael Ludwig wrote: Could you give an example of how the spellcheck.q parameter can be brought into play to (take non-ASCII characters into account, so that "Käse" isn't mishandled) given the following example: You will need t

Re: French and SpellingQueryConverter

2009-05-11 Thread Shalin Shekhar Mangar
On Mon, May 11, 2009 at 2:46 PM, Michael Ludwig wrote: > Could you give an example of how the spellcheck.q parameter can be > brought into play to (take non-ASCII characters into account, so > that "Käse" isn't mishandled) given the following example: > You will need to set the correct tokenizer

Re: French and SpellingQueryConverter

2009-05-11 Thread Michael Ludwig
Shalin Shekhar Mangar schrieb: On Fri, May 8, 2009 at 2:14 AM, Jonathan Mamou wrote: SpellingQueryConverter always splits words with special character. I think that the issue is in SpellingQueryConverter class Pattern.compile.("(?:(?!(\\w+:|\\d+)))\\w+");?: According to http://java.sun.com/ja

Re: French and SpellingQueryConverter

2009-05-08 Thread Shalin Shekhar Mangar
On Fri, May 8, 2009 at 2:14 AM, Jonathan Mamou wrote: > Hi > It does not seem to be related to FrenchStemmer, the stemmer does not split > a word into 2 words. I have checked with other words and > SpellingQueryConverter always splits words with special character. > I think that the issue is in S

Re: French and SpellingQueryConverter

2009-05-07 Thread Jonathan Mamou
Hi It does not seem to be related to FrenchStemmer, the stemmer does not split a word into 2 words. I have checked with other words and SpellingQueryConverter always splits words with special character. I think that the issue is in SpellingQueryConverter class Pattern.compile.("(?:(?!(\\w+:|\\d+)))

Re: French and SpellingQueryConverter

2009-05-07 Thread Jay Hill
It seems to me that this is just the expected behavior of the FrenchAnalyzer using the FrenchStemmer. I'm not familiar with the French language, but in English words like running, runner, and runs are all stemmed down to "run" as intended. I don't know what other words in French would stem down to