Re: Cyrillic problem

2010-03-02 Thread michaelnazaruk
Thank you very much! but I have problem with url :) If I send request using get method - I get: http://localhost/russian/result.php?search=%EF%F0%E8%E2%B3%F2 I use function (php)urldecode! If I print result, i get "привіт"! But if i send request to solr, my q param = "пїЅпїЅпїЅпїЅпїЅ"! -- View

Re: Cyrillic problem

2010-03-01 Thread Robert Muir
as far as cyrillic goes, any of the analyzers will handle cyrillic characters. so you can just use the "textgen" or whatever in the example schema and everything is ok, standardanalyzer will work too. you don't need to use the RussianAnalyzer, the only special thing it has is awareness of russian

Re: Cyrillic problem

2010-03-01 Thread Erick Erickson
Hmmm, I'm nowhere near an expert on how the analyzers actually work, so I have to punt a bit here. And certainly take any of "the regulars" advice if they give it ... But outside of stemming, Lucene/SOLR really doesn't understand the concept of "language". And that's not even Lucene, it's the stem

Re: Cyrillic problem

2010-03-01 Thread michaelnazaruk
Thank you! And one little question: Can I use RussianAnalyzer for ukrainian characters? -- View this message in context: http://old.nabble.com/Cyrillic-problem-tp27744106p27749323.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Cyrillic problem

2010-03-01 Thread Erick Erickson
Have you tried specifying the RussianAnalyzer in your schema? See: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#Specifying_an_Analyzer_in_the_schema particularly the first point (anythi