TH all six cases and 3
genders.
Dennis Gearon
Signature Warning
EARTH has a Right To Life,
otherwise we all die.
Read 'Hot, Flat, and Crowded'
Laugh at http://www.yert.com/film.php
--- On Tue, 7/27/10, Robert Muir wrote:
> From: Robert Muir
> Subject: Re:
right, but your problem is this is the current output:
Ковров -> Ковр
Коврову -> Ковров
Ковровом -> Ковров
Коврове -> Ковров
so, if Ковров was simply left alone, all your forms would match...
2010/7/27 Oleg Burlaca
> Thanks Robert for all your help,
>
> The idea of ы[A-Z].* stopwords is ideal
Thanks Robert for all your help,
The idea of ы[A-Z].* stopwords is ideal for the english language,
although in russian nouns are inflected: Борис, Борису, Бориса, Борисом
I'll try the RussianLightStemFilterFactory (the article in the PDF mentioned
it's more accurate).
Once again thanks,
Oleg Bur
2010/7/27 Oleg Burlaca
> Actually the situation with Немцов из ок,
> I've just checked how Yandex works with Немцов and Немцова:
> http://nano.yandex.ru/project/inflect/
>
> I think there are two solutions:
> a) manually search for both Немцов and then Немцова
> b) use wildcard query: Немцов*
>
Actually the situation with Немцов из ок,
I've just checked how Yandex works with Немцов and Немцова:
http://nano.yandex.ru/project/inflect/
I think there are two solutions:
a) manually search for both Немцов and then Немцова
b) use wildcard query: Немцов*
Robert, thanks for the RussianLightStemF
A similar word is Немцов.
The strange thing is that searching for "Немцова" will not find documents
containing "Немцов"
Немцова: 14 articles
http://www.sova-center.ru/search/?lg=1&q=%D0%BD%D0%B5%D0%BC%D1%86%D0%BE%D0%B2%D0%B0
Немцов: 74 articles
http://www.sova-center.ru/search/?lg=1&q=%D0%BD%D0%B
Yes, I'm sure I've enabled SnowballPorterFilterFactory both at Index and
Query time, because the search works ok,
except names and geo locations.
I've noticed that searching by
Коврова
also shows documents that contain Коврову, Коврове
Search by Ковров, 7 results:
http://www.sova-center.ru/searc
another look, your problem is ковров itself... its mapped to ковр
a workaround might be to use the protected words functionality to
keep ковров and any other problematic people/geo names as-is.
separately, in trunk there is an alternative russian stemmer
(RussianLightStemFilterFactory), which mig
All of your examples stem to "ковров":
assertAnalyzesTo(a, "Коврова Коврову Ковровом Коврове",
new String[] { "ковров", "ковров", "ковров", "ковров" });
}
Are you sure you enabled this at *both* index and query time?
2010/7/27 Oleg Burlaca
> Hello,
>
> I'm using SnowballPorter