Re: Preparing the ground for a real multilang index

Jan Høydahl Sat, 11 Jul 2009 01:37:18 -0700

Michael, you're of course right, copyfield would copy from source.
The lack of built-in language awareness in Solr is unfortunate :(
I have not tried Lucid's BasisTech lemmatizer implementation, but check
with them whether they can support multi languages in the same field.


--
Jan Høydahl
On 8. juli. 2009, at 16.32, Paul Libbrecht wrote:

Can't the copy field use a different analyzer?
Both for query and indexing?
Otherwise you need to craft your own analyzer which reads thelanguage from the field-name... there's several classes ready forthis.
paul

Le 08-juil.-09 à 02:36, Michael Lackhoff a écrit :
On 08.07.2009 00:50 Jan Høydahl wrote:
itself and do not need to know the query language. You may then want
to do a copyfield from all your text_<lang> -> text for convenientone-
field-to-rule-them-all search.
Would that really help? As I understand it, copyfield takes theraw, not
yet analyzed field value. I cannot see yet the advantage of this
"text"-field over the current situation with no text_<lang> fieldsat all.The copied-to text field has to be language agnostic with nostemming at
all, so it would miss many hits. Or is there a way to combine many
differently stemmed variants into one field to be able to searchagainst
all of them at once? That would be great indeed!

-Michael

Re: Preparing the ground for a real multilang index

Reply via email to