Re: Multi language, one "body" field, multi stopwords ?

Otis Gospodnetic Thu, 24 Apr 2008 15:10:00 -0700

Multilingual indexing and searching is tricky, but I think all of them require 
that you know the language of a query one way or the other.  Without that, you 
won't know how to correctly analyze the query.  You didn't mention that, so I'm 
bringing it up.


To answer your question, comment for copyField from example schema.xml:

"...It's used either to index the same field differently,...."

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
From: Tim Mahy <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Wednesday, April 23, 2008 4:51:55 AM
Subject: Multi language, one "body" field, multi stopwords ?

Multi language, one "body" field, multi stopwords ? 
Hi all,

we are in the situation that we want to store documents from x number of 
languages but in the query we want to query the same field,
but at indexing time we want different stopwords text file to be used for the 
language of the uploaded document.

I thought perhaps creating a body field per language and use the copyField to 
copy the contents to one body field, is this possible because I would set 
different settings for the type per "body language field" so that I can set 
different stop word files to be used. This only works I think if the copyField 
does the copy after the indexing, is this the case ?

Greetings,
Tim Op dit e-mail bericht is de
disclaimer van Info Support van toepassing, zie
http://www.infosupport.nl/disclaimer

Re: Multi language, one "body" field, multi stopwords ?

Reply via email to