Re: multi-language searching with Solr

2008-05-08 Thread Eli K
If you only have 2 languages this approach might work for you. This is not something I would consider with the number of languages I need to support. Eli On Thu, May 8, 2008 at 5:51 AM, Gereon Steffens <[EMAIL PROTECTED]> wrote: > > > > These are shards of one index and not multiple indexes. Th

Re: multi-language searching with Solr

2008-05-08 Thread Jayson Minard
Using distributed search to solve the language problem is very similar to using multiple fields to solve it. You need different analyzers for each language, and that gives you another way to have just that; trading management of multiple fields for management of multiple cores or shards. In your

Re: multi-language searching with Solr

2008-05-08 Thread Gereon Steffens
These are shards of one index and not multiple indexes. There is probably a way to get each shard to contain one language but then you end up with x servers for x languages, and some will be under utilized while other will be over utilized. Schemas will be identical, except for analysers. The l

Re: multi-language searching with Solr

2008-05-07 Thread Mike Klaas
I don't really see how that would help, no. All the benefits from using separate indices would be gained by using one field per language, ISTM. By the way, there are tools available that make field-per-language stuff much easier, especially if there are many fields. By using dynamic fie

RE: multi-language searching with Solr

2008-05-07 Thread Warner, Andrew
e] 303.271.9100 [Fax] 303.271.1930 [E-Mail] [EMAIL PROTECTED] NYSE: TYL www.tyler-eagle.com -Original Message- From: Eli K [mailto:[EMAIL PROTECTED] Sent: Wednesday, May 07, 2008 2:35 PM To: solr-user@lucene.apache.org Subject: Re: multi-language searching with Solr Gereon, I think that yo

Re: multi-language searching with Solr

2008-05-07 Thread Eli K
Gereon, I think that you must have the same schema on each shard but I am not sure if it must also have the same analyzers. These are shards of one index and not multiple indexes. There is probably a way to get each shard to contain one language but then you end up with x servers for x languages,

Re: multi-language searching with Solr

2008-05-07 Thread Gereon Steffens
I have the same requirement, and from what I understand the distributed search feature will help implementing this, by having one shard per language. Am I right? Gereon Mike Klaas wrote: On 5-May-08, at 1:28 PM, Eli K wrote: Wouldn't this impact both indexing and search performance and the

Re: multi-language searching with Solr

2008-05-06 Thread Mike Klaas
On 5-May-08, at 1:28 PM, Eli K wrote: Wouldn't this impact both indexing and search performance and the size of the index? It is also probable that I will have more then one free text fields later on and with at least 20 languages this approach does not seem very manageable. Are there other opt

RE: multi-language searching with Solr

2008-05-06 Thread Tim Mahy
PROTECTED] Verzonden: dinsdag 6 mei 2008 18:26 Aan: solr-user@lucene.apache.org Onderwerp: Re: multi-language searching with Solr Peter, Thanks for your help, I will prototype your solution and see if it makes sense for me. Eli On Mon, May 5, 2008 at 5:38 PM, Binkley, Peter <[EMAIL PROTEC

Re: multi-language searching with Solr

2008-05-06 Thread Eli K
- > From: Eli K [mailto:[EMAIL PROTECTED] > > Sent: Monday, May 05, 2008 2:28 PM > To: solr-user@lucene.apache.org > > > Subject: Re: multi-language searching with Solr > > Wouldn't this impact both indexing and search performance and the size > of the

Re: multi-language searching with Solr

2008-05-05 Thread Eli K
I searched the Solr list but not as much the Lucene list. I will look again to see if there is something there that might work with Solr. I rather leverage Solr, but if I have no choice I will to do this using Lucene only. Thanks, Eli On Mon, May 5, 2008 at 4:58 PM, Erick Erickson <[EMAIL PROT

RE: multi-language searching with Solr

2008-05-05 Thread Binkley, Peter
al Message- From: Eli K [mailto:[EMAIL PROTECTED] Sent: Monday, May 05, 2008 2:28 PM To: solr-user@lucene.apache.org Subject: Re: multi-language searching with Solr Wouldn't this impact both indexing and search performance and the size of the index? It is also probable that I will have

Re: multi-language searching with Solr

2008-05-05 Thread Erick Erickson
You might want to bounce over to the Lucene user's list and search for language. This topic has arisen many times and there's some good discussion. And have you searched the solr users list of "language"? I know it's turned up here as well. Best Erick On Mon, May 5, 2008 at 4:28 PM, Eli K <[EMAIL

Re: multi-language searching with Solr

2008-05-05 Thread Eli K
Wouldn't this impact both indexing and search performance and the size of the index? It is also probable that I will have more then one free text fields later on and with at least 20 languages this approach does not seem very manageable. Are there other options for making this work with stemming?

RE: multi-language searching with Solr

2008-05-05 Thread Binkley, Peter
I think you would have to declare a separate field for each language (freetext_en, freetext_fr, etc.), each with its own appropriate stemming. Your ingestion process would have to assign the free text content for each document to the appropriate field; so, for each document, only one of the freetex