RE: multi language search engine in solr

2017-09-12 Thread Mugeesh Husain
thanks Junte Zhang, its really helpful for me -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

RE: multi language search engine in solr

2017-09-11 Thread Junte Zhang
-Original Message- From: Mugeesh Husain [mailto:muge...@gmail.com] Sent: Monday, September 11, 2017 3:46 AM To: solr-user@lucene.apache.org Subject: Re: multi language search engine in solr Thank you rick for your response. The document document have sepearte of the lanaguage instead of

Re: multi language search engine in solr

2017-09-11 Thread Tommaso Teofili
another thing to consider is what users would expect, would english user search over english docs only ? if yes, the most important task would be to correctly set up / create accurate per language analyzers, otherwise you may consider to also adopt machine translation, either on the search queries

Re: multi language search engine in solr

2017-09-10 Thread Mugeesh Husain
Thank you rick for your response. The document document have sepearte of the lanaguage instead of mix of Arabic, English, Bengali, Hindi, Malay. I coul not find any tokenizer for Malay, can you suggest me if you know please. -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.ht

Re: multi language search engine in solr

2017-09-10 Thread Rick Leir
Mugeesh, One important question: will the typical document have a mix of English and Bangla and Hindi? If so, you would probably have them all in one collection. Another thing to think about is the tokenizer. Are all words separated by white space? If not, then you might need to think about whic

Re: multi-language search

2009-08-25 Thread Erik Hatcher
On Aug 25, 2009, at 10:34 AM, Elaine Li wrote: I am still looking for help on chinese language search. I tried chinesetokenizerfactory as my analyzer, but it did not help. Only word with white space, comma and etc around them can be found. Try using the StandardTokenizerFactory - it handles Ch

Re: multi-language search

2009-08-25 Thread Elaine Li
Uri, Thanks a lot! I don't need to do cross language search. So Option 2 sounds better, coz my corpus is very large. I am still looking for help on chinese language search. I tried chinesetokenizerfactory as my analyzer, but it did not help. Only word with white space, comma and etc around them c

Re: multi-language search

2009-08-24 Thread Uri Boness
I can think of ways to tackle your problem: Option 1: each document will have a field indicating its language. Then, when searching, you can simply filter the query on the language you're searching on. Advantages: everything is in one index, so if in the future you will need to do a cross lang

Re: Multi language search help

2008-12-19 Thread Sujatha Arun
Thanks Grant, The requirement from the user end is to only search in that particular language and not across languages. Also going forward we will be adding more languages. so if i have separate fields for each language ,then we need to change the schema everytime and that will not scale very we

Re: Multi language search help

2008-12-18 Thread Grant Ingersoll
On Dec 18, 2008, at 6:25 AM, Sujatha Arun wrote: Hi, I am prototyping lanuage search using solr 1.3 .I have 3 fields in the schema -id,content and language. I am indexing 3 pdf files ,the languages are foroyo,chinese and japanese. I use xpdf to convert the content of pdf to text and pu

Re: Multi Language Search

2008-12-04 Thread Grant Ingersoll
On Dec 2, 2008, at 4:52 AM, tushar kapoor wrote: 1. AND This is the way the query should look, but there's no reason why you can't let you're users input AND in Russian and then you substitute it when you create the query. or rather, 2 . Now over to solr specific question. In

Re: Multi Language Search

2008-12-03 Thread Shalin Shekhar Mangar
Option 1 is correct. On Tue, Dec 2, 2008 at 3:22 PM, tushar kapoor < [EMAIL PROTECTED]> wrote: > > Hi, > > Before I start with Solr specific question, there is one thing I need to > get > information on. > > If I am a Russian user on a Russian Website & I want to search for indexes > having two R