thanks Junte Zhang, its really helpful for me
--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
-Original Message-
From: Mugeesh Husain [mailto:muge...@gmail.com]
Sent: Monday, September 11, 2017 3:46 AM
To: solr-user@lucene.apache.org
Subject: Re: multi language search engine in solr
Thank you rick for your response.
The document document have sepearte of the lanaguage instead of
another thing to consider is what users would expect, would english user
search over english docs only ? if yes, the most important task would be to
correctly set up / create accurate per language analyzers, otherwise you
may consider to also adopt machine translation, either on the search
queries
Thank you rick for your response.
The document document have sepearte of the lanaguage instead of mix of
Arabic, English, Bengali, Hindi, Malay.
I coul not find any tokenizer for Malay, can you suggest me if you know
please.
--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.ht
Mugeesh,
One important question: will the typical document have a mix of English and
Bangla and Hindi? If so, you would probably have them all in one collection.
Another thing to think about is the tokenizer. Are all words separated by white
space? If not, then you might need to think about whic
On Aug 25, 2009, at 10:34 AM, Elaine Li wrote:
I am still looking for help on chinese language search. I tried
chinesetokenizerfactory as my analyzer, but it did not help. Only word
with white space, comma and etc around them can be found.
Try using the StandardTokenizerFactory - it handles Ch
Uri,
Thanks a lot! I don't need to do cross language search. So Option 2
sounds better, coz my corpus is very large.
I am still looking for help on chinese language search. I tried
chinesetokenizerfactory as my analyzer, but it did not help. Only word
with white space, comma and etc around them c
I can think of ways to tackle your problem:
Option 1: each document will have a field indicating its language. Then,
when searching, you can simply filter the query on the language you're
searching on. Advantages: everything is in one index, so if in the
future you will need to do a cross lang
Thanks Grant,
The requirement from the user end is to only search in that particular
language and not across languages.
Also going forward we will be adding more languages.
so if i have separate fields for each language ,then we need to change the
schema everytime and that will not scale very we
On Dec 18, 2008, at 6:25 AM, Sujatha Arun wrote:
Hi,
I am prototyping lanuage search using solr 1.3 .I have 3 fields in
the
schema -id,content and language.
I am indexing 3 pdf files ,the languages are foroyo,chinese and
japanese.
I use xpdf to convert the content of pdf to text and pu
On Dec 2, 2008, at 4:52 AM, tushar kapoor wrote:
1. AND
This is the way the query should look, but there's no reason why you
can't let you're users input AND in Russian and then you substitute it
when you create the query.
or rather,
2 .
Now over to solr specific question. In
Option 1 is correct.
On Tue, Dec 2, 2008 at 3:22 PM, tushar kapoor <
[EMAIL PROTECTED]> wrote:
>
> Hi,
>
> Before I start with Solr specific question, there is one thing I need to
> get
> information on.
>
> If I am a Russian user on a Russian Website & I want to search for indexes
> having two R
12 matches
Mail list logo