Re: Multilingual text analysis

2011-06-02 Thread Paul Libbrecht
On 2 June 2011 at 16:27, Juan Antonio Farré Basurte wrote:
> Paul, what do you mean when you say it would make sense to start a page at the Solr website?
I meant the Solr wiki.
> I had just wondered whether it was possible to parametrize the analyzers as a function of one field value.
I

Re: Multilingual text analysis

2011-06-02 Thread Juan Antonio Farré Basurte
Thank you both, Paul and Lee, for your answers. Luckily, in my case there is no problem knowing the language at index time, nor do we really have to worry about the language of the query, as users can specify the language they are interested in. So I guess our solution would be to use different optiona

Re: Multilingual text analysis

2011-06-02 Thread Paul Libbrecht
Juan, an easy way in Solr, I think, is indeed to use different fields at index time and expand on multiple fields at query time. I believe using field-name wildcards allows you to specify a different analyzer per language when doing this. There have been long discussions on the java-u...@lucene.apach
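The per-language field approach described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the base field name `text`, the language list, and all three helper functions are hypothetical and not part of Solr or of any client library. The sketch also assumes the schema maps each `text_<lang>` field to an analyzer for that language, and that the search term is a single token that has already been escaped.

```python
# Hypothetical helpers; none of these names come from Solr or a Solr client.
LANGUAGES = ("en", "fr", "es")  # assumed set of supported languages


def localized_field(base, lang):
    """Return the per-language field name, e.g. 'text_fr'."""
    if lang not in LANGUAGES:
        raise ValueError(f"unsupported language: {lang}")
    return f"{base}_{lang}"


def to_index_document(doc_id, text, lang):
    """Build the document to index, storing the text in its language-specific field."""
    return {"id": doc_id, "language": lang, localized_field("text", lang): text}


def build_query(term, langs=LANGUAGES):
    """Expand one (already escaped) term into an OR query over the chosen fields."""
    return " OR ".join(f"{localized_field('text', lang)}:{term}" for lang in langs)
```

When the user states the language of interest, `build_query(term, [lang])` searches a single field. Otherwise the default expands the term over every per-language field.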

Re: Multilingual text analysis

2011-06-02 Thread lee carroll
Juan, I don't think so. You can try indexing fields like myfield_en, myfield_fr, myfield_xx if you know what language you are dealing with at index and query time. You can also have separate cores for your documents, one per language, if you don't want to complicate your schema. Again, you will need

Multilingual text analysis

2011-06-02 Thread Juan Antonio Farré Basurte
Hello, some of the analyzers that can be applied to a text field depend on the language of the text to analyze and can be configured for a specific language. In my case, the text fields can be in many different languages, but each document also includes a field containing the language