Not that I know of... and I'm no expert on it!!  I know there are at
least two possibilities -- ChineseAnalyzer / CJKAnalyzer (from trunk's
modules/analysis), but I don't know the tradeoffs of each.

Hopefully others will chime in here?

However, once you do figure out a good schema, could you please post
back?  I'd like to add it to Solr's example schema as an example field
type (text_example_zh?).
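
As a possible starting point, here is a rough, untested sketch of what
such a field type might look like in schema.xml.  It assumes the
CJKAnalyzer class is on Solr's classpath, reuses the placeholder name
text_example_zh from above, and turns off autoGeneratePhraseQueries as
suggested in my earlier message quoted below.  Whether CJKAnalyzer is
the right choice for your data is exactly the open question.

```xml
<!-- Untested sketch only; "text_example_zh" is a placeholder name. -->
<fieldType name="text_example_zh" class="solr.TextField"
           positionIncrementGap="100"
           autoGeneratePhraseQueries="false">
  <!-- CJKAnalyzer indexes CJK text as overlapping character bigrams -->
  <analyzer class="org.apache.lucene.analysis.cjk.CJKAnalyzer"/>
</fieldType>
```

Because the same analyzer applies at index and query time here, you
would need to reindex after changing the field type.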

Mike

http://blog.mikemccandless.com

On Sat, May 21, 2011 at 7:20 PM, Andy <angelf...@yahoo.com> wrote:
> Is there any example schema for Chinese that I could use as a guide right now?
>
> Thanks
>
>
> --- On Sat, 5/21/11, Michael McCandless <luc...@mikemccandless.com> wrote:
>
>> From: Michael McCandless <luc...@mikemccandless.com>
>> Subject: Re: chinese SOLR query parser
>> To: solr-user@lucene.apache.org
>> Date: Saturday, May 21, 2011, 6:14 PM
>> Unfortunately, Solr's defaults (example schema) are unusable for
>> non-whitespace languages... see:
>>
>>     http://markmail.org/thread/ww6mhfi3rfpngmc5
>>
>> So it could be you need to turn off autoGeneratePhraseQueries in your
>> fieldType?  We are working towards fixing the example schema (for
>> 3.2/4.0) in https://issues.apache.org/jira/browse/SOLR-2519 ...
>>
>> Also, it could be your web/app server is not using UTF-8 character
>> encoding, e.g. Tomcat defaults to ISO-8859-1 -- see
>> http://wiki.apache.org/tomcat/FAQ/CharacterEncoding
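>>
>> For Tomcat, the usual fix is to set URIEncoding on the HTTP connector
>> in conf/server.xml.  A minimal sketch, assuming a stock connector on
>> port 8080 (the port and timeout values are only illustrative; keep
>> whatever your server.xml already has and add the URIEncoding
>> attribute):
>>
>> ```xml
>> <!-- Illustrative connector; only URIEncoding is the relevant change -->
>> <Connector port="8080" protocol="HTTP/1.1"
>>            connectionTimeout="20000"
>>            redirectPort="8443"
>>            URIEncoding="UTF-8"/>
>> ```
>>
>> Tomcat needs a restart for this to take effect.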
>>
>> Mike
>>
>> http://blog.mikemccandless.com
>>
>> On Sat, May 21, 2011 at 3:30 PM, Pradeep Pujari <prade...@rocketmail.com> wrote:
>> > Hi,
>> >
>> > I changed schema.xml to use CJKAnalyzer. Does anything else need
>> > to change in solrconfig.xml for the query parser component? I ask
>> > because I do not get any results back when searching. It looks like
>> > the Chinese characters are being encoded and so cannot be matched
>> > in the index. Any help is highly appreciated.
>> >
>> > Thanks
>> > Pradeep.
>> >
>>
>
