Thanks Erik. That helps.

-----Original Message-----
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: Wednesday, November 07, 2007 11:36 AM
To: solr-user@lucene.apache.org
Subject: Re: Analysis / Query problem

On Nov 7, 2007, at 10:26 AM, Wagner,Harry wrote:
> I have the following custom field defined for author names. After
> indexing the 2 documents below the admin analysis tool looks right
> for field-name=au and field-value=Schröder, Jürgen. The highlight
> matching also seems right. However, if I search for au:Schröder,
> Jürgen using the admin tool I do not get any hits (see below).
> This appears to be the case whenever there are 2 non-ascii
> characters in the author name. Searching for au:Schröder, Jurgen
> finds both of these records. Any idea what is causing this?
>
> <response>
> <lst name="responseHeader">
> <int name="status">0</int>
> <int name="QTime">0</int>
> <lst name="params">
> <str name="indent">on</str>
> <str name="start">0</str>
> <str name="q">au:Schröder, Jürgen</str>

One thing to note is that the query "au:Schröder, Jürgen" is being
translated (try &debugQuery=true to see) to:

    au:schröder <AND/OR> <defaultField>:jürgen

AND/OR depends on how you have things configured, as well as the
default field.

You probably want to use the ISOLatin1AccentFilterFactory to have the
diacritics "flattened" to the ASCII character they look like.

Erik
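
For illustration, the ISOLatin1AccentFilterFactory suggestion might look roughly like this in schema.xml. This is only a sketch: the original field definition is not shown in this thread, so the field type name "text_author", the tokenizer choice, and the attribute values are assumptions, not the poster's actual configuration.

```xml
<!-- Hypothetical field type: the name "text_author" and the tokenizer are assumptions -->
<fieldType name="text_author" class="solr.TextField" positionIncrementGap="100">
  <!-- A single <analyzer> is applied at both index time and query time -->
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <!-- Replaces ISO Latin-1 accented characters with unaccented ones, e.g. ö -> o, ü -> u -->
    <filter class="solr.ISOLatin1AccentFilterFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

<!-- The "au" field from the thread, pointed at the hypothetical type above -->
<field name="au" type="text_author" indexed="true" stored="true"/>
```

With a chain like this, "Jürgen" and "Jurgen" should analyze to the same term, but existing documents need to be reindexed before the change takes effect. Separately, to keep both words in the au field rather than letting the second one fall through to the default field, the query can be grouped as au:(Schröder Jürgen) or quoted as a phrase, au:"Schröder, Jürgen".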