Re: Query multiple fields using configured analyzer stack

Tobias Dittrich Mon, 16 Mar 2009 10:00:51 -0700

Hi Steve,

thanks for your quick response. Quoting the string really isnot a good idea in this case. And it does not what I needanyway since the query is converted into a PhraseQuery andtreated differently.

But thanks for pointing me to the FieldQParserPlugin. Yet Iseem not to get it to work properly. I registered it asplugin in my solrconfig.xml like this:

<queryParser name="field"class="org.apache.solr.search.FieldQParserPlugin"/>

But when I send a query I get the following results (solrjdebug output):


rawquerystring -> {!field f=name}blue tooth
querystring -> {!field f=name}blue tooth
parsedquery -> name:blue name:tooth
parsedquery_toString -> name:blue name:tooth

But I'd expect it to be like name:(blue tooth) name:bluename:tooth name:bluetooth


Here is what my schema.xml looks for name:

<!-- normal german text -->

<fieldType name="text_de" class="solr.TextField"positionIncrementGap="100" omitNorms="true">

  <analyzer type="index">
        [..]
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.WhitespaceTokenizerFactory" />

<filter class="solr.WordDelimiterFilterFactory"splitOnCaseChange="1" generateWordParts="1"generateNumberParts="1" catenateWords="1"catenateNumbers="1" catenateAll="1" preserveOriginal="1" />

    <filter class="solr.PositionFilterFactory" />

<filter class="solr.StopFilterFactory"words="stopwords_de.txt" ignoreCase="true" />

    <filter class="solr.LowerCaseFilterFactory" />

<filter class="solr.SnowballPorterFilterFactory"language="German" />

    <filter class="solr.RemoveDuplicatesTokenFilterFactory" />
  </analyzer>
</fieldType>

<field name="name" type="text_de" indexed="true"stored="true" />


Is there anything else I have to configure?

Thanks
Tobi


Steven A Rowe schrieb:

Hi Tobi,

On 3/16/2009 at 9:14 AM, Tobias Dittrich wrote:
how can I query multiple fields in such way that for each of
the fields the configured analyzer stack with Tokenizer is
used for the whole query string?
Lucene's QueryParser (and AFAIK, Solr's QPs too) first break queries on whitespace (except quoted strings), and then sends individual words to be analyzed by the appropriate analyzer.
One way to ensure that an analyzer sees the whole string at once is to enclose 
the query in quotation marks.  This is not ideal.  Another way (that I've never 
used): Solr's FieldQParserPlugin will send the entire string for a field to the 
appropriate analyzer; Chris Hostetter explains here:

<http://www.lucidimagination.com/search/document/ea7b0b27b1b17b1c/re_replacing_fast_functionality_atsesam_no_shinglefilter_exactmatching>
PositionFilter was created to make ShingleFilter work better with query parsing, by making the positions of the generated shingles all be the same, which triggers "synonym" handling - any one of the generated shingles will cause a hit if present in a document:
<http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#head-7563e3bc4d5f7874c4c0ff824671e9ca62f40524>

Steve

Re: Query multiple fields using configured analyzer stack

Reply via email to