Re: inconsistency between analysis.jsp and actual search

Lukas Kahwe Smith Fri, 12 Feb 2010 02:31:50 -0800

On 12.02.2010, at 11:17, Ahmet Arslan wrote:
> analysis.jsp does not do actual query parsing. just shows produced tokens 
> step by step in analysis (charfilter, tokenizer, tokenfilter) phase.
> "admin/analysis.jsp page will show you how your field is processed while 
> indexing and while querying, and if a particular query matches." [1]
> 
> [1]http://wiki.apache.org/solr/FAQ#My_search_returns_too_many_.2BAC8_too_little_.2BAC8_unexpected_results.2C_how_to_debug.3F



I see, that's good to know. That might even be worth noting on the 
analysis.jsp page itself.

Anyway, how can I get "st.gallen" split into two terms at query time?

    <fieldType name="prefix_token" class="solr.TextField"
               positionIncrementGap="1">
      <analyzer type="index">
        ...
      </analyzer>
      <analyzer type="query">
        <charFilter class="solr.MappingCharFilterFactory"
                    mapping="mapping-ISOLatin1Accent.txt" />
        <tokenizer class="solr.WhitespaceTokenizerFactory" />
        <filter class="solr.LowerCaseFilterFactory" />
        <filter class="solr.WordDelimiterFilterFactory"
                generateWordParts="1" generateNumberParts="1"
                catenateWords="0" catenateNumbers="0" catenateAll="0"
                splitOnCaseChange="1" />
        <filter class="solr.StopFilterFactory" ignoreCase="true"
                words="stopwords.txt" enablePositionIncrements="true" />
      </analyzer>
    </fieldType>
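
For reference, here is a rough Python approximation of what the query 
analyzer above does to a string such as "st.gallen". It is only a sketch 
under stated assumptions, not Solr's implementation: the accent folding 
stands in for the mapping file, the stopword set is a hypothetical 
placeholder for stopwords.txt, and the function names are made up for 
illustration. It models the analysis chain only, not what the query parser 
does with the resulting tokens.

```python
import re
import unicodedata

# Hypothetical stand-in for stopwords.txt; the real file's contents are not shown.
STOPWORDS = {"a", "an", "the"}


def fold_accents(text):
    """Rough stand-in for MappingCharFilterFactory with an accent-mapping file."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def analyze_query(text):
    """Approximate the query-time chain: char filter, tokenizer, token filters."""
    # WhitespaceTokenizerFactory: split on whitespace only.
    tokens = fold_accents(text).split()
    # LowerCaseFilterFactory.
    tokens = [t.lower() for t in tokens]
    # WordDelimiterFilterFactory with generateWordParts=1, generateNumberParts=1
    # and no catenation: keep runs of letters and runs of digits as separate
    # parts, dropping delimiters such as "." (a simplification of the filter).
    parts = []
    for token in tokens:
        parts.extend(re.findall(r"[a-z]+|[0-9]+", token))
    # StopFilterFactory: drop stopwords (already lowercased at this point).
    return [p for p in parts if p not in STOPWORDS]


print(analyze_query("St.Gallen"))
```

Under these assumptions the chain itself yields two tokens for "st.gallen", 
which matches what analysis.jsp displays.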

It seems I should probably use solr.StandardTokenizerFactory anyway, but 
in this case it wouldn't help either.

regards,
Lukas Kahwe Smith
m...@pooteeweet.org
