I a problem searching for one mfg name (out of our 10mm product titles)
and it is indexed in a text type field  having about the same analyzer
settings as the solr example text field definition, and most everything
works fine but we found this one example which I cannot get a direct hit
on.  In the Field Analysis page, It sure looks like it would *have* to
match but sadly during searches it just doesn't.  I can get it to match
by turning off 'split on case change' but that breaks many other
searches like 'appleTV' which need to split on case change to match
'apple tv' in our content!

 

If I search for SterlingTek's <anything> I get zero results.

If I change the casing to Sterlingtek's in my query, I get all the
results.

If I turn off 'split on case change then the first gets results also.

 

See verbose analysis output to see actual filter settings, I put
non-verbose first for easier reading (hope the tables don't get lost
during posting to this group) but the analysis shows complete matchup,
that is what I don't get:

 

Field Analysis

Top of Form

Field                

Field value (Index) 
verbose output  
highlight matches 

SterlingTek's NB-2LH

Field value (Query) 
verbose output 

SterlingTek's NB-2LH

        Bottom of Form

Index Analyzer

SterlingTek's

NB-2LH

 

SterlingTek's

NB-2LH

 

SterlingTek's

NB-2LH

 

Sterling

Tek

NB

2

LH

SterlingTek

 

sterling

tek

nb

2

lh

sterlingtek

 

sterling

tek

nb

2

lh

sterlingtek

 

sterling

tek

nb

2

lh

        
sterlingtek

Note every field is highlighted in the last line above meaning all have
a match, right???

Query Analyzer

SterlingTek's

NB-2LH

 

SterlingTek's

NB-2LH

 

SterlingTek's

NB-2LH

 

Sterling

Tek

NB

2

LH

 

sterling

tek

nb

2

lh

 

sterling

tek

nb

2

lh

 

sterling

tek

nb

2

lh

 

 

VERBOSE OUTPUT FOLLOWS:


Index Analyzer


org.apache.solr.analysis.WhitespaceTokenizerFactory {}

term position

1

2

term text

SterlingTek's

NB-2LH

term type

word

word

source start,end

0,13

14,20

payload

                

org.apache.solr.analysis.SynonymFilterFactory
{synonyms=index_synonyms.txt, expand=true, ignoreCase=true}

term position

1

2

term text

SterlingTek's

NB-2LH

term type

word

word

source start,end

0,13

14,20

payload

                

org.apache.solr.analysis.StopFilterFactory {words=stopwords.txt,
ignoreCase=true}

term position

1

2

term text

SterlingTek's

NB-2LH

term type

word

word

source start,end

0,13

14,20

payload

                

org.apache.solr.analysis.WordDelimiterFilterFactory {preserveOriginal=0,
splitOnCaseChange=1, generateNumberParts=1, catenateWords=1,
generateWordParts=1, catenateAll=0, catenateNumbers=1}

term position

1

2

3

4

5

term text

Sterling

Tek

NB

2

LH

SterlingTek

term type

word

word

word

word

word

word

source start,end

0,8

8,11

14,16

17,18

18,20

0,11

payload

                                        
        

org.apache.solr.analysis.LowerCaseFilterFactory {}

term position

1

2

3

4

5

term text

sterling

tek

nb

2

lh

sterlingtek

term type

word

word

word

word

word

word

source start,end

0,8

8,11

14,16

17,18

18,20

0,11

payload

                                        
        

com.lucidimagination.solrworks.analysis.LucidKStemFilterFactory
{protected=protwords.txt}

term position

1

2

3

4

5

term text

sterling

tek

nb

2

lh

sterlingtek

term type

word

word

word

word

word

word

source start,end

0,8

8,11

14,16

17,18

18,20

0,11

payload

                                        
        

org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory {}

term position

1

2

3

4

5

term text

sterling

tek

nb

2

lh

sterlingtek

term type

word

word

word

word

word

word

source start,end

0,8

8,11

14,16

17,18

18,20

0,11

payload

                                        
        

Query Analyzer


org.apache.solr.analysis.WhitespaceTokenizerFactory {}

term position

1

2

term text

SterlingTek's

NB-2LH

term type

word

word

source start,end

0,13

14,20

payload

                

org.apache.solr.analysis.SynonymFilterFactory
{synonyms=query_synonyms.txt, expand=true, ignoreCase=true}

term position

1

2

term text

SterlingTek's

NB-2LH

term type

word

word

source start,end

0,13

14,20

payload

                

org.apache.solr.analysis.StopFilterFactory {words=stopwords.txt,
ignoreCase=true}

term position

1

2

term text

SterlingTek's

NB-2LH

term type

word

word

source start,end

0,13

14,20

payload

                

org.apache.solr.analysis.WordDelimiterFilterFactory {preserveOriginal=0,
splitOnCaseChange=1, generateNumberParts=1, catenateWords=0,
generateWordParts=1, catenateAll=0, catenateNumbers=0}

term position

1

2

3

4

5

term text

Sterling

Tek

NB

2

LH

term type

word

word

word

word

word

source start,end

0,8

8,11

14,16

17,18

18,20

payload

                                        

org.apache.solr.analysis.LowerCaseFilterFactory {}

term position

1

2

3

4

5

term text

sterling

tek

nb

2

lh

term type

word

word

word

word

word

source start,end

0,8

8,11

14,16

17,18

18,20

payload

                                        

com.lucidimagination.solrworks.analysis.LucidKStemFilterFactory
{protected=protwords.txt}

term position

1

2

3

4

5

term text

sterling

tek

nb

2

lh

term type

word

word

word

word

word

source start,end

0,8

8,11

14,16

17,18

18,20

payload

                                        

org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory {}

term position

1

2

3

4

5

term text

sterling

tek

nb

2

lh

term type

word

word

word

word

word

source start,end

0,8

8,11

14,16

17,18

18,20

payload

                                        

 

Reply via email to