Hi,
  I am using Solr to index some product data and want to use faceted
search. My indexed field (mfg) sometimes contains two words, "Sony
Ericsson" for example. When I facet on mfg, Solr returns "sony" and
"ericsson" as separate entries. Some facets also show up that match
none of my values, "soni" and "berri" for example.

The unique list of mfg values being indexed is as follows:
AT&T
BlackBerry
HTC
LG
Motorola
Nokia
Option
Palm
Pantech
Samsung
Sierra Wireless
Sony Ericsson


The facets being returned are below:
"facet_fields":{
        "mfg":[
         "ericsson",195,
         "soni",156,
         "samsung",155,
         "nokia",90,
         "Ericsson",78,
         "Sony",78,
         "Samsung",62,
         "motorola",55,
         "lg",50,
         "sony",39,
         "Nokia",36,
         "pantech",25,
         "Motorola",22,
         "LG",20,
         "berri",16,
         "black",16,
         "blackberri",16,
         "Pantech",10,
         "BlackBerry",8,
         "blackberry",4,
         "AT",0,
         "HTC",0,
         "Option",0,
         "Palm",0,
         "Sierra",0,
         "T",0,
         "Wireless",0,
         "at",0,
         "att",0,
         "htc",0,
         "option",0,
         "palm",0,
         "sierra",0,
         "t",0,
         "wireless",0]


I have tried defining the field type with the following analyzer
chain:
<fieldType name="mfgTextTight" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.LetterTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
    <filter class="solr.KeepWordFilterFactory" words="manufacturer.txt"/>
  </analyzer>
</fieldType>
 

Any ideas whether it is possible to get back the same facet values
as in the data being indexed, or would I have to write my own Filter
for this purpose?
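
For comparison, here is a minimal sketch of an untokenized variant that
I have not verified against my data. It assumes that Solr's stock
solr.StrField type indexes each value as a single term with no
analysis, so "Sony Ericsson" would stay one facet value. The field name
mfg_facet is hypothetical and is not part of my current schema.

```xml
<!-- Sketch only: mfg_facet is a hypothetical field name. -->

<!-- In the <types> section: StrField is not analyzed, so each value
     is indexed verbatim as one term. -->
<fieldType name="string" class="solr.StrField" sortMissingLast="true"/>

<!-- In the <fields> section: an untokenized copy of mfg, used only
     for faceting. -->
<field name="mfg_facet" type="string" indexed="true" stored="false"/>

<!-- At the schema top level: copy the raw mfg value into the facet
     field at index time. -->
<copyField source="mfg" dest="mfg_facet"/>
```

With something like this, the request would use facet.field=mfg_facet
rather than facet.field=mfg, and the data would need to be reindexed.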

Thanks
Shantanu Deo
AT&T eCommerce Web Hosting - Release Management
Office: (425)288-6081
email: sd1...@att.com
