I can't explain the results you are seeing, but you also didn't provide us 
with your schema.xml (ie; how are "text" and "text_auto" defined?) or 
enough details ot try and reproduce on a small scale (ie: what does the 
source data look like in the documents where these suggestion values 
are coming from.

If i start up the "bin/solr -e techproducts" example, which is also 
configured to use DocumentDictionaryFactory, I don't see any duplicate 
suggestions...

curl 
'http://localhost:8983/solr/techproducts/suggest?suggest.dictionary=mySuggester&suggest=true&suggest.build=true&wt=json'
{"responseHeader":{"status":0,"QTime":13},"command":"build"}
curl 
'http://localhost:8983/solr/techproducts/suggest?wt=json&indent=true&suggest.dictionary=mySuggester&suggest=true&suggest.q=elec'
{
  "responseHeader":{
    "status":0,
    "QTime":1},
  "suggest":{"mySuggester":{
      "elec":{
        "numFound":3,
        "suggestions":[{
            "term":"electronics and computer1",
            "weight":2199,
            "payload":""},
          {
            "term":"electronics",
            "weight":649,
            "payload":""},
          {
            "term":"electronics and stuff2",
            "weight":279,
            "payload":""}]}}}}

...can you provide us with some precises (and ideally minimal) steps to 
reproduce the problem you are describing?


For Example...

1) "Add XYZ to the 5.5 sample_techproducts_configs solrconfig.xml"
2) "Add ABC to the 5.5 sample_techproducts_configs managed-schema"
3) run this curl command to index a few sample documents...
4) run this curl command to see some suggest results that have duplicates 
in them based on the sample data from step #3


        ?


-Hoss
http://www.lucidworks.com/

Reply via email to