Ranking suggestions based on query count would be trivially easy to spam. Have a bot make my preferred queries over and over again, and "boom" they are the most-preferred.
wunder On Sep 20, 2011, at 3:41 PM, Markus Jelsma wrote: > At least, i assumed this is what the user asked for when i read "which counts > requests and sorts suggestions according to this count" > >> No. The spellchecker and suggester only operate on the index (tf*idf) and >> do not account for user generated input which is what the user asks for. >> >> You need to parse the query logs periodically index query strings and >> #occurences in the query logs as a float value (or use ExternalFileField) >> to obtain a popularity rate to sort on. >> >> This new index can then be queried as auto suggest; n-grams are commonly >> used for this. This means both "redlands" and "reckless" are returned for >> the query "re". Sort it and you've got the desired result. >> >> I would not recommend storing user input (the queries) and the actual >> documents in the same index, for many reasons. >> >>> From http://wiki.apache.org/solr/Suggester : >>> >>> spellcheck.onlyMorePopular=true - if this parameter is set to true then >>> the suggestions will be sorted by weight ("popularity") - the count >>> parameter will effectively limit this to a top-N list of best >>> suggestions. If this is set to false then suggestions are sorted >>> alphabetically. >>> >>> -- >>> View this message in context: >>> http://lucene.472066.n3.nabble.com/autocomplete-with-popularity-tp3352755 >>> p 3352919.html Sent from the Solr - User mailing list archive at >>> Nabble.com.