How can I create a good autosuggest list with phrases?

Shawn Heisey Thu, 04 Aug 2011 08:43:21 -0700

I'm at the point in my Solr deployment where I want to start using itfor autosuggest, but I've run into a snag. Because the fields that Iwant to use for autosuggest are tokenized, I can only get single termsout of it. I would like to have it find common phrases that are betweentwo and five words long, so that if someone starts typing "ang" theirautosuggest list will include "Angelina Jolie" as well as possibly "BradPitt and Angelina Jolie."

My index is already quite large, so I do not want to add shingles. Itried to use the clustering component, but that will only give youhalfway decent results if you make the "rows=" parameter absolutely hugeand therefore things run very slowly. Also, it only works againststored fields, so I can only run it against the field where we retrievecaptions, not the full description. It's impractical to get resultsbased on an entire index, much less all seven shards.

I'm OK with offline analysis to generate a list of suggestions, and I'malso OK with doing that analysis against the MySQL data source ratherthan Solr. I just need some pointers about what software and/ortechniques I can use to generate a good list, and then some idea of howto configure Solr to use that list. Can anyone help?


Thanks,
Shawn

How can I create a good autosuggest list with phrases?

Reply via email to