multi-field suggestions

Michael Sokolov Fri, 18 Apr 2014 07:53:22 -0700

I've been working on getting AnalyzingInfixSuggester to make suggestionsusing tokens drawn from multiple fields. I've done this by copyingtokens from each of those fields into a destination field, and buildingsuggestions using that destination field. This allows me to usedifferent analysis strategies for each of the fields, which I need, butit doesn't address a couple of remaining issues:

1. Some source fields are more important than others, and it would begood to be able to give their tokens greater weight somehow

2. The threshold is applied equally across all tokens, but for somefields we want to suggest singletons (threshold=0), while for others wewant to use the threshold to exclude low-frequency terms.

I looked a little bit at how to extend the whole framework from Solr ondown to handle multiple source fields intrinsically, rather than usingthe copying technique, and it looks like I could possibly managesomething like this by extending DocumentDictionary and plugging in adifferent DictionaryFactory. Does that sound like a good approach? Isthere some better way to approach this problem?


Thanks

-Mike

PS Sorry for the cross-post; I realized after I hit send this wasprobably a better question for solr-user than lucene...

multi-field suggestions

Reply via email to