Hi everybody,

I've just started using solr. Not sure if this is too specific of a problem,
but here goes.

My situation is I have a semi long query and then im searching on very short
documents. The basic issue is I want this query to return documents where
every word/token in the document is matched.  I also have a synonyms file.

The way i've been going about this is using dismaxparserplugin's minimum
match and indexing each document's length, and then do max_tokens_document #
of queries, where each query is mm=x, doc_length=x, q=query_string.

Example:
dog, dogs puppy, and canine are synonyms
query="dog dog cat love puppy canine water not"
doc1="cat love dog" matches
doc2="cat hate water" doesn't match
doc3="hot dog contests rule" matches, but i don't want it to.


doc4 happens because dog, dog, puppy, canine all match, so that's four
matches, and doc3's token_length is also four.  Four words in my query are
matching to the same word in the doc, and each of them counts toward the
minimum match count.

Thanks,

Henry

Reply via email to